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Geometry of chaos 
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2 



We start out with a recapitulation of the basic notions of dynamics. Our aim is 
narrow; we keep the exposition focused on prerequisites to the applications to 
be developed in this text. We assume that the reader is familiar with dynamics 
on the level of the introductory texts mentioned in remark 1.1, and concentrate here on 
developing intuition about what a dynamical system can do. It will be a coarse brush 
sketch-a full description of all possible behaviors of dynamical systems is beyond human 
ken. While for a novice there is no shortcut through this lengthy detour, a sophisticated 
traveler might bravely skip this well-trodden territory and embark upon the journey at 
chapter 15. 

The fate has handed you a flow. What are you to do about it? 

1. Define your dynamical system (At,/): the space of its possible states M, and the 
law f of their evolution in time. 

2. Pin it down locally-is there anything about it that is stationary? Try to determine its 
equilibria /fixed points (Chapter 2). 

3. Slice it, represent as a map from a section to a section (Chapter 3). 

4. Explore the neighborhood by linearizing the flow-check the linear stability of its 
equilibria / fixed points, their stability eigen-directions (Chapter 4). 

5. Go global: train by partitioning the state space of 1-dimensional maps. Label the 
regions by symbolic dynamics (Chapter 11). 

6. Now venture global distances across the system by continuing eigenvectors into 
stable I unstable manifolds. Their intersections partition the state space in a dy- 
namically invariant way (Chapter 12). 

7. Guided by this topological partition, compute a set of periodic orbits up to a given 
topological length (Chapter 13). 

Along the way you might want to learn about dynamical invariants (chapter5), Lyapunov 
exponents (chapter 6), classical mechanics (chapter 7), billiards (chapter 8), and discrete 
(chapter 9) and continuous (chapter 10) symmetries of dynamics. 
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Chapter 1 



Overture 



If I have seen less far than other men it is because I have 
stood behind giants. 

— Edoardo Specchio 



Rereading classic theoretical physics textbooks leaves a sense that there are 
holes large enough to steam a Eurostar train through them. Here we learn 
about harmonic oscillators and Keplerian ellipses - but where is the chap- 
ter on chaotic oscillators, the tumbling Hyperion? We have just quantized hydro- 
gen, where is the chapter on the classical 3-body problem and its implications for 
quantization of helium? We have learned that an instanton is a solution of field- 
theoretic equations of motion, but shouldn't a strongly nonlinear field theory have 
turbulent solutions? How are we to think about systems where things fall apart; 
the center cannot hold; every trajectory is unstable? 

This chapter offers a quick survey of the main topics covered in the book. 
Throughout the book 



indicates that the section is on a pedestrian level - you are expected to 
know/learn this material 



<*0^£j\ indicates that the section is on a somewhat advanced, cyclist level 



indicates that the section requires a hearty stomach and is probably best 
skipped on first reading 

fast track points you where to skip to 



tells you where to go for more depth on a particular topic 
[exercise 1 .2] on margin links to an exercise that might clarify a point in the text 



3 



CHAPTER 1. OVERTURE 



4 



indicates that a figure is still missing-you are urged to fetch it 

We start out by making promises-we will right wrongs, no longer shall you suffer 
the slings and arrows of outrageous Science of Perplexity. We relegate a historical 
overview of the development of chaotic dynamics to appendixA, and head straight 
to the starting line: A pinball game is used to motivate and illustrate most of the 
concepts to be developed in ChaosBook. 

This is a textbook, not a research monograph, and you should be able to follow 
the thread of the argument without constant excursions to sources. Hence there are 
no literature references in the text proper, all learned remarks and bibliographical 
pointers are relegated to the "Commentary" section at the end of each chapter. 



1.1 Why ChaosBook? 

It seems sometimes that through a preoccupation with sci- 
ence, we acquire a firmer hold over the vicissitudes of life 
and meet them with greater calm, but in reality we have 
done no more than to find a way to escape from our sor- 
rows. 

— Hermann Minkowski in a letter to David Hilbert 

The problem has been with us since Newton's first frustrating (and unsuccessful) 
crack at the 3-body problem, lunar dynamics. Nature is rich in systems governed 
by simple deterministic laws whose asymptotic dynamics are complex beyond 
belief, systems which are locally unstable (almost) everywhere but globally recur- 
rent. How do we describe their long term dynamics? 

The answer turns out to be that we have to evaluate a determinant, take a 
logarithm. It would hardly merit a learned treatise, were it not for the fact that this 
determinant that we are to compute is fashioned out of infinitely many infinitely 
small pieces. The feel is of statistical mechanics, and that is how the problem 
was solved; in the 1960's the pieces were counted, and in the 1970's they were 
weighted and assembled in a fashion that in beauty and in depth ranks along with 
thermodynamics, partition functions and path integrals amongst the crown jewels 
of theoretical physics. 

This book is not a book about periodic orbits. The red thread throughout the 
text is the duality between the local, topological, short-time dynamically invariant 
compact sets (equilibria, periodic orbits, partially hyperbolic invariant tori) and 
the global long-time evolution of densities of trajectories. Chaotic dynamics is 
generated by the interplay of locally unstable motions, and the interweaving of 
their global stable and unstable manifolds. These features are robust and acces- 
sible in systems as noisy as slices of rat brains. Poincare, the first to understand 
deterministic chaos, already said as much (modulo rat brains). Once this topology 
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is understood, a powerful theory yields the observable consequences of chaotic 
dynamics, such as atomic spectra, transport coefficients, gas pressures. 

That is what we will focus on in ChaosBook. The book is a self-contained 
graduate textbook on classical and quantum chaos. Your professor does not know 
this material, so you are on your own. We will teach you how to evaluate a deter- 
minant, take a logarithm-stuff like that. Ideally, this should take 100 pages or so. 
Well, we fail-so far we have not found a way to traverse this material in less than 
a semester, or 200-300 page subset of this text. Nothing to be done. 



1.2 Chaos ahead 

Things fall apart; the centte cannot hold. 
— W.B. Yeats: The Second Coming 

The study of chaotic dynamics is no recent fashion. It did not start with the 
widespread use of the personal computer. Chaotic systems have been studied for 
over 200 years. During this time many have contributed, and the field followed no 
single line of development; rather one sees many interwoven strands of progress. 

In retrospect many triumphs of both classical and quantum physics were a 
stroke of luck: a few integrable problems, such as the harmonic oscillator and 
the Kepler problem, though 'non-generic,' have gotten us very far. The success 
has lulled us into a habit of expecting simple solutions to simple equations-an 
expectation tempered by our recently acquired ability to numerically scan the state 
space of non-integrable dynamical systems. The initial impression might be that 
all of our analytic tools have failed us, and that the chaotic systems are amenable 
only to numerical and statistical investigations. Nevertheless, a beautiful theory 
of deterministic chaos, of predictive quality comparable to that of the traditional 
perturbation expansions for nearly integrable systems, already exists. 

In the traditional approach the integrable motions are used as zeroth-order ap- 
proximations to physical systems, and weak nonlinearities are then accounted for 
perturbatively. For strongly nonlinear, non-integrable systems such expansions 
fail completely; at asymptotic times the dynamics exhibits amazingly rich struc- 
ture which is not at all apparent in the integrable approximations. However, hidden 
in this apparent chaos is a rigid skeleton, a self-similar tree of cycles (periodic or- 
bits) of increasing lengths. The insight of the modern dynamical systems theory 
is that the zeroth-order approximations to the harshly chaotic dynamics should be 
very different from those for the nearly integrable systems: a good starting ap- 
proximation here is the stretching and folding of baker's dough, rather than the 
periodic motion of a harmonic oscillator. 

So, what is chaos, and what is to be done about it? To get some feeling for how 
and why unstable cycles come about, we start by playing a game of pinball. The 
reminder of the chapter is a quick tour through the material covered in ChaosBook. 
Do not worry if you do not understand every detail at the first reading-the intention 
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Figure 1.1: A physicist's bare bones game of pinball. 




is to give you a feeling for the main themes of the book. Details will be tilled out 

later. If you want to get a particular point clarified right now, [section 1.41 on the 

. ... section 1 .4 

margin points at the appropriate section. 



1.3 The future as in a mirror 

All you need to know about chaos is contained in the intro- 
duction of [ChaosBook]. However, in order to understand 
the introduction you will first have to read the rest of the 
book. 

— Gary Morriss 

That deterministic dynamics leads to chaos is no surprise to anyone who has tried 
pool, billiards or snooker-the game is about beating chaos-so we start our story 
about what chaos is, and what to do about it, with a game of pinball. This might 
seem a trifle, but the game of pinball is to chaotic dynamics what a pendulum is 
to integrable systems: thinking clearly about what 'chaos' in a game of pinball 
is will help us tackle more difficult problems, such as computing the diffusion 
constant of a deterministic gas, the drag coefficient of a turbulent boundary layer, 
or the helium spectrum. 

We all have an intuitive feeling for what a ball does as it bounces among the 
pinball machine's disks, and only high-school level Euclidean geometry is needed 
to describe its trajectory. A physicist's pinball game is the game of pinball strip- 
ped to its bare essentials: three equidistantly placed reflecting disks in a plane, 
figure 1.1. A physicist's pinball is free, frictionless, point-like, spin-less, perfectly 
elastic, and noiseless. Point-like pinballs are shot at the disks from random starting 
positions and angles; they spend some time bouncing between the disks and then 
escape. 

At the beginning of the 18th century Baron Gottfried Wilhelm Leibniz was 
confident that given the initial conditions one knew everything a deterministic 
system would do far into the future. He wrote [2], anticipating by a century and 
a half the oft-quoted Laplace's "Given for one instant an intelligence which could 
comprehend all the forces by which nature is animated...": 

That everything is brought forth through an established destiny is just 
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Figure 1.2: Sensitivity to initial conditions: two pin- 
balls that start out very close to each other separate ex- 
ponentially with time. 



23132321 \ 




2313 



as certain as that three times three is nine. [. . . ] If, for example, one sphere 
meets another sphere in free space and if their sizes and their paths and 
directions before collision are known, we can then foretell and calculate 
how they will rebound and what course they will take after the impact. Very 
simple laws are followed which also apply, no matter how many spheres 
are taken or whether objects are taken other than spheres. From this one 
sees then that everything proceeds mathematically-that is, infallibly-in the 
whole wide world, so that if someone could have a sufficient insight into 
the inner parts of things, and in addition had remembrance and intelligence 
enough to consider all the circumstances and to take them into account, he 
would be a prophet and would see the future in the present as in a mirror. 

Leibniz chose to illustrate his faith in determinism precisely with the type of phys- 
ical system that we shall use here as a paradigm of 'chaos.' His claim is wrong in a 
deep and subtle way: a state of a physical system can never be specified to infinite 
precision, and by this we do not mean that eventually the Heisenberg uncertainty 
principle kicks in. In the classical, deterministic dynamics there is no way to take 
all the circumstances into account, and a single trajectory cannot be tracked, only 
a ball of nearby initial points makes physical sense. 



A deterministic system is a system whose present state is in principle fully deter- 
mined by its initial conditions. 

In contrast, radioactive decay, Brownian motion and heat flow are examples 
of stochastic systems, for which the initial conditions determine the future only 
partially, due to noise, or other external circumstances beyond our control: the 
present state reflects the past initial conditions plus the particular realization of 
the noise encountered along the way. 

A deterministic system with sufficiently complicated dynamics can fool us 
into regarding it as a stochastic one; disentangling the deterministic from the 



1.3.1 What is 'chaos'? 



I accept chaos. I am not sure that it accepts me. 
— Bob Dylan, Bringing It All Back Home 



intro - 9apr2009 



ChaosBook.org versionl4.4. 1, Apr21 2013 



CHAPTER 1. OVERTURE 



8 



Figure 1.3: Unstable trajectories separate with time. 




stochastic is the main challenge in many real-life settings, from stock markets 
to palpitations of chicken hearts. So, what is 'chaos'? 

In a game of pinball, any two trajectories that start out very close to each other 
separate exponentially with time, and in a finite (and in practice, a very small) 
number of bounces their separation 6x(t) attains the magnitude of L, the charac- 
teristic linear extent of the whole system, figure 1.2. This property of sensitivity 
to initial conditions can be quantified as 

\6x(t)\ * e M \6x(0)\ 



where A, the mean rate of separation of trajectories of the system, is called the 
Lyapunov exponent. For any finite accuracy 5x = \6x(0)\ of the initial data, the 
dynamics is predictable only up to a finite Lyapunov time 



7Lyap*-^ln 



despite the deterministic and, for Baron Leibniz, infallible simple laws that rule 
the pinball motion. 

A positive Lyapunov exponent does not in itself lead to chaos. One could try 
to play 1- or 2-disk pinball game, but it would not be much of a game; trajecto- 
ries would only separate, never to meet again. What is also needed is mixing, the 
coming together again and again of trajectories. While locally the nearby trajec- 
tories separate, the interesting dynamics is confined to a globally finite region of 
the state space and thus the separated trajectories are necessarily folded back and 
can re-approach each other arbitrarily closely, infinitely many times. For the case 
at hand there are 2" topologically distinct n bounce trajectories that originate from 
a given disk. More generally, the number of distinct trajectories with n bounces 
can be quantified as 



N(n) * e hn 



chapter 6 



section 15.1 



where h, the growth rate of the number of topologically distinct trajectories, is 
called the "topological entropy" (h = In 2 in the case at hand). 

The appellation 'chaos' is a confusing misnomer, as in deterministic dynam- 
ics there is no chaos in the everyday sense of the word; everything proceeds 
mathematically-that is, as Baron Leibniz would have it, infallibly. When a physi- 
cist says that a certain system exhibits 'chaos,' he means that the system obeys 
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Figure 1.4: Dynamics of a chaotic dynamical sys- 
tem is (a) everywhere locally unstable (positive 
Lyapunov exponent) and (b) globally mixing (pos- 
itive entropy). (A. Johansen) 




deterministic laws of evolution, but that the outcome is highly sensitive to small 
uncertainties in the specification of the initial state. The word 'chaos' has in this 
context taken on a narrow technical meaning. If a deterministic system is locally 
unstable (positive Lyapunov exponent) and globally mixing (positive entropy )- 
figure 1.4— it is said to be chaotic. 

While mathematically correct, the definition of chaos as 'positive Lyapunov 
+ positive entropy' is useless in practice, as a measurement of these quantities is 
intrinsically asymptotic and beyond reach for systems observed in nature. More 
powerful is Poincare's vision of chaos as the interplay of local instability (unsta- 
ble periodic orbits) and global mixing (intertwining of their stable and unstable 
manifolds). In a chaotic system any open ball of initial conditions, no matter how 
small, will in finite time overlap with any other finite region and in this sense 
spread over the extent of the entire asymptotically accessible state space. Once 
this is grasped, the focus of theory shifts from attempting to predict individual 
trajectories (which is impossible) to a description of the geometry of the space 
of possible outcomes, and evaluation of averages over this space. How this is 
accomplished is what ChaosBook is about. 

A definition of 'turbulence' is even harder to come by. Can you recognize 
turbulence when you see it? The word comes from 'tourbillon,' French for 'vor- 
tex,' and intuitively it refers to irregular behavior of spatially extended system 
described by deterministic equations of motion-say, a bucket of sloshing water 
described by the Navier-Stokes equations. But in practice the word 'turbulence' 
tends to refer to messy dynamics which we understand poorly. As soon as a 
phenomenon is understood better, it is reclaimed and renamed: 'a route to chaos', 
'spatiotemporal chaos', and so on. 

In ChaosBook we shall develop a theory of chaotic dynamics for low dimens- 
ional attractors visualized as a succession of nearly periodic but unstable motions. 
In the same spirit, we shall think of turbulence in spatially extended systems in 
terms of recurrent spatiotemporal patterns. Pictorially, dynamics drives a given 
spatially extended system (clouds, say) through a repertoire of unstable patterns; 
as we watch a turbulent system evolve, every so often we catch a glimpse of a 
familiar pattern: 
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For any finite spatial resolution, a deterministic flow follows approximately for a 
finite time an unstable pattern belonging to a finite alphabet of admissible patterns, 
and the long term dynamics can be thought of as a walk through the space of such 
patterns. In ChaosBook we recast this image into mathematics. 

1.3.2 When does 'chaos' matter? 

In dismissing Pollock's fractals because of their limited 
magnification range, Jones-Smith and Mathur would also 
dismiss half the published investigations of physical frac- 
tals. 

— Richard P. Taylor [4, 5] 

When should we be mindful of chaos? The solar system is 'chaotic', yet we 
have no trouble keeping track of the annual motions of planets. The rule of thumb 
is this; if the Lyapunov time (l.l)-the time by which a state space region initially 
comparable in size to the observational accuracy extends across the entire acces- 
sible state space-is significantly shorter than the observational time, you need to 
master the theory that will be developed here. That is why the main successes of 
the theory are in statistical mechanics, quantum mechanics, and questions of long 
term stability in celestial mechanics. 

In science popularizations too much has been made of the impact of 'chaos 
theory,' so a number of caveats are already needed at this point. 

At present the theory that will be developed here is in practice applicable only 
to systems of a low intrinsic dimension - the minimum number of coordinates nec- 
essary to capture its essential dynamics. If the system is very turbulent (a descrip- 
tion of its long time dynamics requires a space of high intrinsic dimension) we are 
out of luck. Hence insights that the theory offers in elucidating problems of fully 
developed turbulence, quantum field theory of strong interactions and early cos- 
mology have been modest at best. Even that is a caveat with qualifications. There 
are applications-such as spatially extended (non-equilibrium) systems, plumber's 
turbulent pipes, etc., -where the few important degrees of freedom can be isolated 
and studied profitably by methods to be described here. 

Thus far the theory has had limited practical success when applied to the very 
noisy systems so important in the life sciences and in economics. Even though 
we are often interested in phenomena taking place on time scales much longer 
than the intrinsic time scale (neuronal inter-burst intervals, cardiac pulses, etc.), 
disentangling 'chaotic' motions from the environmental noise has been very hard. 

In 1980's something happened that might be without parallel; this is an area 
of science where the advent of cheap computation had actually subtracted from 
our collective understanding. The computer pictures and numerical plots of frac- 
tal science of the 1980's have overshadowed the deep insights of the 1970's, and 
these pictures have since migrated into textbooks. By a regrettable oversight, 



intro - 9apr2009 



ChaosBook.org versionl4.4. 1, Apr21 2013 



CHAPTER 1. OVERTURE 



11 



Figure 1.5: Katherine Jones-Smith, 'Untitled 5,' the 
drawing used by K. Jones-Smith and R.P. Taylor to test 
the fractal analysis of Pollock's drip paintings [6]. 




ChaosBook has none, so 'Untitled 5' of figure 1.5 will have to do as the illustra- 
tion of the power of fractal analysis. Fractal science posits that certain quantities 
(Lyapunov exponents, generalized dimensions, . . . ) can be estimated on a com- 
puter. While some of the numbers so obtained are indeed mathematically sensible 
characterizations of fractals, they are in no sense observable and measurable on 
the length-scales and time-scales dominated by chaotic dynamics. 

Even though the experimental evidence for the fractal geometry of nature is 
circumstantial [7], in studies of probabilistically assembled fractal aggregates we 
know of nothing better than contemplating such quantities. In deterministic sys- 
tems we can do much better. 



1.4 A game of pinball 



Formulas hamper the understanding. 
— S. Smale 

We are now going to get down to the brass tacks. Time to fasten your seat belts 
and turn off all electronic devices. But first, a disclaimer: If you understand the 
rest of this chapter on the first reading, you either do not need this book, or you are 
delusional. If you do not understand it, it is not because the people who figured 
all this out first are smarter than you: the most you can hope for at this stage is to 
get a flavor of what lies ahead. If a statement in this chapter mystifies/intrigues, 
fast forward to a section indicated by [section ...] on the margin, read only the 
parts that you feel you need. Of course, we think that you need to learn ALL of it, 
or otherwise we would not have included it in ChaosBook in the first place. 

Confronted with a potentially chaotic dynamical system, our analysis pro- 
ceeds in three stages; I. diagnose, II. count, III. measure. First, we determine 
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Figure 1.6: Binary labeling of the 3-disk pinball tra- 
jectories; a bounce in which the trajectory returns to 
the preceding disk is labeled 0, and a bounce which 
results in continuation to the third disk is labeled 1 . 



the intrinsic dimension of the system-the minimum number of coordinates nec- 
essary to capture its essential dynamics. If the system is very turbulent we are, 
at present, out of luck. We know only how to deal with the transitional regime 
between regular motions and chaotic dynamics in a few dimensions. That is still 
something; even an infinite-dimensional system such as a burning flame front can 
turn out to have a very few chaotic degrees of freedom. In this regime the chaotic 
dynamics is restricted to a space of low dimension, the number of relevant param- 
eters is small, and we can proceed to step II; we count and classify all possible 
topologically distinct trajectories of the system into a hierarchy whose successive 
layers require increased precision and patience on the part of the observer. This 
we shall do in sect. 1.4.2. If successful, we can proceed with step III: investigate 
the weights of the different pieces of the system. 

We commence our analysis of the pinball game with steps I, II: diagnose, 
count. We shall return to step Ill-measure-in sect. 1.5. The three sections that 
follow are highly technical, they go into the guts of what the book is about. If 
today is not your thinking day, skip them, jump straight to sect. 1.7. 



chapter 1 1 
chapter 15 



chapter 20 



1.4.1 Symbolic dynamics 

With the game of pinball we are in luck-it is a low dimensional system, free 
motion in a plane. The motion of a point particle is such that after a collision 
with one disk it either continues to another disk or it escapes. If we label the 
three disks by 1, 2 and 3, we can associate every trajectory with an itinerary, a 
sequence of labels indicating the order in which the disks are visited; for example, 
the two trajectories in figure 1 .2 have itineraries _23 13_, _23 132321_ respectively. 
Such labeling goes by the name symbolic dynamics. As the particle cannot collide sec tj on 2 1 
two times in succession with the same disk, any two consecutive symbols must 
differ. This is an example of pruning, a rule that forbids certain subsequences 
of symbols. Deriving pruning rules is in general a difficult problem, but with the 
game of pinball we are lucky-for well-separated disks there are no further pruning 
rules. 



chapter 12 



The choice of symbols is in no sense unique. For example, as at each bounce 
we can either proceed to the next disk or return to the previous disk, the above 
3-letter alphabet can be replaced by a binary {0, 1) alphabet, figure 1.6. A clever 
choice of an alphabet will incorporate important features of the dynamics, such as 
its symmetries. 

Suppose you wanted to play a good game of pinball, that is, get the pinball 
to bounce as many times as you possibly can-what would be a winning strategy? 
The simplest thing would be to try to aim the pinball so it bounces many times 
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Figure 1.7 : The 3-disk pinball cycles 1232 and 
121212313. 



Figure 1.8: (a) A trajectory starting out from disk 
1 can either hit another disk or escape, (b) Hitting 
two disks in a sequence requires a much sharper aim, 
with initial conditions that hit further consecutive disks 
nested within each other, as in Fig. 1.9. 

between a pair of disks-if you managed to shoot it so it starts out in the periodic 
orbit bouncing along the line connecting two disk centers, it would stay there for- 
ever. Your game would be just as good if you managed to get it to keep bouncing 
between the three disks forever, or place it on any periodic orbit. The only rub 
is that any such orbit is unstable, so you have to aim very accurately in order to 
stay close to it for a while. So it is pretty clear that if one is interested in playing 
well, unstable periodic orbits are important-they form the skeleton onto which all 
trajectories trapped for long times cling. 

1.4.2 Partitioning with periodic orbits 

A trajectory is periodic if it returns to its starting position and momentum. We 
shall sometimes refer to the set of periodic points that belong to a given periodic 
orbit as a cycle. 

Short periodic orbits are easily drawn and enumerated-an example is drawn in 
figure 1.7-but it is rather hard to perceive the systematics of orbits from their con- 
figuration space shapes. In mechanics a trajectory is fully and uniquely specified 
by its position and momentum at a given instant, and no two distinct state space 
trajectories can intersect. Their projections onto arbitrary subspaces, however, 
can and do intersect, in rather unilluminating ways. In the pinball example the 
problem is that we are looking at the projections of a 4-dimensional state space 
trajectories onto a 2-dimensional subspace, the configuration space. A clearer 
picture of the dynamics is obtained by constructing a set of state space Poincare 
sections. 

Suppose that the pinball has just bounced off disk 1. Depending on its position 
and outgoing angle, it could proceed to either disk 2 or 3. Not much happens in 
between the bounces-the ball just travels at constant velocity along a straight line- 
so we can reduce the 4-dimensional flow to a 2-dimensional map P that takes the 
coordinates of the pinball from one disk edge to another disk edge. The trajectory 
just after the moment of impact is defined by s n , the arc-length position of the 
rath bounce along the billiard wall, and p n = p sin <f> n the momentum component 
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Figure 1.9: The 3-disk game of pinball Poincare 
section, trajectories emanating from the disk 1 
with xq = (so, po) ■ (a) Strips of initial points Mn, 
M\3 which reach disks 2, 3 in one bounce, respec- 
tively, (b) Strips of initial points Mm, Mm Mm 
and AI123 which reach disks 1, 2, 3 in two bounces, 
respectively. The Poincare sections for trajectories 
originating on the other two disks are obtained by 
the appropriate relabeling of the strips. Disk ra- 
dius : center separation ratio a:R = 1:2.5. (Y. 
Lan) 




123 


131 


121 


132 



(b) 



parallel to the billiard wall at the point of impact, see figure 1.9. Such section of a 
flow is called a Poincare section. In terms of Poincare sections, the dynamics is 
reduced to the set of six maps Ps k ^ Sj '■ {s n ,Pn) (s n +i, Pn+\), with s e {1,2,3}, 
from the boundary of the disk j to the boundary of the next disk k. 

Next, we mark in the Poincare section those initial conditions which do not 
escape in one bounce. There are two strips of survivors, as the trajectories orig- 
inating from one disk can hit either of the other two disks, or escape without 
further ado. We label the two strips Mn, AI13. Embedded within them there 
are four strips Aim, AI123, AI131, AI132 of initial conditions that survive for two 
bounces, and so forth, see figures 1.8 and 1.9. Provided that the disks are suffi- 
ciently separated, after n bounces the survivors are divided into 2 distinct strips: 
the Al,th strip consists of all points with itinerary i = S1S2S3 . . . s„, s = {1,2,3}. 
The unstable cycles as a skeleton of chaos are almost visible here: each such patch 
contains a periodic point STS2S3 ...s„ with the basic block infinitely repeated. Pe- 
riodic points are skeletal in the sense that as we look further and further, the strips 
shrink but the periodic points stay put forever. 



example 3.9 
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We see now why it pays to utilize a symbolic dynamics; it provides a naviga- 
tion chart through chaotic state space. There exists a unique trajectory for every 
admissible infinite length itinerary, and a unique itinerary labels every trapped 
trajectory. For example, the only trajectory labeled by 12 is the 2-cycle bouncing 
along the line connecting the centers of disks 1 and 2; any other trajectory starting 
out as 12 . . . either eventually escapes or hits the 3rd disk. 



1.4.3 Escape rate 

example 17.4 

What is a good physical quantity to compute for the game of pinball? Such a sys- 
tem, for which almost any trajectory eventually leaves a finite region (the pinball 
table) never to return, is said to be open, or a repeller. The repeller escape rate 
is an eminently measurable quantity. An example of such a measurement would 
be an unstable molecular or nuclear state which can be well approximated by a 
classical potential with the possibility of escape in certain directions. In an ex- 
periment many projectiles are injected into a macroscopic 'black box' enclosing 
a microscopic non-confining short-range potential, and their mean escape rate is 
measured, as in figure 1.1. The numerical experiment might consist of injecting 
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the pinball between the disks in some random direction and asking how many 
times the pinball bounces on the average before it escapes the region between the 
disks. 

For a theorist, a good game of pinball consists in predicting accurately the 
asymptotic lifetime (or the escape rate) of the pinball. We now show how periodic 
orbit theory accomplishes this for us. Each step will be so simple that you can 
follow even at the cursory pace of this overview, and still the result is surprisingly 
elegant. 

Consider figure 1 .9 again. In each bounce the initial conditions get thinned 
out, yielding twice as many thin strips as at the previous bounce. The total area 
that remains at a given time is the sum of the areas of the strips, so that the fraction 
of survivors after n bounces, or the survival probability is given by 

a IMol lAfil a IMool lAiiol IMoil IMil 

i 1 — — — 1 — , 1 ? — — 1 — 1 — 1- 



\M\ \M\ \M\ \M\ \M\ \M\ 

■ v*t m ' (L2) 



where i is a label of the /th strip, \M\ is the initial area, and \A\\ is the area of 
the /th strip of survivors, i = 01, 10, 11, ... is a label, not a binary number. Since 
at each bounce one routinely loses about the same fraction of trajectories, one 
expects the sum (1.2) to fall off exponentially with n and tend to the limit 



\ +i /T n = -» e~\ (1.3) 



exercise 1 .2 
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The quantity y is called the escape rate from the repeller. 



1.5 Chaos for cyclists 



Etant donnees des equations ... et une solution particuliere 
quelconque de ces equations, on peut toujours trouver une 
solution periodique (dont la periode peut, il est vrai, etre 
tres longue), telle que la difference entre les deux solu- 
tions soit aussi petite qu'on le veut, pendant un temps aussi 
long qu'on le veut. D'ailleurs, ce qui nous rend ces solu- 
tions periodiques si precieuses, c'est qu'elles sont, pour 
ansi dire, la seule breche par ou nous puissions esseyer de 
penetrer dans une place jusqu'ici reputee inabordable. 

— H. Poincare, Les methodes nouvelles de la 
mechanique celeste 

We shall now show that the escape rate y can be extracted from a highly conver- 
gent exact expansion by reformulating the sum (1.2) in terms of unstable periodic 
orbits. 
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If, when asked what the 3-disk escape rate is for a disk of radius 1, center- 
center separation 6, velocity 1, you answer that the continuous time escape rate 
is roughly y = 0.4103384077693464893384613078192 . . ., you do not need this 
book. If you have no clue, hang on. 

1.5.1 How big is my neighborhood? 



Not only do the periodic points keep track of topological ordering of the strips, 
but, as we shall now show, they also determine their size. As a trajectory evolves, 
it carries along and distorts its infinitesimal neighborhood. Let 



denote the trajectory of an initial point xq = x(0). Expanding f(xo + 8xq) to 
linear order, the evolution of the distance to a neighboring trajectory j$(t) + Sxj(t) 
is given by the Jacobian matrix J: 



A trajectory of a pinball moving on a flat surface is specified by two position co- 
ordinates and the direction of motion, so in this case d = 3. Evaluation of a cycle 
Jacobian matrix is a long exercise - here we just state the result. The Jacobian 
matrix describes the deformation of an infinitesimal neighborhood of x(t) along 
the flow; its eigenvectors and eigenvalues give the directions and the correspond- 
ing rates of expansion or contraction, figure 1.10. The trajectories that start out in 
an infinitesimal neighborhood separate along the unstable directions (those whose 
eigenvalues are greater than unity in magnitude), approach each other along the 
stable directions (those whose eigenvalues are less than unity in magnitude), and 
maintain their distance along the marginal directions (those whose eigenvalues 
equal unity in magnitude). 

In our game of pinball the beam of neighboring trajectories is defocused along 
the unstable eigen-direction of the Jacobian matrix /. 

As the heights of the strips in figure 1 .9 are effectively constant, we can con- 
centrate on their thickness. If the height is « L, then the area of the zth strip is 
Mi « Lli for a strip of width /,-. 



Of course, we can prove all these results directly from 
Eq. (17.26) by pedestrian mathematical manipulations, 
but that only makes it harder to appreciate their physical 
significance. 



— Rick Salmon, "Lectures on Geophysical Fluid Dy- 
namics", Oxford Univ. Press (1998) 



x(t) = f(x 0 ) 




d 



(1.4) 



intro - 9apr2009 



ChaosBook.org versionl4.4. 1, Apr21 2013 



CHAPTER 1. OVERTURE 



17 




8x(t) = J l Sx(0) 



Figure 1.10: The Jacobian matrix / maps an infinites- 
imal displacement Sx at xq into a displacement J'(xq)6x 
finite time t later. 



Each strip i in figure 1.9 contains a periodic point x,. The finer the intervals, 
the smaller the variation in flow across them, so the contribution from the strip 
of width /, is well-approximated by the contraction around the periodic point % 
within the interval, 

h = ad\H , (1.5) 

where A; is the unstable eigenvalue of the Jacobian matrix J{xj) evaluated at 
the /th periodic point for t - T p , the full period (due to the low dimensionality, 
the Jacobian can have at most one unstable eigenvalue). Only the magnitude of 
this eigenvalue matters, we can disregard its sign. The prefactors q reflect the 
overall size of the system and the particular distribution of starting values of x. As 
the asymptotic trajectories are strongly mixed by bouncing chaotically around the 
repeller, we expect their distribution to be insensitive to smooth variations in the 
distribution of initial points. 

To proceed with the derivation we need the hyperbolicity assumption: for 
large n the prefactors a, ~ 0(1) are overwhelmed by the exponential growth of 
A;, so we neglect them. If the hyperbolicity assumption is justified, we can replace 
\Mi\ » Lli in (1.2) by 1/|A,-| and consider the sum 



(n) 



r„ = 2 \/\Ai\, 



where the sum goes over all periodic points of period n. We now define a gener- 
ating function for sums over all periodic orbits of all lengths: 



ra) = ^r/. (i.6) 

n=\ 



Recall that for large n the nth level sum (1.2) tends to the limit T n — > e " y , so the 
escape rate y is determined by the smallest z = e y for which (1.6) diverges: 



r(z)« V {ze -y)" = -^—. (i.7) 

1 - ze y 

n=l 



section 16.4 



section 18.1.1 
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This is the property of T(z) that motivated its definition. Next, we devise a formula 
for (1.6) expressing the escape rate in terms of periodic orbits: 



(/;) 



n=l 



2 2 2 1 

z z z z z z 



+ tt—. + — — r + : r + — — : + 
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|Aol |Ai| |Aool |Aoil lAiol |An| 

3 3 3 3 

+— — + — — + — — + — — + ... (1.8) 
lAoool lAooil IAqioI lAiool 



For sufficiently small z this sum is convergent. The escape rate y is now given by 
the leading pole of (1.7), rather than by a numerical extrapolation of a sequence of 
y n extracted from (1.3). As any finite truncation n < «trunc °f (1-8) is a polyno- 
mial in z, convergent for any z, finding this pole requires that we know something 
about r„ for any n, and that might be a tall order. 

We could now proceed to estimate the location of the leading singularity of 
T(z) from finite truncations of (1.8) by methods such as Pade approximants. How- 
ever, as we shall now show, it pays to first perform a simple resummation that 
converts this divergence into a zero of a related function. 



1.5.2 Dynamical zeta function 

If a trajectory retraces a prime cycle r times, its expanding eigenvalue is A^. A 
prime cycle p is a single traversal of the orbit; its label is a non-repeating symbol 
string of n p symbols. There is only one prime cycle for each cyclic permutation 
class. For example, p = 0011 = 1001 = 1100 = 0110 is prime, but 0101 =~01 is not. 
By the chain rule for derivatives the stability of a cycle is the same everywhere 

exercise 15.2 

along the orbit, so each prime cycle of length n p contributes n p terms to the sum section 4 5 
(1.8). Hence (1.8) can be rewritten as 



where the index p runs through all distinct prime cycles. Note that we have re- 
summed the contribution of the cycle p to all times, so truncating the summation 
up to given p is not a finite time n < n p approximation, but an asymptotic, infinite 
time estimate based by approximating stabilities of all cycles by a finite number of 
the shortest cycles and their repeats. The n p z Hp factors in (1.9) suggest rewriting 
the sum as a derivative 
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Hence T(z) is a logarithmic derivative of the infinite product 

i/^)-n (i ~^ ) ' ^ = ix~i- (uo) 



This function is called the dynamical zeta function, in analogy to the Riemann 
zeta function, which motivates the 'zeta' in its definition as l/£(z)- This is the 
prototype formula of periodic orbit theory. The zero of 1 /£(z) is a pole of T(z), 
and the problem of estimating the asymptotic escape rates from finite n sums 
such as (1.2) is now reduced to a study of the zeros of the dynamical zeta function 
(1.10). The escape rate is related by (1.7) to a divergence of T(z), and T(z) diverges 
whenever 1 /£(z) has a zero. 

Easy, you say: "Zeros of (1.10) can be read off the formula, a zero 
z P = |A/^ 

for each term in the product. What's the problem?" Dead wrong! 



1.5.3 Cycle expansions 

How are formulas such as (1.10) used? We start by computing the lengths and 
eigenvalues of the shortest cycles. This usually requires some numerical work, 
such as the Newton method searches for periodic solutions; we shall assume that 
the numerics are under control, and that all short cycles up to given length have 
been found. In our pinball example this can be done by elementary geometrical 
optics. It is very important not to miss any short cycles, as the calculation is as 
accurate as the shortest cycle dropped-including cycles longer than the shortest 
omitted does not improve the accuracy (unless exponentially many more cycles 
are included). The result of such numerics is a table of the shortest cycles, their 
periods and their stabilities. 

Now expand the infinite product (1.10), grouping together the terms of the 
same total symbol string length 

\IC = (l-fo)(l-ti)(l-fio)(l-*ioo)"- 

= 1 - to - h - Vho - hto\ - [(fioo - hoto) + (?ioi - ^io^i)] 
-[(fiooo _ Wioo) + (?1110 _ fi ?iio) 

+(?1001 _ M001 - t\0\to + tiotot\)] - . . . (1-11) 

The virtue of the expansion is that the sum of all terms of the same total length 
n (grouped in brackets above) is a number that is exponentially smaller than a 
typical term in the sum, for geometrical reasons we explain in the next section. 



section 22.1 
section 19.4 
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Figure 1.11: Approximation to a smooth dynamics 
(left frame) by the skeleton of periodic points, together 
with their linearized neighborhoods, (right frame). In- 
dicated are segments of two 1 -cycles and a 2-cycle 
that alternates between the neighborhoods of the two 
1 -cycles, shadowing first one of the two 1 -cycles, and 
then the other. 




Figure 1.12: A longer cycle p" shadowed by a pair of 
shorter cycles p and p' . 




The calculation is now straightforward. We substitute a finite set of the eigen- 
values and lengths of the shortest prime cycles into the cycle expansion (1.11), and 
obtain a polynomial approximation to l/£. We then vary z in (1.10) and determine 
the escape rate y by finding the smallest z = e y for which (1.11) vanishes. 

1.5.4 Shadowing 

When you actually start computing this escape rate, you will find out that the 
convergence is very impressive: only three input numbers (the two fixed points 0, 
1 and the 2-cycle 10) already yield the pinball escape rate to 3-4 significant digits! 
We have omitted an infinity of unstable cycles; so why does approximating the 
dynamics by a finite number of the shortest cycle eigenvalues work so well? 

The convergence of cycle expansions of dynamical zeta functions is a conse- 
quence of the smoothness and analyticity of the underlying flow. Intuitively, one 
can understand the convergence in terms of the geometrical picture sketched in 
figure 1.11; the key observation is that the long orbits are shadowed by sequences 
of shorter orbits. 

A typical term in (1 . 1 1) is a difference of a long cycle \ab\ minus its shadowing 
approximation by shorter cycles \a\ and \b\ (see figure 1.12), 



where a and b are symbol sequences of the two shorter cycles. If all orbits are 
weighted equally (t p = z np ), such combinations cancel exactly; if orbits of similar 




(1.12) 
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symbolic dynamics have similar weights, the weights in such combinations almost 
cancel. 

This can be understood in the context of the pinball game as follows. Consider 
orbits 0, 1 and 01. The first corresponds to bouncing between any two disks while 
the second corresponds to bouncing successively around all three, tracing out an 
equilateral triangle. The cycle 01 starts at one disk, say disk 2. It then bounces 
from disk 3 back to disk 2 then bounces from disk 1 back to disk 2 and so on, so its 
itinerary is 2321. In terms of the bounce types shown in figure 1.6, the trajectory is 
alternating between 0 and 1 . The incoming and outgoing angles when it executes 
these bounces are very close to the corresponding angles for 0 and 1 cycles. Also 
the distances traversed between bounces are similar so that the 2-cycle expanding 
eigenvalue Aqi is close in magnitude to the product of the 1-cycle eigenvalues 
A 0 Ai. 

To understand this on a more general level, try to visualize the partition of 
a chaotic dynamical system's state space in terms of cycle neighborhoods as a 
tessellation (a tiling) of the dynamical system, with smooth flow approximated by 
its periodic orbit skeleton, each 'tile' centered on a periodic point, and the scale 
of the 'tile' determined by the linearization of the flow around the periodic point, 
as illustrated by figure 1.11. 

The orbits that follow the same symbolic dynamics, such as {ab} and a 'pseudo 
orbit' {£?}{&}, lie close to each other in state space; long shadowing pairs have to 
start out exponentially close to beat the exponential growth in separation with 
time. If the weights associated with the orbits are multiplicative along the flow 
(for example, by the chain rule for products of derivatives) and the flow is smooth, 
the term in parenthesis in (1.12) falls off exponentially with the cycle length, and 
therefore the curvature expansions are expected to be highly convergent. 

Cn3pt6r c.o 



1.6 Change in time 



MEN are deplorably ignorant with respect to natural 
things and modern philosophers as though dreaming in the 
darkness must be aroused and taught the uses of things the 
dealing with things they must be made to quit the sort of 
learning that comes only from books and that rests only 
on vain arguments from probability and upon conjectures. 
— William Gilbert, De Magnete, 1600 

The above derivation of the dynamical zeta function formula for the escape rate 
has one shortcoming; it estimates the fraction of survivors as a function of the 
number of pinball bounces, but the physically interesting quantity is the escape 
rate measured in units of continuous time. For continuous time flows, the escape 
rate (1.2) is generalized as follows. Define a finite state space region M such 
that a trajectory that exits Al never reenters. For example, any pinball that falls 
of the edge of a pinball table in figure 1 . 1 is gone forever. Start with a uniform 
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distribution of initial points. The fraction of initial x whose trajectories remain 
within M at time t is expected to decay exponentially 



J dxdyS(y-f(x)) 

r(o = m > e - yt 



L dx 



The integral over x starts a trajectory at every x e M. The integral over y tests 
whether this trajectory is still in M at time t. The kernel of this integral 

£ t (y,x) = 6(y-f ! (x)) (1.13) 



is the Dirac delta function, as for a deterministic flow the initial point x maps 
into a unique point y at time t. For discrete time, f{x) is the nth iterate of the 
map /. For continuous flows, f(x) is the trajectory of the initial point x, and it 
is appropriate to express the finite time kernel £ in terms of 3K, the generator of 
infinitesimal time translations 



section 16.6 

very much in the way the quantum evolution is generated by the Hamiltonian H, 
the generator of infinitesimal time quantum transformations. 

As the kernel X is the key to everything that follows, we shall give it a name, 
and refer to it and its generalizations as the evolution operator for a ci-dimensional 
map or a J-dimensional flow. 

The number of periodic points increases exponentially with the cycle length 
(in the case at hand, as 2"). As we have already seen, this exponential proliferation 
of cycles is not as dangerous as it might seem; as a matter of fact, all our compu- 
tations will be earned out in the n —> oo limit. Though a quick look at long-time 
density of trajectories might reveal it to be complex beyond belief, this distribution 
is still generated by a simple deterministic law, and with some luck and insight, 
our labeling of possible motions will reflect this simplicity. If the rule that gets us 
from one level of the classification hierarchy to the next does not depend strongly 
on the level, the resulting hierarchy is approximately self-similar. We now turn 
such approximate self-similarity to our advantage, by turning it into an operation, 
the action of the evolution operator, whose iteration encodes the self-similarity. 



1.6.1 Trace formula 

In physics, when we do not understand something, we give 
it a name. 

— Matthias Neubert 
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Figure 1.13: The trace of an evolution operator is con- 
centrated in tubes around prime cycles, of length T p 
and thickness 1 /\A p \ r for the rth repetition of the prime 
cycle p. 



M!«T DIVERGE . 




Jy<la / ' 



> Ihfckness ^£ 




Recasting dynamics in terms of evolution operators changes everything. So far 
our formulation has been heuristic, but in the evolution operator formalism the es- 
cape rate and any other dynamical average are given by exact formulas, extracted 
from the spectra of evolution operators. The key tools are trace formulas and 
spectral determinants. 

The trace of an operator is given by the sum of its eigenvalues. The explicit 
expression (1.13) for £!{x,y) enables us to evaluate the trace. Identify y with x 
and integrate x over the whole state space. The result is an expression for tr £ as 
a sum over neighborhoods of prime cycles p and their repetitions 



p £idet(l-M£) 



where T p is the period of prime cycle p, and the monodromy matrix M p is the 
flow-transverse part of Jacobian matrix J (1.4). This formula has a simple geo- 
metrical interpretation sketched in figure 1.13. After the rth return to a Poincare 
section, the initial tube M p has been stretched out along the expanding eigen- 

directions, with the overlap with the initial volume given by l/|det(l - Mp)| — > 
1 /\A p \, the same weight we obtained heuristically in sect. 1.5.1. 

The 'spiky' sum (1.14) is disquieting in the way reminiscent of the Poisson 
resummation formulas of Fourier analysis; the left-hand side is the smooth eigen- 
value sum tr e 5 ^ = £ e" at , while the right-hand side equals zero everywhere except 
for the set t - rT p . A Laplace transform smooths the sum over Dirac delta func- 
tions in cycle periods and yields the trace formula for the eigenspectrum s\, • • • 
of the classical evolution operator: 



dt e~ st tr£' = tr 



s-9K 



00 1 00 „r(B-A,,-sT„) 

y— = ytpY i i . (i.i5) 



The beauty of trace formulas lies in the fact that everything on the right-hand- 
side-prime cycles p, their periods T p and the eigenvalues of M ; ,-is an invariant 
property of the flow, independent of any coordinate choice. 



section 18.2 
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1.6.2 Spectral determinant 



The eigenvalues of a linear operator are given by the zeros of the appropriate 
determinant. One way to evaluate determinants is to expand them in terms of 
traces, using the identities 



exercise 4.1 



d d 1 

— lndet(j-^) = tr — \n{s - J{) = tr 

ds ds s - J{ 



(1.16) 



and integrating over s. In this way the spectral determinant of an evolution oper- 
ator becomes related to the traces that we have just computed: 



chapter 19 



det (s - Jl) = exp 



1 

-2Z 1 



e -T,r 



( V^i r det (1-^)1 



(1.17) 



The 1 jr factor is due to the s integration, leading to the replacement T p — > T p /rT p 

in the periodic orbit expansion (1.15). 

F section 19.5 

We have now retraced the heuristic derivation of the divergent sum (1.7) and 
the dynamical zeta function (1.10), but this time with no approximations: formula 
(1.17) is exact. The computation of the zeros of det (s - 3T) proceeds very much 
like the computations of sect. 1.5.3. 



1.7 From chaos to statistical mechanics 



Under heaven, all is chaos. The situation is excellent! 
— Chairman Mao Zedong, a letter to Jiang Qing 

The replacement of individual trajectories by evolution operators which propagate 
densities feels like a bit of mathematical voodoo. Nevertheless, something very 
radical and deeply foundational has taken place. Understanding the distinction 
between evolution of individual trajectories and the evolution of the densities of 
trajectories is key to understanding statistical mechanics-this is the conceptual 
basis of the second law of thermodynamics, and the origin of irreversibility of the 
arrow of time for deterministic systems with time-reversible equations of motion: 
reversibility is attainable for distributions whose measure in the space of density 
functions goes exponentially to zero with time. 

Consider a chaotic flow, such as the stirring of red and white paint by some 
deterministic machine. If we were able to track individual trajectories, the fluid 
would forever remain a striated combination of pure white and pure red; there 
would be no pink. What is more, if we reversed the stirring, we would return to 
the perfect white/red separation. However, that cannot be-in a very few turns of 
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the Stirling stick the thickness of the layers goes from centimeters to Angstroms, 
and the result is irreversibly pink. 

A century ago it seemed reasonable to assume that statistical mechanics ap- 
plies only to systems with very many degrees of freedom. More recent is the 
realization that much of statistical mechanics follows from chaotic dynamics, and 
already at the level of a few degrees of freedom the evolution of densities is irre- 
versible. Furthermore, the theory that we shall develop here generalizes notions of 
'measure' and 'averaging' to systems far from equilibrium, and transports us into 
regions hitherto inaccessible with the tools of equilibrium statistical mechanics. 

By going to a description in terms of the asymptotic time evolution operators 
we give up tracking individual trajectories for long times, but trade in the un- 
controllable trajectories for a powerful description of the asymptotic trajectory 
densities. This will enable us, for example, to give exact formulas for transport 
coefficients such as the diffusion constants without any probabilistic assumptions. 
The classical Boltzmann equation for evolution of 1 -particle density is based on 
stosszahlansatz, neglect of particle correlations prior to, or after a 2-particle col- 
lision. It is a very good approximate description of dilute gas dynamics, but 
a difficult starting point for inclusion of systematic corrections. In the theory 
developed here, no correlations are neglected - they are all included in the cy- 
cle averaging formulas such as the cycle expansion for the diffusion constant 
IdD = lim^oo (x{T) 2 ^ /T of a particle diffusing chaotically across a spatially- 
periodic array, 



" 2d {T)^ K |A P1 ---A W | 



(1.18) 



where h p is a translation along one period of a spatially periodic 'runaway' tra- 
jectory p. Such formulas are exact; the issue in their applications is what are 
the most effective schemes of estimating the infinite cycle sums required for their 
evaluation. Unlike most statistical mechanics, here there are no phenomenological 
macroscopic parameters; quantities such as transport coefficients are calculable to 
any desired accuracy from the microscopic dynamics. 

The concepts of equilibrium statistical mechanics do help us, however, to un- 
derstand the ways in which the simple-minded periodic orbit theory falters. A 
non-hyperbolicity of the dynamics manifests itself in power-law correlations and 
even 'phase transitions.' 



chapter 25 
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1.8 Chaos: what is it good for? 



Happy families are all alike; every unhappy family is un- 
happy in its own way. 

— Anna Karenina, by Leo Tolstoy 
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(a) 





(b) sin(Q)t) 



frequency Q 



Figure 1.14: (a) Washboard mean velocity, (b) 
cold atom lattice diffusion, and (c) AFM tip drag 
force. (Y. Lan) 



(q) velocity 




With initial data accuracy 8x = |<5x(0)| and system size L, a trajectory is predictable 
only up to the finite Lyapunov time (1.1), 7L yap ~ AT 1 \n\LI5x\ . Beyond that, 
chaos rules. And so the most successful applications of 'chaos theory' have so far 
been to problems where observation time is much longer than a typical 'turnover' 
time, such as statistical mechanics, quantum mechanics, and questions of long 
term stability in celestial mechanics, where the notion of tracking accurately a 
given state of the system is nonsensical. 



So what is chaos good for? Transport! Though superficially indistinguishable 
from the probabilistic random walk diffusion, in low dimensional settings the de- 
terministic diffusion is quite recognizable, through the fractal dependence of the 
diffusion constant on the system parameters, and perhaps through non-Gaussion 

relaxation to equilibrium (non-vanishing Burnett coefficients). 

M v 6 ' section 25.2.1 

Several tabletop experiments that could measure transport on macroscopic 
scales are sketched in figure 1.14 (each a tabletop, but an expensive tabletop). Fig- 
ure 1.14(a) depicts a 'slanted washboard;' a particle in a gravity field bouncing 
down the washboard, losing some energy at each bounce, or a charged particle in 
a constant electric field trickling across a periodic condensed-matter device. The 
interplay between chaotic dynamics and energy loss results in a terminal mean ve- 
locity/conductance, a function of the washboard slant or external electric field that 
the periodic theory can predict accurately. Figure 1 . 14 (b) depicts a 'cold atom lat- 
tice' of very accurate spatial periodicity, with a dilute cloud of atoms placed onto 
a standing wave established by strong laser fields. Interaction of gravity with gen- 
tle time-periodic jiggling of the EM fields induces a diffusion of the atomic cloud, 
with a diffusion constant predicted by the periodic orbit theory. Figure 1.14(c) 
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depicts a tip of an atomic force microscope (AFM) bouncing against a periodic 
atomic surface moving at a constant velocity. The frictional drag experienced 
is the interplay of the chaotic bouncing of the tip and the energy loss at each 
tip/surface collision, accurately predicted by the periodic orbit theory. None of 
these experiments have actually been earned out, (save for some numerical exper- 
imentation), but are within reach of what can be measured today. 

Given microscopic dynamics, periodic orbit theory predicts observable macro- 
scopic transport quantities such as the washboard mean velocity, cold atom lattice 
diffusion constant, and AFM tip drag force. But the experimental proposal is sex- 
ier than that, and goes into the heart of dynamical systems theory. 

Smale 1960s theory of the hyperbolic structure of the non-wandering set 
(AKA 'horseshoe') was motivated by his 'structural stability' conjecture, which - 
in non-technical terms - asserts that all trajectories of a chaotic dynamical system 
deform smoothly under small variations of system parameters. 

Why this cannot be true for a system like the washboard in figure 1.14 (a) is 
easy to see for a cyclist. Take a trajectory which barely grazes the tip of one of the 
groves. An arbitrarily small change in the washboard slope can result in loss of 
this collision, change a forward scattering into a backward scattering, and lead to 
a discontinuous contribution to the mean velocity. You might hold out hope that 
such events are rare and average out, but not so - a loss of a short cycle leads to a 
significant change in the cycle-expansion formula for a transport coefficient, such 
as (1.18). 

When we write an equation, it is typically parameterized by a set of parameters 
by as coupling strengths, and we think of dynamical systems obtained by a smooth 
variation of a parameter as a 'family' We would expect measurable predictions to 
also vary smoothly, i.e., be 'structurally stable.' 

But dynamical systems families are 'families' only in a name. That the struc- 
tural stability conjecture turned out to be badly wrong is, however, not a blow for 
chaotic dynamics. Quite to the contrary, it is actually a virtue, perhaps the most 
dramatic experimentally measurable prediction of chaotic dynamics. 

As long as microscopic periodicity is exact, the prediction is counterintuitive 
for a physicist - transport coefficients are not smooth functions of system parame- 
ters, rather they are non-monotonic, nowhere differentiate functions. Conversely, 
if the macroscopic measurement yields a smooth dependence of the transport on 
system parameters, the periodicity of the microscopic lattice is degraded by impu- 
rities, and probabilistic assumptions of traditional statistical mechanics apply. So 
the proposal is to -by measuring macroscopic transport- conductance, diffusion, 
drag -observe determinism on nanoscales, and -for example- determine whether 
an atomic surface is clean. 

The signatures of deterministic chaos are even more baffling to an engineer: 
a small increase of pressure across a pipe exhibiting turbulent flow does not nec- 
essarily lead to an increase in the mean flow; mean flow dependence on pressure 
drop across the pipe is also a fractal function. 
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Is this in contradiction with the traditional statistical mechanics? No - deter- 
ministic chaos predictions are valid in settings where a few degrees of freedom are 
important, and chaotic motion time and space scales are commensurate with the 
external driving and spatial scales. Further degrees of freedom act as noise that 
smooths out the above fractal effects and restores a smooth functional dependence 
of transport coefficients on external parameters. 



This book offers everyman a breach into a domain hitherto reputed unreachable, 
a domain traditionally traversed only by mathematical physicists and mathemati- 
cians. What distinguishes it from mathematics is the insistence on computability 
and numerical convergence of methods offered. A rigorous proof, the end of the 
story as far as a mathematician is concerned, might state that in a given setting, 
for times in excess of 10 32 years, turbulent dynamics settles onto an attractor of 
dimension less than 600. Such a theorem is of a little use to an honest, hard- 
working plumber, especially if her hands-on experience is that within the span of 
a few typical 'turnaround' times the dynamics seems to settle on a (transient?) 
attractor of dimension less than 3. If rigor, magic, fractals or brains is your thing, 
read remark 1 .4 and beyond. 

So, no proofs! but lot of hands-on plumbing ahead. 

Many a chapter alone could easily grow to a book size if unchecked: the 
nuts and bolt of the theory include ODEs, PDEs, stochastic ODEs, path integrals, 
group theory, coding theory, graph theory, ergodic theory, linear operator theory, 
quantum mechanics, etc.. We include material into the text proper on 'need-to- 
know' basis, relegate technical details to appendices, and give pointers to further 
reading in the remarks at the end of each chapter. 



Resume 

This text is an exposition of the best of all possible theories of deterministic chaos, 
and the strategy is: 1) count, 2) weigh, 3) add up. 

In a chaotic system any open ball of initial conditions, no matter how small, 
will spread over the entire accessible state space. Hence the theory focuses on 
describing the geometry of the space of possible outcomes, and evaluating av- 
erages over this space, rather than attempting the impossible: precise prediction 



1.9 What is not in 



ChaosBook 



There is only one thing which interests me vitally now, 
and that is the recording of all that which is omitted in 
books. Nobody, as far as I can see, is making use of those 
elements in the air which give direction and motivation to 
our lives. 



— Henry Miller, Tropic of Cancer 
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of individual trajectories. The dynamics of densities of trajectories is described 
in terms of evolution operators. In the evolution operator formalism the dynami- 
cal averages are given by exact formulas, extracted from the spectra of evolution 
operators. The key tools are trace formulas and spectral determinants. 

The theory of evaluation of the spectra of evolution operators presented here is 
based on the observation that the motion in dynamical systems of few degrees of 
freedom is often organized around a few fundamental cycles. These short cycles 
capture the skeletal topology of the motion on a strange attractor/repeller in the 
sense that any long orbit can approximately be pieced together from the nearby pe- 
riodic orbits of finite length. This notion is made precise by approximating orbits 
by prime cycles, and evaluating the associated curvatures. A curvature measures 
the deviation of a longer cycle from its approximation by shorter cycles; smooth- 
ness and the local instability of the flow implies exponential (or faster) fall-off for 
(almost) all curvatures. Cycle expansions offer an efficient method for evaluating 
classical and quantum observables. 

The critical step in the derivation of the dynamical zeta function was the hy- 
perbolicity assumption, i.e., the assumption of exponential shrinkage of all strips 
of the pinball repeller. By dropping the a- prefactors in (1.5), we have given up on 
any possibility of recovering the precise distribution of starting x (which should 
anyhow be impossible due to the exponential growth of errors), but in exchange 
we gain an effective description of the asymptotic behavior of the system. The 
pleasant surprise of cycle expansions (1.10) is that the infinite time behavior of an 
unstable system is as easy to determine as the short time behavior. 

To keep the exposition simple we have here illustrated the utility of cycles 
and their curvatures by a pinball game, but topics covered in ChaosBook - un- 
stable flows, Poincare sections, Smale horseshoes, symbolic dynamics, pruning, 
discrete symmetries, periodic orbits, averaging over chaotic sets, evolution oper- 
ators, dynamical zeta functions, spectral determinants, cycle expansions, quantum 
trace formulas, zeta functions, and so on to the semiclassical quantization of he- 
lium - should give the reader some confidence in the broad sway of the theory. 
The formalism should work for any average over any chaotic set which satisfies 
two conditions: 

1 . the weight associated with the observable under consideration is multiplica- 
tive along the trajectory, 

2. the set is organized in such a way that the nearby points in the symbolic 
dynamics have nearby weights. 

The theory is applicable to evaluation of a broad class of quantities characterizing 
chaotic systems, such as the escape rates, Lyapunov exponents, transport coeffi- 
cients and quantum eigenvalues. A big surprise is that the semi-classical quantum 
mechanics of systems classically chaotic is very much like the classical mechanics 
of chaotic systems; both are described by zeta functions and cycle expansions of 
the same form, with the same dependence on the topology of the classical flow. 
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But the power of instruction is seldom of much efficacy, 
except in those happy dispositions where it is almost su- 
perfluous. 

— Gibbon 



Commentary 

Remark 1.1 Nonlinear dynamics texts. This text aims to bridge the gap between the 
physics and mathematics dynamical systems literature. The intended audience is Hen- 
riette Roux, the perfect physics graduate student with a theoretical bent who does not 
believe anything he is told. As a complementary presentation we recommend Gaspard's 
monograph [ 8] which covers much of the same ground in a highly readable and scholarly 
manner. 

As far as the prerequisites are concerned-ChaosBook is not an introduction to non- 
linear dynamics. Nonlinear science requires a one semester basic course (advanced un- 
dergraduate or first year graduate). A good start is the textbook by Strogatz [ ], an in- 
troduction to the applied mathematician's visualization of flows, fixed points, manifolds, 
bifurcations. It is the most accessible introduction to nonlinear dynamics-a book on dif- 
ferential equations in nonlinear disguise, and its broadly chosen examples and many ex- 
ercises make it a favorite with students. It is not strong on chaos. There the textbook 
of Alligood, Sauer and Yorke [ 10] is preferable: an elegant introduction to maps, chaos, 
period doubling, symbolic dynamics, fractals, dimensions-a good companion to Chaos- 
Book. Introduction more comfortable to physicists is the textbook by Ott [11], with the 
baker's map used to illustrate many key techniques in analysis of chaotic systems. Ott is 
perhaps harder than the above two as first books on nonlinear dynamics. Sprott [ 12] and 
Jackson [13] textbooks are very useful compendia of the '70s and onward 'chaos' litera- 
ture which we, in the spirit of promises made in sect. 1.1, tend to pass over in silence. 

An introductory course should give students skills in qualitative and numerical anal- 
ysis of dynamical systems for short times (trajectories, fixed points, bifurcations) and 
familiarize them with Cantor sets and symbolic dynamics for chaotic systems. For the 
dynamical systems material covered here in chapters 2 to 4, as well as for the in-depth 
study of bifurcation theory we warmly recommend Kuznetsov [14]. A good introduction 
to numerical experimentation with physically realistic systems is Tufillaro, Abbott, and 
Reilly [15]. Korsch and Jodl [16] and Nusse and Yorke [17] also emphasize hands-on 
approach to dynamics. With this, and a graduate level-exposure to statistical mechan- 
ics, partial differential equations and quantum mechanics, the stage is set for any of the 
one-semester advanced courses based on ChaosBook. 

Remark 1.2 ChaosBook based courses. The courses taught so far (for a listing, 
consult ChaosBook.org/courses) start out with the introductory chapters on qualita- 
tive dynamics, symbolic dynamics and flows, and then continue in different directions: 

Deterministic chaos. Chaotic averaging, evolution operators, trace formulas, zeta func- 
tions, cycle expansions, Lyapunov exponents, billiards, transport coefficients, thermody- 
namic formalism, period doubling, renormalization operators. A graduate level introduc- 
tion to statistical mechanics from the dynamical point view is given by Dorfman [18]; the 
Gaspard monograph [8] covers the same ground in more depth. Driebe monograph [19] 
offers a nice introduction to the problem of irreversibility in dynamics. The role of 'chaos' 
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in statistical mechanics is critically dissected by Bricmont in his highly readable essay 
"Science of Chaos or Chaos in Science?" [20]. 

Spatiotemporal dynamical systems. Partial differential equations for dissipative sys- 
tems, weak amplitude expansions, normal forms, symmetries and bifurcations, pseu- 
dospectral methods, spatiotemporal chaos, turbulence. Holmes, Lumley and Berkooz [ 21] 
offer a delightful discussion of why the Kuramoto-Sivashinsky equation deserves study as 
a staging ground for a dynamical approach to study of turbulence in full-fledged Navier- 
Stokes boundary shear flows. 

Quantum chaos. Semiclassical propagators, density of states, trace formulas, semiclassi- 
cal spectral determinants, billiards, semiclassical helium, diffraction, creeping, tunneling, 
higher-order h corrections. For further reading on this topic, consult the quantum chaos 
part of ChaosBook . org. 

Remark 1.3 Periodic orbit theory. This book puts more emphasis on periodic orbit 
theory than any other current nonlinear dynamics textbook. The role of unstable periodic 
orbits was already fully appreciated by Poincare [22, 23], who noted that hidden in the 
apparent chaos is a rigid skeleton, a tree of cycles (periodic orbits) of increasing lengths 
and self-similar structure, and suggested that the cycles should be the key to chaotic dy- 
namics. Periodic orbits have been at core of much of the mathematical work on the theory 
of the classical and quantum dynamical systems ever since. We refer the reader to the 
reprint selection [24] for an overview of some of that literature. 

Remark 1.4 If you seek rigor? If you find ChaosBook not rigorous enough, 

you should turn to the mathematics literature. We give a short shrift to the theory of 
bifurcations, and the KAM (Kolmogorov-Arnol'd-Moser) tori make only a tangential ap- 
pearance. We recommend Robinson's advanced graduate level exposition of dynamical 
systems theory [25] from Smale perspective. The most extensive reference is the treatise 
by Katok and Hasselblatt [26], an impressive compendium of modern dynamical systems 
theory. The fundamental papers in this field, all still valuable reading, are Smale [27], 
Bowen [28] and Sinai [29]. Sinai's paper is prescient and offers a vision and a program 
that ties together dynamical systems and statistical mechanics. It is written for readers 
versed in statistical mechanics. For a dynamical systems exposition, consult Anosov and 
Sinai [ ]. Markov partitions were introduced by Sinai in ref. [31]. The classical text 
(though certainly not an easy read) on the subject of dynamical zeta functions is Ruelle's 
Statistical Mechanics, Thermodynamic Formalism [32]. In Ruelle's monograph transfer 
operator technique (or the 'Perron-Frobenius theory') and Smale's theory of hyperbolic 
flows are applied to zeta functions and correlation functions. The status of the theory from 
Ruelle's point of view is compactly summarized in his 1995 Pisa lectures [33]. Further 
excellent mathematical references on thermodynamic formalism are Parry and Pollicott's 
monograph [ -' • ] with emphasis on the symbolic dynamics aspects of the formalism, and 
Baladi's clear and compact reviews of the theory of dynamical zeta functions [ 35, 36]. 

Remark 1.5 If you seek magic? ChaosBook resolutely skirts number-theoretical 
magic such as spaces of constant negative curvature, Poincare tilings, modular domains, 
Selberg Zeta functions, Riemann hypothesis, . . . Why? While this beautiful mathematics 
has been very inspirational, especially in studies of quantum chaos, almost no powerful 



intro - 9apr2009 



ChaosBook.org versionl4.4. 1, Apr21 2013 



CHAPTER 1. OVERTURE 



32 



method in its repertoire survives a transplant to a physical system that you are likely to 
care about. 

Remark 1.6 Grasshoppers vs. butterflies. The 'sensitivity to initial conditions' 
was discussed by Maxwell, then 30 years later by Poincare. In weather prediction, the 
Lorenz' 'Butterfly Effect' started its journey in 1898, as a 'Grasshopper Effect' in a book 
review by W. S. Franklin [16]. In 1963 Lorenz ascribed a 'seagull effect' to an unnamed 
meteorologist, and in 1972 he repackaged it as the 'Butterfly Effect'. 

Remark 1.7 Sorry, no schmactals! ChaosBook skirts mathematics and empirical 
practice of fractal analysis, such as Hausdorff and fractal dimensions. Addison's intro- 
duction to fractal dimensions [37] offers a well-motivated entry into this field. While in 
studies of probabilistically assembled fractals such as diffusion limited aggregates (DLA) 
better measures of 'complexity' are lacking, for deterministic systems there are much 
better, physically motivated and experimentally measurable quantities (escape rates, dif- 
fusion coefficients, spectrum of helium, ...) that we focus on here. 

Remark 1.8 Rat brains? If you were wondering while reading this introduction 
'what's up with rat brains?', the answer is yes indeed, there is a line of research in neu- 
ronal dynamics that focuses on possible unstable periodic states, described for example in 
refs. [38, 39, 40,41]. 
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A guide to exercises 

God can afford to make mistakes. So can Dada! 
— Dadaist Manifesto 

The essence of this subject is incommunicable in print; the only way to develop 
intuition about chaotic dynamics is by computing, and the reader is urged to try to 
work through the essential exercises. As not to fragment the text, the exercises are 
indicated by text margin boxes such as the one on this margin, and collected at the 
end of each chapter. By the end of a (two-semester) course you should have com- 
pleted at least three small projects: (a) compute everything for a 1-dimensional 
repeller, (b) compute escape rate for a 3-disk game of pinball, (c) compute a part 
of the quantum 3-disk game of pinball, or the helium spectrum, or if you are 
interested in statistical rather than the quantum mechanics, compute a transport 
coefficient. The essential steps are: 

• Dynamics 

1. count prime cycles, exercise 1.1, exercise 9.6, exercise 11.1 

2. pinball simulator, exercise 8.1, exercise 13.4 

3. pinball stability, exercise 13.7, exercise 13.4 

4. pinball periodic orbits, exercise 13.5, exercise 13.6 

5. helium integrator, exercise 2.10, exercise 13.11 

6. helium periodic orbits, exercise 13.12 

• Averaging, numerical 

1. pinball escape rate, exercise 17.2 

• Averaging, periodic orbits 

1. cycle expansions, exercise 20.1, exercise 20.2 

2. pinball escape rate, exercise 20.4, exercise 20.5 

3. cycle expansions for averages, exercise 20. 1 , exercise 22.3 

4. cycle expansions for diffusion, exercise 25.1 

5. pruning, transition graphs, exercise 15.6 

6. desymmetrization exercise 21.1 

7. intermittency, phase transitions, exercise 24.6 

The exercises that you should do have underlined titles . The rest (smaller type) 

are optional. Difficult problems are marked by any number of *** stars. If you 
solve one of those, it is probably worth a publication. Solutions to some of the 
problems are available on ChaosBook . org. A clean solution, a pretty figure, or a 
nice exercise that you contribute to ChaosBook will be gratefully acknowledged. 
Often going through a solution is more instructive than reading the chapter that 
problem is supposed to illustrate. 
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Exercises 

1.1. 3-disk symbolic dynamics. As periodic trajectories 
will turn out to be our main tool to breach deep into 
the realm of chaos, it pays to start familiarizing oneself 
with them now by sketching and counting the few short- 
est prime cycles (we return to this in sect. 15.4). Show 
that the 3-disk pinball has 3 • 2 n_1 itineraries of length 
«. List periodic orbits of lengths 2, 3, 4, 5, ■ ■ ■. Verify 
that the shortest 3-disk prime cycles are 12, 13, 23, 123, 
132, 1213, 1232, 1323, 12123, • • -. Try to sketch them, 
(continued in exercise 12.6) 



1.2. Sensitivity to initial conditions. Assume that two pin- 
ball trajectories start out parallel, but separated by 1 
Angstrom, and the disks are of radius a = 1 cm and 
center-to-center separation R = 6 cm. Try to estimate 
in how many bounces the separation will grow to the 
size of system (assuming that the trajectories have been 
picked so they remain trapped for at least that long). Es- 
timate the Who's Pinball Wizard's typical score (num- 
ber of bounces) in a game without cheating, by hook or 
crook (by the end of chapter 20 you should be in position 
to make very accurate estimates). 



References 

[1.1] J. Guckenheimer, "Piecewise-Smooth Dynamical Systems (Book Review)" 
SIAMRev. 50, 606(2008). 

[1.2] G.W. Leibniz, Von dem Verhdngnisse, in Hauptschriften zur Grundlegung 
der Philosophie, Vol II, , pp. 129-134 (Ernst Cassirer, Leibzig 1906). 

[1.3] T. Li and J. Yorke, "Period 3 implies chaos," Amer. Math. Monthly 82, 985 
(1975). 

[1.4] R. P. Taylor, A. P. Micolich and D. Jonas, "Fractal analysis of Pollock's drip 
paintings," Nature 399, 422 (1999). 

[1.5] K. Jones-Smith and H. Mathur, "Fractal Analysis: Revisiting Pollock's drip 
paintings," Nature 444, E9 (2006); R. P. Taylor, A. P. Micolich and D. Jonas, 
"Fractal Analysis: Revisiting Pollock's drip paintings (Reply)," Nature 444, 
E10 (2006). 

[1.6] R. Kennedy, "The Case of Pollock's Fractals Focuses on Physics," New York 
Times (Dec. 2, 2006). 

[1.7] D. Avnir, O. Biham, D. Lidar and O. Malcai, "Is the Geometry of Nature 
Fractal?," Science 279, 39 (1998). 

[1.8] P. Gaspard, Chaos, Scattering and Statistical Mechanics (Cambridge U. 
Press, Cambridge 1998). 

[1.9] S.H. Strogatz, Nonlinear Dynamics and Chaos (Addison- Wesley, Reading 
1994). 

[1.10] K.T. Alligood, T.D. Sauer and J.A. Yorke, Chaos, an Introduction to Dy- 
namical Systems (Springer, New York 1996) 



refslntro - 6mar2009 



ChaosBook.org vcrsionI4.4.1, Apr 21 2013 



References 



35 



[1.11] E. Ott, Chaos in Dynamical Systems (Cambridge U. Press, Cambridge 
1993). 

[1.12] J. C. Sprott, Chaos and Time-Series Analysis (Oxford Univ. Press, Oxford 
2003) 

[1.13] E. Atlee Jackson, Perspectives of nonlinear dynamics (Cambridge U. 
Press, Cambridge 1989). 

[1.14] Y. A. Kuznetsov, Elements of Applied Bifurcation Theory, (Springer, New 
York 2004). 

[1.15] N.B. Tufillaro, T.A. Abbott, and J.P Reilly, Experimental Approach to 
Nonlinear Dynamics and Chaos (Addison Wesley, Reading MA 1992). 

[1.16] H.J. Korsch and H.-J. Jodl, Chaos. A Program Collection for the PC, 
(Springer, New York 1994). 

[1.17] H.E. Nusse and J.A. Yorke, Dynamics: Numerical Explorations 
(Springer, New York 1997). 

[1.18] J. R. Dorfman, An Introduction to Chaos in Nonequilibrium Statistical Me- 
chanics (Cambridge U. Press, Cambridge 1999). 

[1.19] D.J. Driebe, Fully Chaotic Map and Broken Time Symmetry (Kluwer, Dor- 
drecht 1999). 

[1.20] J. Bricmont, 'Science of Chaos or Chaos in Science?," in: The Flight from 
Science and Reason, PR. Gross, N. Levitt, and M.W. Lewis, eds., Annals of 
the New York Academy of Sciences 775; mp_arc 96- 1 16 . ps . gz 

[1.21] P. Holmes, J.L. Lumley and G. Berkooz, Turbulence, Coherent Structures, 
Dynamical Systems and Symmetry (Cambridge U. Press, Cambridge 1996). 

[1.22] H. Poincare, Les methodes nouvelles de la mechanique celeste (Guthier- 
Villars, Paris 1892-99) 

[1.23] For a very readable exposition of Poincare's work and the development of 
the dynamical systems theory see J. Barrow-Green, Poincare and the Three 
Body Problem, (Amer. Math. Soc, Providence RI 1997), and F. Diacu and P. 
Holmes, Celestial Encounters, The Origins of Chaos and Stability (Princeton 
Univ. Press, Princeton NJ 1996). 

[1.24] R.S. MacKay and J.D. Meiss, Hamiltonian Dynamical Systems (Adam 
Hilger, Bristol 1987). 

[1.25] C. Robinson, Dynamical Systems: Stability, Symbolic Dynamics, and 
Chaos (C. R. C. Press, Boca Raton 1999) 

[1.26] A. Katok and B. Hasselblatt, Introduction to the Modern Theory of Dy- 
namical Systems (Cambridge U. Press, Cambridge 1995). 

[1.27] S. Smale, "Differentiable Dynamical Systems," Bull. Am. Math. Soc. 73, 
747 (1967). 



refslntro - 6mar2009 



ChaosBook.org vcrsionI4.4.1, Apr 21 2013 



References 



36 



[1.28] R. Bowen, Equilibrium states and the ergodic theory of Anosov diffeomor- 
phisms, Springer Lecture Notes in Math. 470 (1975). 

[1.29] Ya.G. Sinai, "Gibbs measures in ergodic theory," Russ. Math. Surveys 166, 
21 (1972). 

[1.30] D.V. Anosov and Ya.G. Sinai, "Some smooth ergodic systems," Russ. 
Math. Surveys 22, 103 (1967). 

[1.31] Ya.G. Sinai, "Construction of Markov partitions," Funkts. Analiz i Ego 
Pril. 2, 70 (1968). English translation: Functional Anal. Appl. 2, 245 (1968). 

[1.32] D. Ruelle, Statistical Mechanics, Thermodynamic Formalism, (Addison- 
Wesley, Reading MA 1978). 

[1.33] D. Ruelle, "Functional determinants related to dynamical systems 
and the thermodynamic formalism," (Lezioni Fermiane, Pisa), preprint 
IHES/P/95/30 (March 1995). 

[1.34] W. Parry and M. Pollicott, Zeta Functions and the Periodic Structure 
of Hyperbolic Dynamics, Asterisque 187-188 (Societe Mathematique de 
France, Paris 1990). 

[1.35] V. Baladi, "Dynamical zeta functions," in B. Branner and P. Hjorth, eds., 
Real and Complex Dynamical Systems (Kluwer, Dordrecht 1995). 

[1.36] V. Baladi, Positive Transfer Operators and Decay of Correlations (World 
Scientific, Singapore 2000). 

[1.37] PS. Addison Fractals and chaos: an illustrated course, (Inst, of Physics 
Publishing, Bristol 1997). 

[1.38] S.J. Schiff, et al. "Controlling chaos in the brain," Nature 370, 615 (1994). 

[1.39] F. Moss, "Chaos under control," Nature 370, 596 (1994). 

[1.40] J. Glanz, "Do chaos-control techniques offer hope for epilepsy?" Science 
265, 1174 (1994). 

[1.41] J. Glanz, "Mastering the Nonlinear Brain," Science 227, 1758 (1997). 

[1.42] Henry Miller, Tropic of Cancer (Obelisk Press, Paris 1939; Grove Press, 
New York 1961). 



refslntro- 6mar2009 



ChaosBook.org vcrsionI4.4.1, Apr 21 2013 



Chapter 2 



Go with the flow 



Dynamical systems theory includes an extensive body of 
knowledge about qualitative properties of generic smooth 
families of vector fields and discrete maps. The theory 
characterizes structurally stable invariant sets [...] The 
logic of dynamical systems theory is subtle. The theory 
abandons the goal of describing the qualitative dynamics 
of all systems as hopeless and instead restricts its atten- 
tion to phenomena that are found in selected systems. The 
subtlety comes in specifying the systems of interest and 
which dynamical phenomena are to be analyzed. 

— John Guckenheimer 

(R. Mainieri, P. Cvitanovic and E.A. Spiegel) 



We define a dynamical system (M, f) and classify its solutions as equilibria, 
periodic, and aperiodic. An 'aperiodic' solution is either 'wandering' or 
belongs to a non-wandering set, which in turn can be decomposed into 
into chain-recurrent sets. Various cases are illustrated with concrete examples, 
such as the Rossler and Lorenz systems. 




fast track: 
chapter 16, p. 327 



2.1 Dynamical systems 



In a dynamical system we observe the world as it evolves with time. We express 
our observations as numbers and record how they change; given sufficiently de- 
tailed information and understanding of the underlying natural laws, we see the 
future in the present as in a mirror. The motion of the planets against the celestial 
firmament provides an example. Against the daily motion of the stars from East 
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f'(x) 



Figure 2.1: A trajectory traced out by the evolution 
rule /'. Starting from the state space point x, after a 
time t, the point is at f'(x). 




to West, the planets distinguish themselves by moving among the fixed stars. An- 
cients discovered that by knowing a sequence of planet's positions-latitudes and 
longitudes-its future position could be predicted. 

For the solar system, tracking the latitude and longitude in the celestial sphere 
suffices to completely specify the planet's apparent motion. All possible values for 
positions and velocities of the planets form the phase space of the system. More 
generally, a state of a physical system, at a given instant in time, can be represented 
by a single point in an abstract space called state space Af (mnemonic: curly 'At' 
for a 'manifold'). As the system changes, so does the representative point in state 
space. We refer to the evolution of such points as dynamics, and the function / 
which specifies where the representative point is at time t as the evolution rule. 

If there is a definite rule / that tells us how this representative point moves in 
At, the system is said to be deterministic. For a deterministic dynamical system, 
the evolution rule takes one point of the state space and maps it into exactly one 
point. However, this is not always possible. For example, knowing the tempera- 
ture today is not enough to predict the temperature tomorrow; knowing the value 
of a stock today will not determine its value tomorrow. The state space can be en- 
larged, in the hope that in a sufficiently large state space it is possible to determine 
an evolution rule, so we imagine that knowing the state of the atmosphere, mea- 
sured over many points over the entire planet should be sufficient to determine the 
temperature tomorrow. Even that is not quite true, and we are less hopeful when 
it comes to stocks. 

For a deterministic system almost every point has a unique future, so trajecto- 
ries cannot intersect. We say 'almost' because there might exist a set of measure 
zero (tips of wedges, cusps, etc.) for which a trajectory is not defined. We may 
think such sets a nuisance, but it is quite the contrary-they will enable us to parti- 
tion state space, so that the dynamics can be better understood. 

Locally, the state space Af looks like R d , meaning that a dynamical evolu- 
tion is an initial value problem, with d numbers sufficient to determine what will 
happen time t later. The local Euclidean structure at any given state space point 
x e Af is described by a 'chart'. Globally, the state space may be a more compli- 
cated manifold such as a torus, a cylinder, or some other smooth geometric object. 
By manifold we mean a smooth differentiable J-dimensional space which looks 
like Ef 1 only locally. For example, the state space of an autonomous Hamiltonian 
system the flow is confined to a constant energy hyper-surface. When we need to 
stress that the dimension d of A1 is greater than one, we may refer to the point 
x e Af as X{ where i = 1,2, 3, . . . ,d. If the dynamics is described by a set of PDEs 
(partial differential equations), the state space is the infinite dimensional function 
space. The evolution rule f : At — * At tells us where a point x is in At after a 
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Figure 2.2: The evolution rule /'can be used to map a 
region At; of the state space into the region /'(At;). 




time interval t. 



The pair (At, /) constitute a dynamical system. 



The dynamical systems we will be studying are smooth. This is expressed 
mathematically by saying that the evolution rule / can be differentiated as many 
times as needed. Its action on a point x is sometimes indicated by f{x, t) to remind 
us that / is really a function of two variables: the time and a point in state space. 
Note that time is relative rather than absolute, so only the time interval is neces- 
sary. This follows from the fact that a point in state space completely determines 
all future evolution, and it is not necessary to know anything besides the time 
interval. The time parameter can be a real variable (? € R), in which case the evo- 
lution is called a flow, or an integer (t e Z), in which case the evolution advances 
in discrete steps in time, given by iteration of a map. The evolution parameter 
need not be the physical time; for example, a time-stationary solution of a partial 
differential equation is parameterized by spatial variables. In such situations one 
talks of a 'spatial profile' rather than a 'flow'. 

Nature provides us with innumerable dynamical systems. They manifest them- 
selves through their trajectories: given a state xq at initial time to, the flow map 

f : x 0 -> x(x 0 , t) 

yields the state x{t) time t later. This evolution rule traces out a sequence of 
points x(t) = f(xo), the trajectory through the point xq = x{0). We shall usually 
omit the xq label from x{xq, t). By extension, we can also talk of the evolution 
of a region At; of the state space. The language of continuum mechanics is quite 
helpful in visualizing such deformations, not only in 3-dimensional space, but also 
in state spaces of arbitrary dimension. Consider a motion / from the undeformed 
{reference or initial) region (a 'body') A\ to the deformed {current or final) region 
At/ = f'{Mi). We may write the motion as a map 



such that every xq in At; is mapped to an x = f(xo) in At/, as in figure 2.2, where 
x denotes the state in the deformed region, and xq represents the state in the initial, 



f: Mi^Mf, 



(2.1) 



undeformed region. 



exercise 2.1 
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Figure 2.3: A periodic point returns to the initial point 
after a finite time, x = f T ''(.x). Periodic orbit p is the 
set of periodic points p = M p = {x\ , x-i, ■ ■ •} swept out 
by the trajectory of any one of them in the finite time 
T 



The subset of points M Xo c M that belong to the infinite-time trajectory 
of a given point xq is called the orbit of xq\ we shall talk about forward orbits, 
backward orbits, periodic orbits, etc.. For a flow, an orbit is a smooth continuous 
curve; for a map, it is a sequence of points. 'Trajectory' refers to a set of points or 
a curve segment traced out by x(t) up to time instant t. 'Orbit' refers to the totality 
of states that can be reached from xq, with state space M foliated into a union of 
such orbits (each M Xo labeled by a single point belonging to the set, x® = x(0) 
for example). Under time evolution a trajectory segment is mapped into another 
trajectory segment, but points within an orbit are only shifted; the orbit considered 
as a set is unchanged. Hence an orbit is a dynamically invariant notion. 

The central idea of ChaosBook is to replace the complicated, ergodic, asymp- 
totic t — > co dynamics by a systematic hierarchy of compact time-invariant sets or 
compact orbits (equilibria, periodic orbits, invariant tori, • • •)■ 

2.1.1 A classification of possible motions? 

What kinds of orbits are there? This is a grand question, and there are many 
answers. The following chapters offer some. Here is a first attempt to classify all 
possible orbits: 

stationary: f{x) = x for all t 

periodic: f'(x) - f t+T i'(x) for a given minimum period T p 
aperiodic: f(x) + f\x) for all t + f . 

A periodic orbit (or a cycle) p is the set of points A\, c M swept out by a 
trajectory that returns to the initial point in a finite time. We refer to a point on a 
periodic orbit as a periodic point, see figure 2.3. Periodic orbits form a very small 
subset of the state space, in the same sense that rational numbers are a set of zero 
measure on the unit interval. 

chapter 5 

Periodic orbits and equilibrium points are the simplest examples of 'non- 
wandering' invariant sets preserved by dynamics. Dynamics can also preserve 
higher-dimensional smooth compact invariant manifolds; most commonly en- 
countered are the M-dimensional tori of Hamiltonian dynamics, with the notion of 
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periodic motion generalized to quasiperiodic (the superposition of M incommen- 
surate frequencies) motion on a smooth torus, and families of solutions related 
by a continuous symmetry. Further examples are afforded by stable / unstable 
manifolds (a semi-infinite curve originating at an equilibrium along each stabil- 
ity eigenvector) and the most mysterious of all invariant orbits, the infinite time 
ergodic orbits. 

The ancients tried to make sense of all dynamics in terms of periodic motions, 
epicycles, what we today call 'integrable systems'. The embarrassing truth is that 
for a generic dynamical system almost all motions are aperiodic. So we refine the 
classification by dividing aperiodic motions into two subtypes: those that wander 
off, and those that keep coming back. 

A point x € Mis called a wandering point, if there exists an open neighbor- 
hood Mo of x to which the trajectory never returns 

f(x) $ Mo for all t>t min . (2.2) 

In physics literature, the dynamics of such a state is often referred to as transient. 

Wandering points do not take part in the long-time dynamics, so your first task 
is to prune them from M as well as you can. What remains envelops the set of the 
long-time trajectories, or the non-wandering set. 

For times much longer than a typical 'turnover' time, it makes sense to relax 
the notion of exact periodicity and replace it by the notion of recurrence. A point 
is recurrent or non-wandering, if for any open neighborhood M of x and any 
time t, mn there exists a later time t, such that 

fix) e Mo . (2.3) 

In other words, the trajectory of a non-wandering point reenters the neighborhood 
Mo infinitely often. We shall denote the non-wandering set of / by £1, i.e., the 
union of all the non-wandering points of M. This non-wandering set of / is key 
to understanding the long-time behavior of a dynamical system; all calculations 
undertaken here will be carried out on non-wandering sets. 

So much about individual trajectories. What about clouds of initial points? If 
there exists a connected state space volume that maps into itself under forward 
evolution (and you can prove that by the method of Lyapunov functionals, or 
several other methods available in the literature), the flow is globally contracting 
onto a subset of M which we shall refer to as the attractor. The attractor may 
be unique, or there can coexist any number of distinct attracting sets, each with 
its own basin of attraction, the set of all points that fall into the attractor under 
forward evolution. The attractor can be a fixed point (a sink), a periodic orbit 
(a limit cycle), aperiodic, or any combination of the above. The most interesting 
case is that of an aperiodic recurrent attractor, to which we shall refer loosely as a 



section 12.1 
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strange attractor. We say 'loosely', as will soon become apparent that diagnosing 

... r . . . • i • 1 1 example 2.3 

and proving existence of a genuine, card-carrying strange attractor is a highly 

nontrivial undertaking; it requires explaining notions like "transitive" and "chain- 
recurrent" that we will be ready to discuss only in sect. 14.1. 

Conversely, if we can enclose the non-wandering set Q by a connected state 
space volume A\q and then show that almost all points within Afc, but not in Q, 
eventually exit Ato, we refer to the non-wandering set Q. as a repeller. An example 
of a repeller is not hard to come by-the pinball game of sect. 1 .3 is a simple chaotic 
repeller. Q, the non-wandering set of /, is the union of all of the above, separately 
invariant sets: attracting/repelling fixed points, strange attractors, repellers, etc.. 

It would seem, having said that the periodic points are so exceptional that 
almost all non-wandering points are aperiodic, that we have given up the ancients' 
fixation on periodic motions. Nothing could be further from truth. As longer and 
longer cycles approximate more and more accurately finite segments of aperiodic 
trajectories, we shall establish control over non-wandering sets by defining them 
as the closure of the union of all periodic points. 

Before we can work out an example of a non-wandering set and get a better 
grip on what chaotic motion might look like, we need to ponder flows in a little 
more depth. 



2.2 Flows 

Knowing the equations and knowing the solution are two 
different things. Far, far away. 
— T.D. Lee 

A flow is a continuous-time dynamical system. The evolution rule / is a family 
of mappings of At — » M parameterized by t e R. Because t represents a time 
interval, any family of mappings that forms an evolution rule must satisfy: 

(a) f°(x) = x (in 0 time there is no motion) 

(b) f\f '(*)) - f +t (x) (the evolution law is the same at all times) 

(c) the mapping (x, t) i-> f(x) from M x R into At is continuous. 



exercise 2.2 



We shall often find it convenient to represent functional composition by 'o :' 

appendix 1.1 

f +s =fof =f\f). (2.4) 



The family of mappings f(x) thus forms a continuous (forward semi-) group. 
Why 'semi-'group? It may fail to form a group if the dynamics is not reversible, 
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and the rule f{x) cannot be used to rerun the dynamics backwards in time, with 
negative t; with no reversibility, we cannot define the inverse /"'(fix)) = f°(x) = 
x , in which case the family of mappings f(x) does not form a group. In ex- 
ceedingly many situations of interest-for times beyond the Lyapunov time, for 
asymptotic attractors, for dissipative partial differential equations, for systems 
with noise, for non-invertible maps-the dynamics cannot be run backwards in 
time, hence, the circumspect emphasis on semigroups. On the other hand, there 
are many settings of physical interest, where dynamics is reversible (such as finite- 
dimensional Hamiltonian flows), and where the family of evolution maps / does 
form a group. 

For infinitesimal times, flows can be defined by differential equations. We 
write a trajectory, a smooth curve embedded in the state space as 



x(t + T)=f +T (x 0 )=f(f(x {h t),T) 



(2.5) 



and express the tangent to the curve at point x(t) as 



exercise 2.3 



dx 

j7 



r=0 



d T f(f(x 0 ,t),T)\ T=Q = m, 



(2.6) 



the time derivative of the evolution rule, a vector evaluated at the point x(t). By 
considering all possible trajectories, we obtain the vector x(t) at any point x e At. 
This vector field is a (generalized) velocity field: 



remark 10.3 



x(t) = v(x) . 



(2.7) 



Newton's laws, Lagrange's method, or Hamilton's method are all familiar pro- 
cedures for obtaining a set of differential equations for the vector field v(x) that 
describes the evolution of a mechanical system. Equations of mechanics may ap- 
pear different in form from (2.7), as they are often involve higher time derivatives, 
but an equation that is second or higher order in time can always be rewritten as a 
set of first order equations. 

We are concerned here with a much larger world of general flows, mechanical 
or not, all defined by a time-independent vector field (2.7). At each point of the 
state space a vector indicates the local direction in which the trajectory evolves. 
The length of the vector |v(x)| is proportional to the speed at the point x, and the 
direction and length of v(x) changes from point to point. When the state space is a 
complicated manifold embedded in R d , one can no longer think of the vector field 
as being embedded in the state space. Instead, we have to imagine that each point 
x of state space has a different tangent plane TM X attached to it. The vector field 
lives in the union of all these tangent planes, a space called the tangent bundle 

TM = |J TM X . 

xeM 
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Figure 2.4: (a) The 2-dimensional vector field for 
the Duffing system (2.8), together with a short tra- 
jectory segment, (b) The flow lines. Each 'comet' 
represents the same time interval of a trajectory, 
starting at the tail and ending at the head. The 
longer the comet, the faster the flow in that region. 



(a) 





(b) 



TM X is called a fibre at x, hence the whole thing is called the, fibre bundle. Locally 
a fibre bundle looks like the product of two spaces. Relax: we'll do our best 
not to use such words again. 



Example 2.1 A 2-dimensional vector Held v(x): 

afforded by the unforced Duffing system 



A simple example of a flow is 



i(f) = y(t) 

y(t) = -0.15 y(t) + x(t)-x(t) 3 



(2.8) 



plotted in figure 2.4. The velocity vectors are drawn superimposed over the configura- 
tion coordinates (x(t),y(t)) of state space M, but they belong to a different space, the 
tangent bundle TA-f. 



The instantaneous velocity vector v is tangent to the trajectory, except at the 
equilibrium points, where it vanishes. 



If v(jc 9 ) = 0, 



(2.9) 



x q is also referred to as a stationary, fixed, critical, invariant, rest, stagnation 
point, zero of the vector field v, standing wave, stationary solution, or steady 
state. Our usage will be 'equilibrium' for a flow, 'fixed point' for a map. The 
trajectory remains forever stuck at x q . Otherwise the trajectory passing through 
xq at time t = 0 can be obtained by integrating the equations (2.7): 



x(t) - f\xo) - Xq + I dTv(x(T)) , x(0) = Xq . 

Jo 



(2.10) 



We shall consider here only autonomous flows, i.e., flows for which the velocity 
field v/ is stationary, not explicitly dependent on time. A non-autonomous system 



— = w(y, t) , 
dr 



(2.11) 



can always be converted into a system where time does not appear explicitly. 



exercise 2.4 
exercise 2.5 
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Figure 2.5: Lorenz "butterfly" strange attractor. (J. 
Halcrow) 
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To do so, extend ('suspend') state space to be (d + l)-dimensional by defining 
x = {y, t}, with a stationary vector field 



v{x) 



w(y,r) 
1 



(2.12) 



The new flow x = v(x) is autonomous, and the trajectory y(r) can be read off x(t) 
by ignoring the last component of x. 

Example 2.2 Lorenz strange attractor: Edward Lorenz arrived at the equation 





X 




cr(y - x) 




x — v(x) = 


y 




px-y - xz 


(2.13) 




z 




xy -bz 





by a drastic simplification of the Rayleigh-Benard flow. Lorenz fixed cr = 10, b = 8/3, 
and varied the "Rayleigh number" p. For 0 < p < 1 the equilibrium EQ 0 = (0, 0, 0) at the 
origin is attractive. At p = 1 it undergoes a pitchfork bifurcation into a pair of equilibria 
at 



X EQl2 = (+ yjb(p- 1), + Vft(p-l) tP - 1) , 



remark 2.3 



(2.14) 



We shall not explore the Lorenz flow dependence on the p parameter in what follows, 
but here is a brief synopsis: the EQo l-dimensional unstable manifold closes into a 
homoclinic orbit at p - 13.56 . . .. Beyond that, an infinity of associated periodic orbits 
are generated, until p = 24.74 . . ., where EQ U2 undergo a Hopf bifurcation. 

All computations that follow will be performed for the Lorenz parameter choice 
cr = 10, b = 8/3, p = 28 . For these parameter values the long-time dynamics is confined 
to the strange attractor depicted in figure 2.5, and the positions of its equilibria are 
marked in figure 9.5. (continued in example 3.4) 



Example 2.3 Rossler strange attractor: The Duffing flow of figure 2.4 is bit of 
a bore-every trajectory ends up in one of the two attractive equilibrium points. Let's 
construct a flow that does not die out, but exhibits a recurrent dynamics. Start with a 
harmonic oscillator 



x = -y, 



y — x . 



(2.15) 
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Figure 2.6: A trajectory of the Rossler flow at time 
t = 250. (G. Simon) 




The solutions are re", re " , and the whole x-y plane rotates with constant angular 
velocity 0=1, period T = 2n. Now make the system unstable by adding 

x = —y , y = x + ay , a>0, (2.16) 

or, in radial coordinates, ir = ar sin 2 0,0=1 + (a/2) sin 20. The plane is still rotating with 
the same average angular velocity, but trajectories are now spiraling out. Any flow in 
the plane either escapes, falls into an attracting equilibrium point, or converges to a limit 
cycle. Richer dynamics requires at least one more dimension. In order to prevent the 
trajectory from escaping to oo, kick it into 3rd dimension when x reaches some value c 
by adding 

z = b + z(x-c), c>Q. (2.17) 

As x crosses c, z shoots upwards exponentially, z - e (x ~ c)t . In order to bring it back, 
start decreasing x by modifying its equation to 

x = —y — z ■ 

Large z drives the trajectory toward x = 0; there the exponential contraction by e~ c ' 
kicks in, and the trajectory drops back toward the x-y plane. This frequently studied 
example of an autonomous flow is called the Rossler flow 

x = -y-z 
y = x + ay 

z = b + z(x-c), a = b = 0.2, c = 5.7 (2.18) 

(for definitiveness, we fix the parameters a, b, c in what follows). The system is as 
simple as they get-it would be linear, were it not for the sole bilinear term zx. Even for 
so 'simple' a system the nature of long-time solutions is far from obvious. 

There are two repelling equilibrium points (2.9): 

x± = (— + — \/l — 4ab/c 2 )(c, —c/a, cja) 

X- ~ (ab/c,—b/c,b/c) , x + « (c,—c/a,c/a) 

(x_,y_,z_) = (0.0070,-0.0351,0.0351 ) 

(x+,y+,z + ) = (5.6929,-28.464,28.464) (2.19) 

One is close to the origin by construction. The other, some distance away, exists be- 
cause the equilibrium condition has a 2nd-order nonlinearity. 
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To see what solutions look like in general, we need to resort to numerical in- 
tegration. A typical numerically integrated long-time trajectory is sketched in figure 2.6 
(see also figure 11.10). Trajectories that start out sufficiently close to the origin seem 

to converge to a strange attractor. We say 'seem' as there exists no proof that such 

©x©rci c 

an attractor is asymptotically aperiodic-it might well be that what we see is but a long 
transient on a way to an attractive periodic orbit. For now, accept that figure 2.6 and 
similar figures in what follows are examples of 'strange attractors.' (continued in 
exercise 2.8 and example 3.3) (R. Paskauskas) 

The Rossler flow is the simplest flow which exhibits many of the key aspects 
of chaotic dynamics; we shall use it and the 3-pinball (see chapter 8) systems 
throughout ChaosBook to motivate introduction of Poincare sections, return maps, 
symbolic dynamics, cycle expansions, and much else. Rossler flow is integrated 
in exercise 2.7, its equilibria are determined in exercise 2.8, its Poincare sections 
constructed in exercise 3.1, and the corresponding return Poincare map computed 
in exercise 3.2. Its volume contraction rate is computed in exercise 4.3, its topol- 
ogy investigated in exercise 4.4, the shortest Rossler flow cycles are computed and 
tabulated in exercise 13.10, and its Lyapunov exponents evaluated in exercise 6. 5. 



2.2.1 Lagrangian and Eulerian viewpoints 

Continuum mechanics offers two profoundly different but mathematically equiva- 
lent ways to represent a given state space flow, the 'Lagrangian' and the 'Eulerian' 
viewpoints. From the Eulerian perspective one only cares about what is the state 
of system here and now; think of a field of grass, each grass blade the local ve- 
locity vector. From the Lagrangian viewpoint one cares about where a state space 
point come fromand where is it going to; think of the state space foliated into a 
bowl of linguini, each noodle an orbit, marked with a label xq somewhere along 
it. In the Eulerian formulation the flow is defined by specifying (2.7), the velocity 
field v(x). In the Lagrangian formulation it is given by the finite time flow (2.10), 
i.e., the totality of the trajectories x(t) comprising the deformed region, labeled by 
their origin xq in the initial undeformed region. If we mark the trajectory x(t) by 
its initial point xq, we are describing the flow in the Lagrangian coordinates. The 
Eulerian velocity v(x) at a fixed state space position x is equal to the Lagrangian 
velocity v(x(t)) at the trajectory passing through x at the instant t. Because / is 
a single-valued function, any point on the trajectory can be used to label the tra- 
jectory. The transport of the 'material point' at t = 0 to its value at the current 
point x(t) = /'(xo) is a coordinate transformation from the Lagrangian coordinates 
to the Eulerian coordinates. 

In numerical work we are given the equations of motion (the local Eulerian 
velocity field v(x)), but we care about the solutions of these equations (the global 
Lagrangian flow). Conversely, in experimental work we observe ensembles of 
Lagrangian trajectories from which we then extract the velocity field (in fluid 
dynamics this is achieved by particle image velocimetry (PIV)). Once an Eulerian 
velocity field has been specified or extracted from the observational data, it is 
straightforward to compute the Lagrangian trajectories, objects of great practical 
interest in studies of long time dynamics, mixing, and transport. 
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fast track: 



chapter 3, p. 57 



2.3 Changing coordinates 

Problems are handed down to us in many shapes and forms, and they are not al- 
ways expressed in the most convenient way. In order to simplify a given problem, 
one may stretch, rotate, bend and mix the coordinates, but in doing so, the vector 
field will also change. The vector field lives in a (hyper)plane tangent to state 
space and changing the coordinates of state space affects the coordinates of the 
tangent space as well, in a way that we will now describe. 

Denote by h the conjugation function which maps the coordinates of the initial 
state space At into the reparameterized state space Af = h(M), with a point 
x € At related to a point y e Af by 

y = h(x) = (yi(x),y 2 (x), . . . ,y<t(x)) . 

The change of coordinates must be one-to-one and span both Af and At, so given 
any point y we can go back to x = h~ x (y). For smooth flows the reparameterized 
dynamics should support the same number of derivatives as the initial one. If h is 
a (piecewise) analytic function, we refer to h as a smooth conjugacy. 

The evolution rule g t (yo) on At' can be computed from the evolution rule 
/'(jcrj) on At by taking the initial point yo e At', going back to At, evolving, and 
then mapping the final point x(t) back to Af : 



Here V stands for functional composition h o f(x) = h(f(x)), so 0.20) is a 
shorthand for y(t) = h(f(h- l (y 0 ))). 

The vector field x = v(x) is locally tangent to the flow f; it is related to the 
flow by differentiation (2.6) along the trajectory. The vector field y = w(y), y e Af 
locally tangent to g 1 , follows by the chain rule: 



In order to rewrite the right-hand side as a function of y, note that the dy differen- 
tiation of h{hT l (y)) = y implies 



y(t) = g t (y 0 ) = hofoh- l (y Q ). 



(2.20) 




= h'{h- l (y))v(hr l (y)) = h\x)v(x). 



(2.21) 



dh dh~ l 



= 1 




dh~ l 



60 



(2.22) 




y 



dy 
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so the equations of motion in the transformed coordinates, with the indices rein- 
stated, are 



ji = W((y) 



dh 



-l 



dy 



"00 



v/ZT'OO). (2.23) 



Imagine the state space as a rubber sheet with the flow lines drawn on it. 
A coordinate change h corresponds to pulling and tugging on the rubber sheet 
smoothly, without cutting, gluing, or self-intersections of the distorted rubber 
sheet. Trajectories that are closed loops in M will remain closed loops in the 
new manifold M' , but their shapes will change. Globally, h deforms the rubber 
sheet in a highly nonlinear manner, but locally it simply rescales and shears the 
tangent field by the Jacobian matrix djhi, yielding the simple transformation law 
(2.21) for the velocity fields. 

Time itself is a parametrization of points along flow lines, and it can also 
be reparameterized, s = s(t), with the concomitant modification of (2.23). An 
example is the 2-body collision regularization of the helium Hamiltonian (7.8), to 
be undertaken in appendix B. 2 below. 



2.4 Computing trajectories 



On two occasions I have been asked [by members of Par- 
liament], 'Pray, Mr. Babbage, if you put into the machine 
wrong figures, will the right answers come out?' I am not 
able rightly to apprehend the kind of confusion of ideas 
that could provoke such a question. 
— Charles Babbage 

You have not learned dynamics unless you know how to integrate numerically 
whatever dynamical equations you face. Sooner or later, you need to implement 
some finite time-step prescription for integration of the equations of motion (2.7). 
The simplest is the Euler integrator which advances the trajectory by St X velocity 
at each time step: 

Xf — » xi + V{(x) 8t . (2.24) 

This might suffice to get you started, but as soon as you need higher numerical ac- 
curacy, you will need something better. There are many excellent reference texts 
and computer programs that can help you learn how to solve differential equations 
numerically using sophisticated numerical tools, such as pseudo-spectral methods 
or implicit methods. If a 'sophisticated' integration routine takes days and gob- 
bles up terabits of memory, you are using brain-damaged high level software. Try 
writing a few lines of your own Runge-Kutta code in some mundane everyday 
language. While you absolutely need to master the requisite numerical methods, 



exercise 



exercise 
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this is neither the time nor the place to expound upon them; how you learn them is 

your business. And if you have developed some nice routines for solving problems exercjse 2 g 

in this text or can point another student to some, let us know. ' . „' 

exercise 2.10 



Resume 



Chaotic dynamics with a low-dimensional attractor can be visualized as a suc- 
cession of nearly periodic but unstable motions. In the same spirit, turbulence in 
spatially extended systems can be described in terms of recurrent spatiotemporal 
patterns. Pictorially, dynamics drives a given spatially extended system through 
a repertoire of unstable patterns; as we watch a turbulent system evolve, every so 
often we catch a glimpse of a familiar pattern. For any finite spatial resolution 
and finite time, the system follows approximately a pattern belonging to a finite 
repertoire of possible patterns. The long-term dynamics can be thought of as a 
walk through the space of such patterns. Recasting this image into mathematics is 
the subject of this book. 



Commentary 



Remark 2.1 'State space' or 'phase space?' In ChaosBook, state space is the set 
of admissible states in a general d- or oo -dimensional dynamical system. The term phase 
space is reserved for Hamiltonian state spaces of 2D-dimensions, where D is the number 
of Hamiltonian degrees of freedom. If the state space is a continuous smooth manifold 
much of the literature [?, 1 2] refers to it as 'phase space,' but we find the control engi- 
neering usage sharper: in the state space (or 'time-domain') description of an autonomous 
physical system, the state of the system is represented as a vector within the 'state space,' 
space whose axes are the state variables, and the evolution of a state is given by differ- 
ential equations which are first-order in time. Hopf [11] would refer to such a state as 
an 'instantaneous phase' of the system obeying a 'differential law of the phase motion' . 
The distinction made here is needed in a text where one treats deterministic dynami- 
cal systems, stochastic systems and quantum-mechanical systems. The term 'phase' has 
a precise meaning in wave mechanics, quantum mechanics and dynamics of integrable 
systems at the heart of Hamilton's formulation of Newtonian mechanics, while 'state 
space' is more descriptive of the way the notion is used in the general theory of dynami- 
cal systems. Further confusion arises when prefix spatio- as in 'spatiotemporal' is used in 
reference to states extended in the (1, 2, or 3 -dimensional) physical configuration space. 
They may exhibit spatial wave-like behaviors, but their state space is oo-dimensional. 

Much of the literature denotes the vector field in a first order differential equation 
(2.7) by f(x) or F(x) or even X(x), and its integral for time t by the 'time-f forward map' 
or 'flow map' x(xq, t) = <$>(xq, t), or <p t (xo), or something else. Here we treat maps and 
flows on an equal footing, and we save Greek letters for matters quantum-mechanical. We 
reserve the notation f'(x) for maps such as (2. 10) and refer to a state space velocity vector 
field as v(x). We come to regret this choice very far into the text, only by the time we 
delve into Navier-Stokes equations. 
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Remark 2.2 Rossler and Duffing flows. The Duffing system (2.8) arises in the 
study of electronic circuits [4]. The Rossler flow (2.18) is the simplest flow which ex- 
hibits many of the key aspects of chaotic dynamics. It was introduced in ref. [ 5] as a set of 
equations describing no particular physical system, but capturing the essence of Lorenz 
chaos in the most simple of smooth flows. Otto Rossler, a man of classical education, was 
inspired in this quest by that rarely cited grandfather of chaos, Anaxagoras (456 B.C.). 
This and references to earlier work can be found in refs. [ 6, 7, 8]. We recommend in par- 
ticular the inimitable Abraham and Shaw illustrated classic [9] for its beautiful sketches 
of many flows, including the Rossler flow. Timothy Jones [ 1 0] has a number of interesting 
simulations on a Drexel website. 

Remark 2.3 Lorenz equation. The Lorenz equation (2.13) is the most celebrated 
early illustration of "deterministic chaos" [ 1 2] (but not the first - that honor goes to Dame 
Cartwright [13]). Lorenz's paper, which can be found in reprint collections refs. [ 14, 15], 
is a pleasure to read, and it is still one of the best introductions to the physics motivating 
such models. For a geophysics derivation, see Rothman course notes [ ]. The equations, 
a set of ODEs in R 3 , exhibit strange attractors. W. Tucker [17, 18, 19] has proven rigor- 
ously via interval arithmetic that the Lorenz attractor is strange for the original parameters 
(no stable orbits and that it has a long stable periodic orbit for slightly different parame- 
ters. In contrast to the hyperbolic strange attractors such as the weakly perturbed cat map, 
the Lorenz attractor is structurally unstable. Fr0yland [20] has a nice brief discussion 
of Lorenz flow. Fr0yland and Alfsen [21] plot many periodic and heteroclinic orbits of 
the Lorenz flow; some of the symmetric ones are included in ref. [20]. Guckenheimer- 
Williams [22] and Afraimovich-Bykov-Shilnikov [23] offer an in-depth discussion of the 
Lorenz equation. The most detailed study of the Lorenz equation was undertaken by Spar- 
row [24]. For a physical interpretation of p as "Rayleigh number," see Jackson [25] and 
Seydel [26]. The Lorenz truncation to 3 modes is so drastic that the model bears no rela- 
tion to the geophysical hydrodynamics problem that motivated it. For a detailed pictures 
of Lorenz invariant manifolds consult Vol II of Jackson [25]. The Lorenz attractor is a 
very thin fractal - as we saw, stable manifold thickness is of the order 10 ~ 4 - whose frac- 
tal structure has been accurately resolved by D. Viswanath [27, 28]. If you wonder what 
analytic function theory has to say about Lorenz, check ref. [29]. Refs. [30, 31] might 
also be of interest, (continued in remark 9.2) 

Remark 2.4 Diagnosing chaos. In sect. 1.3.1 we have stated that a determin- 

istic system exhibits 'chaos' if its trajectories are locally unstable (positive Lyapunov 
exponent) and globally mixing (positive entropy). In sect. 6.2 we shall define Lyapunov 
exponents and discuss their evaluation, but already at this point it would be handy to have 
a few quick numerical methods to diagnose chaotic dynamics. Laskar's frequency anal- 
ysis method [32] is useful for extracting quasi-periodic and weakly chaotic regions of 
state space in Hamiltonian dynamics with many degrees of freedom. For pointers to other 
numerical methods, see ref. [33]. 

Remark 2.5 Dynamical systems software: J.D. Meiss [34] has maintained for 
many years Sci.nonlinear FAQ which is now in part superseded by the SIAM Dynami- 
cal Systems website www . dynamicalsystems . org. The website glossary contains most 
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of Meiss's FAQ plus new ones, as well as an up-to-date software list [35] with links 
to DSTool, xpp, AUTO, etc.. Springer on-line Encyclopaedia of Mathematics main- 
tains links to dynamical systems software packages on eom.springer.de/D/dl30210.htm. 
Kuznetsov [ 1 4-] Appendix D.9 gives an exhaustive overview of software available in 2004. 
(see also remark 12.1) 
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The exercises that you should do have underlined titles . The rest (smaller type) 

are optional. Difficult problems are marked by any number of *** stars. 



Exercises 

2.1. Trajectories do not intersect. A trajectory in the 
state space M is the set of points one gets by evolving 
x G M forwards and backwards in time: 

C x = {y € M : fix) =y for t e R) . 

Show that if two trajectories intersect, then they are the 
same curve. 

2.2. Evolution as a group. The trajectory evolution /' is 
a one-parameter semigroup, where (2.4) 

r s =f°f. 

Show that it is a commutative semigroup. 

In this case, the commutative character of the semi- 
group of evolution functions comes from the commuta- 
tive character of the time parameter under addition. Can 
you think of any other semigroup replacing time? 

2.3. Almost ODE's. 

(a) Consider the point x on R evolving according 
x = e x . Is this an ordinary differential equation? 

(b) Is x — x(x(t)) an ordinary differential equation? 

(c) What about x = x(t+l)7 

2.4. All equilibrium points are fixed points. Show that 
a point of a vector field v where the velocity is zero is a 
fixed point of the dynamics /'. 

2.5. Gradient systems. Gradient systems (or 'potential 
problems') are a simple class of dynamical systems for 
which the velocity field is given by the gradient of an 
auxiliary function, the 'potential' <p 

x = -V<pix) 

where x e R d , and <p is a function from that space to the 
reals R. 

(a) Show that the velocity of the particle is in the di- 
rection of most rapid decrease of the function (p. 

(b) Show that all extrema of 0 are fixed points of the 
flow. 



(c) Show that it takes an infinite amount of time for 
the system to reach an equilibrium point. 

(d) Show that there are no periodic orbits in gradient 
systems. 

2.6. Runge-Kutta integration. Implement the fourth- 
order Runge-Kutta integration formula (see, for exam- 
ple, ref. [36]) for x = \>(x): 

k\ k*> ki k\ s 
x„ + \ = x„ + — + — + — + — + Oi&i) 
6 3 3 6 

k\ — <5t v(jt„) , k% = drvix n + k\/2) 
£3 = drvix n + &2/2) 
k\ = 6tv(x„ + kj,) . 

If you already know your Runge-Kutta, program what 
you believe to be a better numerical integration routine, 
and explain what is better about it. 

2.7. Rossler flow. Use the result of exercise 2.6 or some 
other integration routine to integrate numerically the 
Rossler flow (2.18). Does the result look like a 'strange 
attractor' ? 

2.8. Equilibria of the Rossler flow. 

(a) Find all equilibrium points {x q ,y q ,z ci ) of the 
Rossler system (2.18). How many are there? 

(b) Assume that b — a. As we shall see, some surpris- 
ingly large, and surprisingly small numbers arise 
in this system. In order to understand their size, 
innoduce parameters 

e = a/c, D= l-4e 2 , p ± = (1 ± Vd)/2. 

Express all the equilibria in terms of (c, e, D, /?*), 
expand to the first order in e, and evaluate for 
a — b — 0.2, c = 5.7 in (2.18). In the case stud- 
ied e ~ 0.03, so these estimates are quite accurate, 
(continued in exercise 3.1) 

(Rytis Paskauskas) 

2.9. Can you integrate me? Integrating equations nu- 
merically is not for the faint of heart. It is not always 
possible to establish that a set of nonlinear ordinary 
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differential equations has a solution for all times and 
there are many cases were the solution only exists for 
a limited time interval, as, for example, for the equation 
x = x 2 , jc(0) = 1 . 

(a) For what times do solutions of 

x = x(x(t)) 

exist? Do you need a numerical routine to answer 
this question? 

(b) Let's test the integrator you wrote in exercise 2.6. 
The equation 'x - —x with initial conditions x(0) = 
2 and x = 0 has the solution x(t) = e~'(l + e 2 ') . 
Can your integrator reproduce this solution for the 
interval t e [0, 10]? Check your solution by plot- 
ting the error as compared to the exact result. 

(c) Now we will try something a little harder. The 
equation is going to be third order 

x +0.6.X + x - |x| + 1 = 0 , 

which can be checked-numerically-to be chaotic. 
For initial conditions, we will always use jc(0) = 
x(0) = x(0) = 0 . Can you reproduce the re- 
sult x(12) = 0.8462071873 (all digits are sig- 
nificant)? Even though the equation being inte- 
grated is chaotic, the time intervals are not long 
enough for the exponential separation of trajecto- 
ries to be noticeable (the exponential growth fac- 
tor is « 2.4). 

(d) Determine the time interval for which the solution 
of x — x 2 , x(Q) = 1 exists. 

2.10. Classical collinear helium dynamics. In order to ap- 
ply periodic orbit theory to quantization of helium we 



shall need to compute classical periodic orbits of the he- 
lium system. In this exercise we commence their evalu- 
ation for the collinear helium atom (7.8) 

1 , 1 , Z Z 1 
H =~Pl+~P2 + — — ■ 

2 2 r\ r% r\ + r-i 

The nuclear charge for helium is Z = 2. Colinear he- 
lium has only 3 degrees of freedom and the dynamics 
can be visualized as a motion in the {j\,ri\ r,- > 0 quad- 
rant. In (n, ^-coordinates the potential is singular for 
r, — > 0 nucleus-electron collisions. These 2-body col- 
lisions can be regularized by rescaling the coordinates, 
with details given in sect. B.2. In the transformed coor- 
dinates (x\,X2,p\,pz) the Hamiltonian equations of mo- 
tion take the form 



2fii 



Pi = 2Q 2 
1 



P 2 O 2 
2-L 2 -Q 2 (l + % 
8 2 R 4 

P 2 O 2 
2 - — - g 2 (l + =r) 
8 1 R 4 



1 



(2.25) 



where/? = (Q 2 + g 2 ) 1/2 . 



(a) Integrate the equations of motion by the fourth or- 
der Runge-Kutta computer routine of exercise 2.6 
(or whatever integration routine you like). A 
convenient way to visualize the 3-dimensional 
state space orbit is by projecting it onto the 2- 
dimensional (r i (f), ^2(0) plane, (continued in ex- 
ercise 3.4) 

(Gregor Tanner, Per Rosenqvist) 
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Discrete time dynamics 



Gentles, perchance you wonder at this show; But wonder 
on, till truth make all things plain. 

— W. Shakespeare, A Midsummer Night's Dream 



The time parameter in the definition of a dynamical system can be either con- 
tinuous or discrete. Discrete time dynamical systems arise naturally from 
flows. In general there are two strategies for replacing a continuous-time 
flow by iterated mappings; by cutting it by Poincare sections, or by strobing it 
at a sequence of instants in time. Think of your partner moving to the beat in a 
disco: a sequence of frozen stills. While 'strobing' is what any numerical inte- 
grator does, by representing a trajectory by a sequence of time-integration step 
separated points, strobing is in general not a reduction of a flow, as the sequence 
of strobed points still resides in the full state space At, of dimensionality d. An 
exception are non-autonomous flows that are externally periodically forced. In 
that case it might be natural to observe the flow by strobing it at time intervals 
fixed by the external forcing, as in example 7.7 where strobing of a periodically 
forced Hamiltonian leads to the 'standard map.' 

In the Poincare section method one records the coordinates of a trajectory 
whenever the trajectory crosses a prescribed trigger. This triggering event can be 
as simple as vanishing of one of the coordinates, or as complicated as the trajectory 
cutting through a curved hypersurface. A Poincare section (or, in the remainder 
of this chapter, just 'section') is not a projection onto a lower-dimensional space: 
rather, it is a local change of coordinates to a direction along the flow, and the 
remaining coordinates (spanning the section) transverse to it. No information 
about the flow is lost by reducing it to its set of Poincare section points and the 
return maps connecting them; the full space trajectory can always be reconstructed 
by integration from the nearest point in the section. 

Reduction of a continuous time flow to its Poincare section is a powerful vi- 
sualization tool. But, the method of sections is more than visualization; it is also 
a fundamental tool of dynamics - to fully unravel the geometry of a chaotic flow, 
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Figure 3.1: A trajectory x(t) that intersects a Poincare 
section P at times h,t2,h,U, and closes a cycle 
(Jci, X2, Xj,, X4), Xk = x(tk) 6 V of topological length 
4 with respect to the section. In general, the intersec- 
tions are not normal to the section. Note also that the 
crossing z does not count, as it in the wrong direction. 




one has to quotient all of its symmetries, and evolution in time is one of these 
(This delphic piece of hindsight while be illuminated in chapter 10). 



3.1 Poincare sections 



A continuous time flow decomposes the state space into Lagrangian 'spaghetti' of 
figure 2.2, a union of non-intersecting 1-dimensional orbits. Any point on an orbit 
can be used to label the orbit, with the state space thus reduced to a 'skew-product' 
of a (d- l)-dimensional space P of labeling points xj € P and the corresponding 
1-dimensional orbit curves At/ on which the flow acts as a time translation. How- 
ever, as orbits can be arbitrary complicated and, if unstable, uncontrollable for 
times beyond the Lyapunov time (1.1), in practice it is necessary to split the orbit 
into finite trajectory segments, with time intervals corresponding to the shortest re- 
currence times on a non- wondering set of the fiow, finite times for which the flow 
is computable. A particular prescription for picking the orbit-labeling points 
in called a Poincare section. In introductory texts Poincare sections are treated 
as pretty visualizations of a chaotic flows, akin to plastic surgery and Botox, but 
their dynamical significance is much deeper than that. Once a section is defined, 
a 'Lagrangian' description of the flow (discussed above, page 47) is replaced by 
the 'Eulerian' formulation, with the trajectory-tangent velocity field v(Jc) , x e P 
enabling us to go freely between the time-quotiened space P and the full state 
space At. The dynamically important transverse dynamics -description of how 
nearby trajectories attract / repeal each other- is encoded in mapping of P — > P 
induced by the flow - dynamics along orbits is of secondary importance. 

Successive trajectory intersections with a Poincare section, a (ci-1 ^dimension- 
al hypersurface embedded in the J-dimensional state space At, figure 3.1, define 
the Poincare return map P(x), a.(d- l)-dimensional map of form 



chapter 10 



x' = P(x) = f T{i) (x) , x,xeP. 



(3.1) 



Here the first return function T(Jc)-sometimes referred to as the ceiling function-is 
the time of flight to the next section for a trajectory starting at x. The choice of 
the section hypersurface P is altogether arbitrary. It is rarely possible to define 
a single section that cuts across all trajectories of interest. Fortunately, one often 
needs only a local section, a finite hypersurface of codimension 1 intersected by 
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a swarm of trajectories near to the trajectory of interest (the case of several sec- 
tions is discussed in sect. 3.4). Such hypersurface can be specified implicitly by a 
single condition, through a function U (x) that is zero whenever a point x is on the 
Poincare section, 

xeP iff £/(*) = 0. (3.2) 

The gradient of U (x) evaluated at x e P serves a two-fold function. First, the 
flow should pierce the hypersurface P, rather than being tangent to it. A nearby 
point x + 8x is in the hypersurface P if U(x + 8x) - 0. A nearby point on the 
trajectory is given by 5x = vSt, so a traversal is ensured by the transversality 
condition 



d q 

(y>VU) = Y J vj&)djU&)*0, djU(x) = —U(x), xeP. (3.3) 

7-1 3 



Second, the gradient VU defines the orientation of the hypersurface P. The flow 
is oriented as well, and a periodic orbit can pierce P twice, traversing it in either 
direction, as in figure 3.1. Hence the definition of Poincare return map P(x) needs 
to be supplemented with the orientation condition 

x n+ i=P(x„), U(x„+i) = U(x n ) - 0, neZ + 

d 

YjVjiXrddjUiXn) > 0. (3.4) 
7=1 

In this way the continuous time t flow x(t) = f{x) is reduced to a discrete time n 
sequence x n of successive oriented trajectory traversals of P. 

With a sufficiently clever choice of a Poincare section or a set of sections, any 
orbit of interest intersects a section. Depending on the application, one might need 
to convert the discrete time n back to the continuous flow time. This is accom- 
plished by adding up the first return function times r(^ ? ), with the accumulated 
flight time given by 

tn+l = t„ + T(x n ) , to = 0 , Xn € P . (3.5) 

Other quantities integrated along the trajectory can be defined in a similar manner, 
and will need to be evaluated in the process of evaluating dynamical averages. 

A few examples may help visualize this. 



Example 3.1 A template and the associated hyperplane Poincare section: 

The simplest choice of a Poincare section is a plane V specified by a 'template' point 
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Figure 3.2: (Right:) a sequence of Poincare sec- 
tions of the Rossler strange attractor, defined by 
planes through the z axis, oriented at angles (a) 
-60° (b) 0°, (c) 60°, (d) 120°, in the x-y plane. 
(Left:) side and x-y plane view of a typical tra- 
jectory with Poincare sections superimposed. (R. 
Paskauskas) 




X 



(located at the tip of the vector x') and a normal vector h perpendicular to the plane. A 
point 1 is in this plane if it satisfies the linear condition 

U(x) = (x- x') ■ h = 0 for x€p. (3.6) 

Consider a circular periodic orbit centered at x', but not lying in V. It pierces 
the hyperplane twice; the v • h > 0 traversal orientation condition (3.4) ensures that the 
first return time is the full period of the cycle, (continued in example 12. 1 ) 



What about smooth, continuous time flows, with no obvious surfaces that 
would be good Poincare sections? 



Example 3.2 Pendulum: The phase space of a simple pendulum is 2-dimensional: 
momentum on the vertical axis and position on the horizontal axis. We choose the 
Poincare section to be the positive horizontal axis. Now imagine what happens as a 
point traces a trajectory through this phase space. As long as the motion is oscillatory 
in the pendulum all orbits are loops, so any trajectory will periodically intersect the line, 
that is the Poincare section, at one point. 

Consider next a pendulum with friction, such as the unforced Duffing system 
plotted in figure 2.4. Now every trajectory is an inward spiral, and the trajectory will 
intersect the Poincare section y = 0 at a series of points that get closer and closer to 
either of the equilibrium points; the Duffing oscillator at rest. 



Motion of a pendulum is so simple that you can sketch it yourself on a piece of 
paper. The next example offers a better illustration of the utility of visualization 
of dynamics by means of Poincare sections. 
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Figure 3.3: Return maps for the r„ — > 

fn+i ra- 
dial distance Poincare sections of figure 3.2. The 
'multi-valuedness' of (b) and (c) is only appar- 
ent: the full return map is 2-dimensional, (r\z') = 
P{r, z). (R. Paskauskas) 




4 6 8 10 12 



Example 3.3 Rossler flow: (continued from example 2.3) Consider figure 2.6, a 
typical trajectory of the 3-dimensional Rossler flow (2. 18). The strange attractor wraps 1 
around the z axis, so one choice for a Poincare section is a plane passing through the 
z axis. A sequence of such Poincare sections placed radially at increasing angles with 
respect to the x axis, figure 3.2, illustrates the 'stretch & fold' action of the Rossler flow, 
by assembling these sections into a series of snapshots of the flow. A line segment 
in (a), traversing the width of the attractor at y = 0, x > 0 section, starts out close to 
the x-y plane, and after the stretching (a) -» (b) followed by the folding (c) -» (d), the 
folded segment returns (d) -> (a) close to the initial segment, strongly compressed. In 
one Poincare return the interval is thus stretched, folded and mapped onto itself, so the 
flow is expanding. It is also mixing, as in one Poincare return a point from the interior 
of the attractor can map onto the outer edge, while an edge point lands in the interior. 

Once a particular Poincare section is picked, we can also exhibit the return map 
(3.1), as in figure 3.3. Cases (a) and (d) are examples of nice 1-to-1 return maps. While 
(b) and (c) appear multimodal and non-invertible, they are artifacts of projectina^a 2: 2 
dimensional return map (r„,z„) — > (r B +i,z B+ i) onto a 1 -dimensional subspace r„ -> r n+ \. 
(continued in example 3.5) 



fast track: 
sect. 3.3, p. 65 



The above examples illustrate why a Poincare section gives a more informative 
snapshot of the flow than the full flow portrait. For example, while the full flow 
portrait of the Rossler flow figure 2.6 gives us no sense of the thickness of the 
attractor, we see clearly in the Poincare sections of figure3.2 that even though the 
return maps are 2-dimensional —> 2-dimensional, the flow contraction is so strong 
that for all practical purposes it renders the return maps 1 -dimensional. (We shall 
quantify this claim in example 4.5.) 



3.1.1 Section border 

How far does the neighborhood of a template extend along the hyperplane (3.6)? 
A section captures faithfully neighboring orbits as long as it cuts them transver- 
sally; it fails the moment the velocity field at point x* fails to pierce the section. 
At this location the velocity is tangent to the section and, thus, orthogonal to the 
template normal h, 



h-v(x*) = 0, x*eS, 



(3.7) 
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i.e., v ± (x), component of the v(x) normal to the section, vanishes at x* . For a 
smooth flow such points form a smooth (<i-2)-dimensional section border S CP, 
encompassing the open neighborhood of the template characterized by qualita- 
tively similar flow. We shall refer to this region of the section hyperplane as the 
(maximal) chart of the template neighborhood for a given hyperplane (5.6). 

If the template point is an equilibrium Xq, there is no dynamics exactly at this 
point as the velocity vanishes (v(xq) = 0 by the definition of equilibrium) and 
cannot be used to define a normal to the section. Instead, we use the local lin- 
earized flow to construct the local Poincare section P. We orient P so the unsta- 
ble eigenvectors are transverse to the section, and at least the slowest contracting 
eigenvector is tangent to the section, as in figure 4.6. This ensures that the flow is 
transverse to P in an open neighborhood of the template x. { . 

Visualize the flow as a smooth 3-dimensional steady fluid flow cut by a 2- 
dimensional sheet of light. Lagrangian particle trajectories either cross, are tan- 
gent to, or fail to reach this plane; the 1 -dimensional curves of tangency points de- 
fine the section border. An example is offered by the velocity field of the Rossler 
flow of figure 4.5. Pick a Poincare section hyperplane so it goes through both equi- 
librium points. The section might be transverse to a large neighborhood around 
the inner equilibrium x~, but dynamics around the outer equilibrium x+ is totally 
different, and the competition between the two types of motion is likely to lead 
to vanishing of v±(x), component of the v(x) normal to the section, someplace 
in-between the two equilibria. A section is good up to the section border, but be- 
yond it an orbit infinitesimally close to x* generically does not cross the section 
hyperplane, at least not infinitesimally close to S. 

For 3-dimensional flows, the section border S is a 1 -dimensional closed curve 
in the section 2-dimensional P, and easy to visualize. In higher dimensions, the 
section border is a (d — 2) -dimensional manifold, not easily visualized, and the 
best one can do is to keep checking for change of sign (3.4) at Poincare section 
returns of nearby trajectories close to the section border hypersurface S; (5.7) will 
be positive inside, negative immediately outside S. 

Thus for a nonlinear flow, with its complicated curvilinear invariant manifolds, 
a single section rarely suffices to capture all of the dynamics of interest. 



exercise 3.7 



3.1.2 What is the best Poincare section? 



In practice, picking sections is a dark and painful art, especially for high-dimens- 
ional flows where the human visual cortex falls short. It helps to understand why 
we need them in the first place. 

Whenever a system has a continuous symmetry G, any two solutions related 
by the symmetry are equivalent. We do not want to keep recomputing these over 
and over. We would rather replace the whole continuous family of solutions by 
one solution in order to be more efficient. This approach replaces the dynamics 
(At,/) with dynamics on the quotient state space (M/t,f). For now, we only 
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Figure 3.4: (a) Lorenz flow figure 2.5 cut by y = x 

Poincare section plane P through the z axis and 
both EQ12 equilibria. Points where flow pierces 
into section are marked by dots. To aid visualiza- 
tion of the flow near the EQo equilibrium, the flow 
is cut by the second Poincare section, V , through 
y = —x and the z axis, (b) Poincare sections P and 
P' laid side-by-side. The singular nature of these 
sections close to EQo will be elucidated in exam- 
ple 4.6 and figure 1 1 .8 (b). (E. 
Siminos) 




(a) 



(b) 



remark that constructing explicit quotient state space flow / is either extremely 
difficult, impossible, or generates unintelligible literature. Our solution (see chap- 
ter 10) will be to resort to the method of slices. 

Time evolution itself is a 1 -parameter Lie group, albeit a highly nontrivial one 
(otherwise this book would not be much of a doorstop). The invariants of the flow 
are its infinite-time orbits; particularly useful invariants are compact orbits such 
as equilibrium points, periodic orbits, and tori. For any orbit it suffices to pick a 
single state space point x e M p , the rest of the orbit is generated by the flow. 

Choice of this one 'labeling' point is utterly arbitrary; in dynamics this is 
called a 'Poincare section', and in theoretical physics this goes by the excep- 
tionally uninformative name of 'gauge fixing'. The price is that one generates 
'ghosts', or, in dynamics, increases the dimensionality of the state space by addi- 
tional constraints (see sect. 13.4). It is a commonly deployed but inelegant proce- 
dure where symmetry is broken for computational convenience, and restored only 
at the end of the calculation, when all broken pieces are reassembled. 

With this said, there are a few rules of thumb to follow: (a) You can pick as 
many sections as convenient, as discussed in sect. 3.4. (b) For ease of compu- 
tation, pick linear sections (3.6) when possible, (c) If equilibria play important 
role in organizing a flow, pick sections that go through them (see example3.4). In 
that case, try to place contractor eigenvectors inside the hyperplane, see Lorenz 
figure 3.4. Remember, the stability eigenvectors are never orthogonal to each 

other, unless that is imposed by some symmetry, (d) If you have a global discrete 

• i i i- • • i i / chapter 9 

or continuous symmetry, pick sections left invariant by the symmetry (see exam- 
ple 9.14). For example, setting the normal vector h in (3.6) at x to be the velocity 
v(x) is natural and locally transverse, (e) If you are solving a local problem, like 
finding a periodic orbit, you do not need a global section. Pick a section or a set of 
(multi-shooting) sections on the fly, requiring only that they are locally transverse 
to the flow, (f) If you have another rule of thumb dear to you, let us know. 



Example 3.4 Sections of Lorenz flow: (continued from example 2.2) The plane 
V fixed by the x = y diagonal and the z-axis depicted in figure 3.4 is a natural choice 
of a Poincare section of the Lorenz flow of figure 2.5, as it contains all three equilib- 
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ria, x EQo = (0,0,0) and the (2.14) pair x EQl , x EQl . A section has to be supplemented 
with the orientation condition (3.4): here points where flow pierces into the section are 
marked by dots. 

Equilibria x EQl , x EQl are centers of out-spirals, and close to them the section 
is transverse to the flow. However, close to EQ 0 trajectories pass the z-axis either 
by crossing the section V or staying on the viewer's side. We are free to deploy as 
many sections as we wish: in order to capture the whole flow in this neighborhood 
we add the second Poincare section, V , through the y = -x diagonal and the z-axis. 
Together the two sections, figure 3.4(b), capture the whole flow near EQ 0 . In contrast 
to Rossler sections of figure 3.2, these appear very singular. We explain this singularity 
in example 4.6 and postpone construction of a Poincare return map until example 9. 14. 
(E. Siminos and J. Halcrow) 



3.2 Computing a Poincare section 



(R. Mainieri) 

For almost any flow of physical interest a Poincare section is not available in 
analytic form, so one tends to determine it crudely, by numerically bracketing 
the trajectory traversals of a section and iteratively narrowing the bracketing time 
interval. We describe here a smarter method, which you will only need when 
you seriously look at a strange attractor, with millions of points embedded in a 
high(er)-dimensional Poincare section - so skip this section on the first reading. 

Consider the system (2.7) of ordinary differential equations in the vector vari- 
able X = (X\,X2, ■ ■ ■ , X ( j) 



remark 3.2 



dxi 

-jT = Vi(x,t), (3.8) 
dt 



where the flow velocity v is a vector function of the position in state space x and 
the time t. In general, the map f n (x^) - x n + J drv(x(T)) cannot be integrated 
analytically, so we will have to resort to numerical integration to determine the 
trajectories of the system. Our task is to determine the points at which the numer- 
ically integrated trajectory traverses a given hypersurface. The hypersurface will 
be specified implicitly through a function U (x) that is zero whenever a point x is 
on the Poincare section, such as the hyperplane (3.6). 

If we use a tiny step size in our numerical integrator, we can observe the value 
of U as we integrate; its sign will change as the trajectory crosses the hypersurface. 
The problem with this method is that we have to use a very small integration time 
step. However, there is a better way to land exactly on the Poincare section. 

Let t a be the time just before U changes sign, and % the time just after it 
changes sign. The method for landing exactly on the Poincare section will be to 
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convert one of the space coordinates into an integration variable for the part of the 
trajectory between t a and tb- Using 

dxb dx\ dxh 

1 Ji -^ = 1 jL v l (x,t) = v k (x,t) (3.9) 
dx\ dt dx\ 



we can rewrite the equations of motion (3.8) as 

dt I dxd _Vd_ 

dx\ v\ dx\ v\ 

Now we use x\ as the 'time' in the integration routine and integrate it from x\(t a ) to 
the value of x\ on the hypersurface, determined by the hypersurface intersection 
condition (3.6). This is the end point of the integration, with no need for any 
interpolation or backtracking to the surface of section. The ^-axis need not be 
perpendicular to the Poincare section; any xj can be chosen as the integration 
variable, provided the ^,-axis is not parallel to the Poincare section at the trajectory 
intersection point. If the section crossing is transverse (3.3), vi cannot vanish in 
the short segment bracketed by the integration step preceding the section, and the 
point on the Poincare section. 



Example 3.5 Computation of Rossler flow Poincare sections. (continued from 
example 3.3) Convert Rossler equation (2. 18) to cylindrical coordinates: 

r - v r — —z cos 6 + ar sin 2 8 

z a 
8 = v g = 1 + - sin# + - sin 20 
r 2 

z = v z — b + z(r cos 6 - c) . (3.11) 

Poincare sections of figure 3.2 are defined by the fixing angle U(x) = 8 - 8q - 0. In 
principle one should use the equilibrium x+ from (2.19) as the origin, and its eigen- 
vectors as the coordinate frame, but here original coordinates suffice, as for parameter 
values (2. 18), and(x Q ,y 0 ,zo) sufficiently faraway from the inner equilibrium, 6 increases 
monotonically with time. Integrate 

dr dt dz 

— = U r /Vg, — = 1/ Vg, — = VjVg (3.12) 

d6 dO dO 

from (r n ,8 n ,z n ) to the next Poincare section at G„+i, and switch the integration back to 
(x, y, z) coordinates, (continued in example 4. 1 ) (Radford Mitchell, Jr.) 



3.3 Mappings 



Do it again! 

— Isabelle, age 3 
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Figure 3.5: A flow x(t) of figure 3.1 represented by a 
Poincare return map that maps points in the Poincare 
section V as x n+ \ = f(x„) . In this example the orbit of 
X\ is periodic and consists of the four periodic points 




Though we have motivated discrete time dynamics by considering sections of a 
continuous flow and reduced the continuous-time flow to a family of maps P{x) 
mapping points x from a section to a section, there are many settings in which 
dynamics is inherently discrete, and naturally described by repeated iterations of 
the same map 



remark 3.1 



f:M^M, 

or sequences of consecutive applications of a finite set of maps, a different map, 
Ia,Ib, ■ • ■, for points in different regions [Ma, Mb, ■ ■ ■ , Mz], 



{f A ,fB,...fz}:M^M, 



(3.13) 



for example maps relating different sections among a set of Poincare sections. The 
discrete 'time' is then an integer, the number of applications of the map or maps. 
As writing out formulas involving repeated applications of a set of maps explicitly 
can be awkward, we streamline the notation by denoting the (non-commutiative) 
map composition by V 



fz(- • ■ f B (f A (x))) ■ • •) = fz o ■ ■ ■ f B o f A (x) , 



(3.14) 



and the nth iterate of map / by 



fix) = f o f~\x) =f(f-\x)) , f(x) 



The trajectory of x is the finite set of points 



section 2.1 



[x, f{x), f{x),..., f{x)} , 



traversed in time n, and M x , the orbit of x, is the subset of all points of M that 
can be reached by iterations of /. A periodic point icycle point) x\ belonging to a 
periodic orbit icycle) of period n is a real solution of 



f"ix k ) - /(/(. . .fix k ) ...)) = x k , k = 0, 1,2, .. . ,n - 1 . 



(3.15) 



For example, the orbit of x.\ in figure 3.5 is a set of four cycle points, (jq , x%, £3, £4) . 
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Figure 3.6: The strange attractor and an unstable pe- 
riod 7 cycle of the Henon map (3.17) with a = 1.4, 




b = 0.3. The periodic points in the cycle are connected 
to guide the eye. (from K.T Hansen [2]) 



The functional form of such Poincare return maps P as figure 3.3 can be ap- 
proximated by tabulating the results of integration of the flow from x to the first 
Poincare section return for many x e P, and constructing a function that inter- 
polates through these points. If we find a good approximation to P(x), we can 
get rid of numerical integration altogether, by replacing the continuous time tra- 
jectory f(x) by iteration of the Poincare return map P{x). Constructing accurate 
P(x) for a given flow can be tricky, but we can already learn much from approxi- 
mate Poincare return maps. Multinomial approximations 



P k (x) = a k + ^ b kj Xj + ^ 



Ckij 



xeP 



(3.16) 



i.7= 



to Poincare return maps 







'P\{x n )^ 










, X n , X n + 1 £ y 


^,n+l ' 




^ Pd(x„) ) 





motivate the study of model mappings of the plane, such as the Henon map. 



Example 3.6 Henon map: The map 

x n+ ] = l-ax 2 n + by„ 

y n +\ = x„ (3.17) 

is a nonlinear 2-dimensional map frequently employed in testing various hunches about 
chaotic dynamics. The Henon map is sometimes written as a 2-step recurrence relation 

x n +\ = 1 - ax\ + bx„-\ . (3.18) 



An n-step recurrence relation is the discrete-time analogue of an nth order differential 
equation, and it can always be replaced by a set ofn 1-step recurrence relations. 
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The Henon map is the simplest map that captures the 'stretch & fold' dynamics 
of return maps such as Rossler's, figure 3.2. It can be obtained by a truncation of a 
polynomial approximation (3. 16) to a Poincare return map (3. 16) to second order. 

A quick sketch of the long-time dynamics of such a mapping (an example is 
depicted in figure 3.6), is obtained by picking an arbitrary starting point and iterating 
(3. 17) on a computer. 

Always plot the dynamics of such maps in the (x„, x n +\) plane, rather than in the 
(x n ,y n ) plane, and make sure that the ordinate and abscissa scales are the same, so 
x„ = x n+ \ is the 45° diagonal. There are several reasons why one should plot this way: 
(a) we think of the Henon map as a model return map x„ -* x„+i, and (b) as parameter 
b varies, the attractor will change its y-axis scale, while in the (x n , x n +{) plane it goes to 
a parabola as b -> 0, as it should. 

exercise 3.5 

As we shall soon see, periodic orbits will be key to understanding the long-time 
dynamics, so we also plot a typical periodic orbit of such a system, in this case an 
unstable period 7 cycle. Numerical determination of such cycles will be explained in 
sect. 29.1, and the periodic point labels 01 1 1010, 1 1 10100, ■■■in sect. 12.2. 



Example 3.7 Lozi map: Another example frequently employed is the Lozi map, a 
linear, 'tent map' version of the Henon map (3. 1 7) given by 

x n+ ] = 1 - a\x„\ + by„ 

y n +\ = x n . (3.19) 

Though not realistic as an approximation to a smooth flow, the Lozi map is a very helpful 
tool for developing intuition about the topology of a large class of maps of the 'stretch 
& fold' type. 

What we get by iterating such maps is-at least qualitatively-not unlike what 
we get from Poincare section of flows such as the Rossler flow figure 3.3. For 
an arbitrary initial point this process might converge to a stable limit cycle, to a 
strange attractor, to a false attractor (due to roundoff errors), or diverge. In other 
words, mindless iteration is essentially uncontrollable, and we will need to resort 
to more thoughtful explorations. As we shall explain in due course, strategies for 
systematic exploration rely on stable/unstable manifolds, periodic points, saddle- 
straddle methods and so on. 



exercise 6.4 



Example 3.8 Parabola: For sufficiently large value of the stretching parameter a, 
one iteration of the Henon map (3.17) stretches and folds a region of the (x,y) plane 
centered around the origin, as will be illustrated in figure 12.4. The parameter a controls 
the amount of stretching, while the parameter b controls the thickness of the folded 
image through the '1-step memory' term bx n -\ in (3. 18). In figure 3.6 the parameter b is 
rather large, b = 0.3, so the attractor is rather thick, with the transverse fractal structure 
clearly visible. For vanishingly small b the Henon map reduces to the 1 -dimensional 
quadratic map 

x n+ i = 1 - ax\ . (3.20) 

©xsrcis 

By setting b = 0 we lose determinism, as on reals the inverse of map (3.20) has two 
real preimages {x+_ v x~_ x } for most x n . If Bourbaki is your native dialect: the Henon 
map is injective or one-to-one, but the quadratic map is surjective or many-to-one. Still, 
this 1 -dimensional approximation is very instructive, (continued in example 11.5) 
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Figure 3.7: Reduction of a continuous-time flow (left 
frame) to a set of Poincare maps (right frame), with a 
point on 1 -cycle and the two cycle points of a 2-cycle 
used as template points. 




As we shall see in sect. 11.3, an understanding of 1-dimensional dynamics is 
indeed the essential prerequisite to unraveling the qualitative dynamics of many 
higher-dimensional dynamical systems. For this reason many expositions of the 
theory of dynamical systems commence with a study of 1-dimensional maps. We 
prefer to stick to flows, as that is where the physics is. 



appendix 1.8 



fast track: 
sect. 4, p. 77 



3.4 Charting the state space 



In simple examples, such as the Rossler example 3.3, a single Poincare section 
suffices, but this is rarely the case for flows of physical interest. In this section 
(skip it on first reading) we commence a discussion of the general case. 

A Poincare section is constructed by picking a 'template' point f within a 
state space region of interest, and defining a hypersurface (3.2) that goes through 
the template point. In theory, this Poincare section could be any (J-l)-dimensional 
manifold. In practice, a hyperplane (3.6) is the most convenient, the natural choice 
for the vector normal to the section being h = v{%), the velocity field at the 
template point x / . This Poincare section x e V is a hyperplane, 




appendix 13.4 



(x - x) = 0 , 



v(x) , 



(3.21) 



normal to the flow direction V at the template point x 1 . Such section cuts the 
nearby trajectories transversally, and is a good description of solutions similar to 
the given template. 

So one hyperspace P will, in general, not suffice. A more insightful picture 
of the dynamics is obtained by partitioning the state space into N qualitatively 
distinct regions {Ati, M2, ■ ■ ■ , Mn\ and constructing a Poincare section per re- 
gion, global atlas of the state space composed of ./V local Poincare sections fil) 
or charts, each one capturing a neighborhood of a qualitatively prominent state 



section 11.1 
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x'^ e S . We shall refer to these states as templates, each represented in the state 
space M of the system by a template point x'^ 2 \ ■■■ , x'^}. 

Our Poincare section is a hyperplane. If we pick another template point 
x' (2 \ it comes along with its own section hyperplane. The (d- l)-dimensional 
Poincare sections for an adjacent pair of template intersects in a 'ridge' ('bound- 
ary,' 'edge'), a (d- 2) -dimensional hyperplane, easy to compute. Follow an ant 
(the sequence of Poincare map iterates) as it progresses along the Poincare sec- 
tion !P (1) . The moment (^ (1) (r) - x' (2 ^) ■ fp^ changes sign, the ant has crossed the 
ridge, we switch the Poincare section, and the ant continues its merry stroll now 
confined to the !^ 2) section. Each Poincare section P^D, provides a local chart 
at x.'^ for a neighborhood of an important, qualitatively distinct class of solu- 
tions; together they 'Voronoi' tessellate the curved manifold in which the reduced 
dynamics is replaced by a finite set of mappings between hyperplane tiles. An ex- 
ample is the periodic-orbit implementation of the idea of state space tessellation 
by neighborhoods of recurrent points, so dear to professional cyclists, illustrated 
in figure 3.7. 

For a given dynamical flow, the physical task is to pick a minimal set of qual- 
itatively distinct templates. The state space might be filled by all kinds of highly 
unstable, never revisited equilibria and relative periodic orbits. The choice of 
templates should reflect the dynamically prominent states seen in the long-time 
simulations of system's dynamics. We have only vague advice on how to pick a 
single Poincare section (see sect. 3.1.2), and no advice on how to systematically 
pick a set of 'good' templates, other than that the associated section tiles should 
be as large as possible, but still sufficiently small to exclude orbit tangencies, i.e., 
stop before crossing their section borders (3.7). Ideally, one wold like to pick as 
few templates as possible in figure 3.7. Once templates are picked, the rest is ge- 
ometry of hyperplanes, so checking whether the section border is on the far side 
of the tile edge (ridge between two sections) is a fast, linear computation. 

There is a rub, though - you need to know how to pick the neighboring tem- 
plates. Perhaps a glance at figure 3.7 helps visualize the problem; imagine that 
the tiles belong to the Poincare sections through template points on these orbits. 
One could slide templates along their trajectories until the pairs of straight line 
segments connecting neighboring template points are minimized, but that seems 
a bit arbitrary. At this time we have no advice as how to 'synchronize' the tem- 
plates relative to each other. The astute reader will instantly recognize this as the 
problem of 'local gauge invariance' or 'gauge fixing' of Quantum Field Theory 
and General Relativity. 

3.4.1 Navigating the Poincare-charted state space 

Our goal now is to replace the continuous-time dynamics by a set of Poincare 
maps between a set of hyperplane sections, as in figure 3.7. The flat hyperplane 
(3.6) is an ad hoc construct; one Poincare section rarely suffices to capture all 
of the dynamics of interest. Instead we chart the state space by partitioning it 
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Figure 3.8: Some examples of 3-disk cycles: (a) 
12123 and 13132 are mapped into each other by the 
flip across 1 axis. Similarly (b) 123 and 132 are related 
by flips, an d (c) 1213, 1 232 and 1323 b y rotations, (d) 
The cycles 121212313 and 121212323 are related by 
rotation and time reversal. These symmetries are dis- 
cussed in chapter 9. (From ref. [1]) 



b) 



c! 



d) 




121212313 



121212323 



into N qualitatively distinct regions {Mi, Mi, ■ ■ ■ , M^}- Successive trajectory 
intersections with the set of (ci-l)-dimensional hypersurfaces f s embedded in the 
J-dimensional state space M, define the set of (d- 1) — > (d- 1) Poincare maps 



section 11.1 



%n+\ — P s n +\s n (.Xn) — f " (%n) 

x n+l € <P S >'^ , x n e V s " , 



(3.22) 



s € {1,2,..., AO, 



The (i-dimensional continuous time flow is thus reduced to discrete time compo- 
sition 



Ps 0 Si-S„ - Ps,^,,-! O • • • O P S2S] O P 



Si S 0 



of a set of Poincare maps (3.22) that map the coordinates of Poincare section P Sji 
to those of Ps„+i , the next section traversed by a given trajectory. 



If a trajectory traverses regions A1 S( 



Ms 



M s , the sequence 



sqS\ ■ • ■ s n - s n <— • • • <— s\ <— so is said to be admissible. The return map 
P So from section f SQ to itself has a contribution from any admissible returning 
(periodic, s„ = sq) sequence of compositions 



section 11.6 



S 0 S V -S„- l 



s 0 = fWi 0 • • • 0 ^Vi o P 



SlS 0 



(3.23) 



The next example offers an unambiguous set of such Poincare sections which 
do double duty, providing us both with an exact representation of dynamics in 
terms of maps, and with a symbolic dynamics, a subject that we will return to in 
chapter 11. 



chapter 1 1 
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Figure 3.9: (a) Poincare section coordinates for 
the 3-disk game of pinball. (b) Collision sequence 
(Ji.pi) i-» (S2,P2) >-» ($3, P3) from the boundary 
of a disk to the boundary of the next disk is coded 
by the Poincare maps sequence P^^Pi^i- 




Example 3.9 Pinball game, Poincare dissected. (continued from sect. 1.4) A 
phase-space orbit is fully specified by its position and momentum at a given instant, 
so no two distinct phase-space trajectories can intersect. The configuration space 
trajectories, however, can and do intersect, in rather unilluminating ways, as e.g. in 
figure 3.8 (d), and it can be rather hard to perceive the systematics of orbits from their 
configuration space shapes. The problem is that we are looking at the projections of 
4-dimensional state space trajectories onto a 2-dimensional configuration subspace. 
A much clearer picture of the dynamics is obtained by constructing a set of Poincare 
sections. 

Suppose that the pinball has just bounced off disk 1 . Depending on its posi- 
tion and outgoing angle, it could proceed to either disk 2 or 3. Not much happens in 
between the bounces-the ball just travels at constant velocity along a straight line-so 
we can reduce the 4-dimensional flow to a 2-dimensional map Po- k <-o-j that maps the 
coordinates (Poincare section Pi) of the pinball from one disk edge to another. Just 
after the moment of impact the trajectory is defined by s n , the arc-length position of the 
nth bounce along the billiard wall, and p„ = /?sin0„ the outgoing momentum compo- 
nent parallel to the billiard wall at the point of impact, figure 3.9(a). These coordinates 
(due to Birkhoff) are smart, as they conserve the phase-space volume. Trajectories 0 
originating from one disk can hit either of the other two disks, or escape without further 
ado. We label the survivor state space regions Pn, P\i- In terms of the three Poincare 
sections, one for each disk, the dynamics is reduced to the set of six maps 

On+l.Pn+l) = Po- n ^CT n (Sn,Pn), CT 6 {1,2,3} (3.24) 

from the boundary of a disk to the boundary of the next disk, figure 3.9 (b). The explicit 
form of this map is easily written down, see example 8. 1, but much more economical 
is the symmetry quotiented version of chapter 9 which replaces the above 6 fonvarp 
maps by a return map pair P 0 , P\ . (continued in chapter 8) 

chapter 8 

Billiard dynamics is exceptionally simple - free flight segments, followed by 
specular reflections at boundaries, with billiard boundaries the obvious choice as 
Poincare sections. For a general flow one is never so lucky. Also, so far we have 
discussed only flows with a 1 continuous parameter (the time). The general case 
of A^-parameter continuous symmetries we postpone to chapter 10. 
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Resume 



In recurrent dynamics a trajectory exits a region in state space and then reenters 
it infinitely often, with finite return times. If the orbit is periodic, it returns after 
a full period. So, on average, nothing much really happens along the trajectory- 
what is important is behavior of neighboring trajectories transverse to the flow. 
This observation motivates a replacement of the continuous time flow by iterative 
mapping, the Poincare maps. A visualization of a strange attractor can be greatly 
facilitated by a felicitous choice of Poincare sections, and the reduction of flow 
to Poincare maps. This observation motivates in turn the study of discrete-time 
dynamical systems generated by iterations of maps. 

A particularly natural application of the Poincare section method is the reduc- 
tion of a billiard flow to a boundary-to-boundary return map, described in chap- 
ter 8. As we shall show in appendix B, further simplification of a Poincare return 
map, or any nonlinear map, can be attained through rectifying these maps locally 
by means of smooth conjugacies. 

In truth, as we shall see in chapter 10, the reduction of a continuous time 
flow by the method of Poincare sections is not a convenience, but an absolute 
necessity - to make sense of an ergodic flow, all of its continuous symmetries 
must be reduced, evolution in time being one of these symmetries. 



chapter 8 
appendix B 



Commentary 



Remark 3.1 Functions, maps, mappings. In mathematics, 'mapping' is a noun, 
'map' is a verb. Nevertheless, 'mapping' is often shortened to 'map' and is often used 
as a synonym for 'function.' 'Function' is used for mappings that map to a single point 
in R or C, while a mapping which maps to R d would be called a 'mapping,' and not a 
'function.' Likewise, if a point maps to several points and/or has several pre-images, this 
is a 'many-to-many' mapping, rather than a function. In his review [ 27], Smale refers to 
iterated maps as 'diffeomorphisms', in contradistinction to 'flows', which are 1-parameter 
groups of diffeomorphisms. In the sense used here, in the theory of dynamical systems, 
dynamical evolution from an initial state to a state finite time later is a (time-forward) 
map. 



Remark 3.2 Determining a Poincare section. The trick described in sect. 3.2 is due 
to Henon [3, 4, 5]. The idea of changing the integration variable from time to one of the 
coordinates, although simple, avoids the alternative of having to interpolate the numerical 
solution to determine the intersection. 



Remark 3.3 Henon, Lozi maps. The Henon map is of no particular physical im- 
port in and of itself-its significance lies in the fact that it is a minimal normal form for 
modeling flows near a saddle-node bifurcation, and that it is a prototype of the stretching 



maps - 16mar2012 



ChaosBook.org version 14.4.1, Apr 21 2013 



CHAPTER 3. DISCRETE TIME DYNAMICS 



74 



and folding dynamics that leads to deterministic chaos. It is generic in the sense that it 
can exhibit arbitrarily complicated symbolic dynamics and mixtures of hyperbolic and 
non-hyperbolic behaviors. Its construction was motivated by the best known early ex- 
ample of 'deterministic chaos,' the Lorenz equation, see example 2.2 and remark 2.3. 
Y. Pomeau's studies of the Lorenz attractor on an analog computer, and his insights into 
its stretching and folding dynamics motivated Henon [6] to introduce the Henon map in 
1976. Henon's and Lorenz's original papers can be found in reprint collections refs. [ 7, 8]. 
They are a pleasure to read, and are still the best introduction to the physics motivating 
such models. Henon [6] had conjectured that for (a, b) = (1.4, 0.3) Henon map a generic 
initial point converges to a strange attractor. Its existence has never been proven. While 
for all practical purposes this is a strange attractor, it has not been demonstrated that long 
time iterations are not attracted by some long attracting limit cycle. Indeed, the pruning 
front techniques that we describe below enable us to find stable attractors arbitrarily close 
by in the parameter space, such as the 13-cycle attractor at (a, b) = (1.39945219, 0.3). A 
rigorous proof of the existence of Henon attractors close to 1 -dimensional parabola map is 
due to Benedicks and Carleson [9]. A detailed description of the dynamics of the Henon 
map is given by Mira and coworkers [10, 11, 12], as well as very many other authors. The 
Lozi map (3.19) is particularly convenient in investigating the symbolic dynamics of 2- 
dimensional mappings. Both the Lorenz and Lozi [ 13] systems are uniformly expanding 
smooth systems with singularities. The existence of the attractor for the Lozi map was 
proven by M. Misiurewicz [14], and the existence of the SRB measure was established by 
L.-S. Young [15]. 
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Exercises 



3.1. Poincare sections of the Rossler flow. (continuation 
of exercise 2.8) Calculate numerically a Poincare sec- 
tion (or several Poincare sections) of the Rossler flow. 
As the Rossler flow state space is 3D, the flow maps 
onto a 2D Poincare section. Do you see that in your 
numerical results? How good an approximation would 
a replacement of the return map for this section by a 1- 
dimensional map be? More precisely, estimate the thick- 
ness of the strange attractor. (continued as exercise 4.4) 

(R. Paskauskas) 

3.2. A return Poincare map for the Rossler flow. (con- 
tinuation of exercise 3.1) That Poincare return maps 
of figure 3.3 appear multimodal and non-invertible is 
an artifact of projections of a 2-dimensional return map 
(Rn,z„) — > (R„+i,z n +i) onto a 1-dimensional subspace 

Rn — * Rn+l- 

Construct a genuine s n+ i = f(s„) return map by parame- 
terizing points on a Poincare section of the attractor fig- 
ure 3.2 by a Euclidean length s computed curvilinearly 
along the attractor section. 

This is best done (using methods to be developed in 
what follows) by a continuation of the unstable man- 
ifold of the 1 -cycle embedded in the strange attractor, 
figure 13.2(b). 

(P. Cvitanovic) 

3.3. Arbitrary Poincare sections. We will generalize the 
construction of Poincare sections so that they can have 
any shape, as specified by the equation U(x) = 0. 

(a) Start by modifying your integrator so that you 
can change the coordinates once you get near the 
Poincare section. You can do this easily by writing 
the equations as 



dx k 

~T =*fk, 
as 



(3.25) 



with dt/ds = k, and choosing k to be 1 or \/f\. 
This allows one to switch between t and x\ as the 
integration 'time.' 
(b) Introduce an extra dimension x n+ \ into your sys- 
tem and set 

x n+ i = U(x) . (3.26) 

How can this be used to find a Poincare section? 

3.4. Classical collinear helium dynamics. 

(continuation of exercise 2.10) Make a Poincare surface 
of section by plotting (ri,pi) whenever r2 = 0: Note that 
for r2 = 0, pi is already determined by (7.8). Compare 
your results with figure B.3 (b). 



(Gregor Tanner, Per Rosenqvist) 

3.5. Henon map fixed points. Show that the two fixed 
points (xo,xo), (jci, jci) of the Henon map (3.17) are 
given by 



x 0 = 



-(l-b)- V(l ~b) 2 + 4a 
2a 



-(l-b)+ V(l-6)2+4fl 
x\ = . (3.27) 

2a 



3.6. Fixed points of maps. A continuous function F is 
a contraction of the unit interval if it maps the interval 
inside itself. 

(a) Use the continuity of F to show that a 1- 
dimensional contraction F of the interval [0, 1 ] has 
at least one fixed point. 

(b) In a uniform (hyperbolic) contraction the slope of 
F is always smaller than one, \F'\ < 1. Is the com- 
position of uniform contractions a contraction? Is 
it uniform? 

3.7. Section border for Rossler. (continuation of exer- 
cise 3.1) Determine numerically section borders (3.7) 
for several Rossler flow Poincare sections of exercise 3.1 
and figure 3.2, at least for angles 

(a) -60° , (b) 0°, and 

(c) A Poincare section hyperplane that goes through 
both equilibria, see (2.19) and figure 4.5. Two 
points only fix a line: think of a criterion for a 
good orientation of the section hyperplane, per- 
haps by demanding that the contracting eigenvec- 
tor of the 'inner' equilibrium x_ lies in it. 

(d) (Optional) Hand- or computer-draw a visualiza- 
tion of the section border as 3-dimensional fluid 
flow which either crosses, is tangent to, or fails to 
cross a sheet of light cutting across the flow. 

As the state space is 3-dimensional, the section borders 
are 1-dimensional, and it should be easy to outline the 
border by plotting the color-coded magnitude of v ±(x), 
component of the v(Jc) normal to the section, for a fine 
grid of 2-dimensional Poincare section plane points. For 
sections that go through the z-axis, the normal velocity 
v_l(Jc) is tangent to the circle through jc, and vanishes for 
9 in the polar coordinates (3.11), but that is not true for 
other Poincare sections, such as the case (c). 

(P. Cvitanovic) 
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Chapter 4 



Local stability 



(R. Mainieri and P. Cvitanovic) 

So far we have concentrated on describing the trajectory of a single initial 
point. Our next task is to define and determine the size of a neighborhood 
of x(t). We shall do this by assuming that the flow is locally smooth and by 
describing the local geometry of the neighborhood by studying the flow linearized 
around x(t). Nearby points aligned along the stable (contracting) directions remain 
in the neighborhood of the trajectory x(t) = f(xo); the ones to keep an eye on are 
the points which leave the neighborhood along the unstable directions. As we shall 
demonstrate in chapter 18, the expanding directions matter in hyperbolic systems. 
The repercussions are far-reaching. As long as the number of unstable directions 
is finite, the same theory applies to finite-dimensional ODEs, state space volume 
preserving Hamiltonian flows, and dissipative, volume contracting infinite-dim- 
ensional PDEs. 

In order to streamline the exposition, in this chapter all examples are collected 
in sect. 4.8; you can get to them and back to the text by clicking on the [example] 
links, such as 

example 4.8 



4.1 Flows transport neighborhoods 

As a swarm of representative points moves along, it carries along and distorts 
neighborhoods. The deformation of an infinitesimal neighborhood is best un- 
derstood by considering a trajectory originating near xj = x(0), with an initial 
infinitesimal displacement 6x(0). The flow then transports the displacement Sx(t) 
along the trajectory x(xq, t) = f ! (xo)- 
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4.1.1 Instantaneous shear 

The system of linear equations of variations for the displacement of the infinites- 
imally close neighbor x + 8x follows from the flow equations (2.7) by Taylor 
expanding to linear order 

Zdvi 
— — Sxj . 
OXj 

1 1 

The infinitesimal displacement 8x is thus transported along the trajectory x(xj, t), 
with time variation given by 



d v-i dvi 

-S Xi (x 0 ,t) = 2,^w 



J J 



6xj(x 0 ,t). (4.1) 

x=x(x 0 ,t) 



As both the displacement and the trajectory depend on the initial point and the 
time t, we shall often abbreviate the notation to x(xq, t) — > x(t) — > x, 5xi(xo, t) — > 
(5jc,(0 — * in what follows. Taken together, the set of equations 

Xi = Vj(x) , 5xj = ^ Ajj(x)Sxj (4.2) 

j 

governs the dynamics in the tangent bundle (x, 5x) e TA1 obtained by adjoining 
the cf-dimensional tangent space 5x e TM X to every point x e M in the J-dim- 
ensional state space M c K5'. The stability matrix (velocity gradients matrix) 



A iM ) = ^ (4-3) 

dXj 

describes the instantaneous rate of shearing of the infinitesimal neighborhood of 
x{i) by the flow. 



example 4.1 
p. 90 



4.1.2 Finite time linearized flow 

By Taylor expanding & finite time flow to linear order, 



ff(xo + Sx) = f'(xo) + ^ d -^-Sxj + ■ ■ ■ , (4.4) 

j 1 
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Figure 4.1: A local frame is transported along the or- v ^ 
bit and deformed by Jacobian matrix /. As J' is not 
self-adjoint, initial orthogonal frame is mapped into a 
non-orthogonal one. 



nv(0) 




one finds that the linearized neighborhood is transported by the Jacobian matrix 



remark 4.1 



Sx(t) - f(xo)6x 0 , 



Jt ( v dx i 

3 dxQj 



Axo) = 1 



(4.5) 



x=x(t) 



For example, in 2 dimensions the Jacobian matrix for change from initial to final 
coordinates is 



/ = (V*) T = 



d(x 0 ,yo) { £ g 



The map f is assumed invertible and differentiable so that / exists. For suf- 
ficiently short times / remains close to 1, so det / > 0. By continuity det / 
remains positive for all times t. However, for discrete time maps, det / can have 
either sign. 



4.1.3 Covariant frames 



/ describes the deformation of an infinitesimal neighborhood at finite time t in the 
co-moving frame of x(t), or, in the language of sect. 2.2.1, the Jacobian matrix 
maps the initial Lagrangian coordinate frame into the current Eulerian coordinate 
frame. This deformation of an initial frame at xo into a non-orthogonal frame at 
x{t) is described by the eigenvectors and eigenvalues of the Jacobian matrix of the 
linearized flow (see figure 4.1), 

jt e U) = Aje (j) , j= 1,2, (4.6) 

Throughout this text the symbol will always denote the kth eigenvalue (some- 
times referred to as the multiplier) of the finite time Jacobian matrix /. Symbol 
/I* will be reserved for the &th stability exponent, with real part /l® and phase 

A, = e a[k) = + iJ® . (4.7) 



As J' is a real matrix, its eigenvalues are either real or come in complex pairs, 



{A k ,A k+l } = {e n ^ k)+ ^ k) \e n ^-^} 
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with magnitude |At| and phase co. The phase describes the rotation velocity 
in the plane defined by the corresponding pair of real orthogonal eigenvectors, 
{Ree^Jme®}, with one period of rotation given by T = 2n/a) . 



example 4.4 
p. 91 

J\xq) depends on the initial point xq and the elapsed time t. For notational 
brevity we omitted this dependence, but in general both the eigenvalues and the 
eigenvectors, Ay = Ay(xo, t) , ••• , e (7) = e^(jco, t) , also depend on the trajectory 
traversed and the choice of coordinates. We shall refer to the eigenvectors 6& as 
covariant vectors. 

Nearby trajectories separate along the unstable directions, approach each other 
along the stable directions, and change their distance along the marginal direc- 
tions at a rate slower than exponential, corresponding to the eigenvalues of the Ja- 
cobian matrix with magnitude larger than, smaller than, or equal to 1 . In the litera- 
ture, the adjectives neutral, indifferent, center are often used instead of 'marginal'; 
(attracting) stable directions are sometimes called 'asymptotically stable,' and so 
on. 

One of the preferred directions is what one might expect, the direction of the 
flow itself. To see that, consider two initial points along a trajectory separated 
by infinitesimal flight time dt: 8xq = f St (xo) - xq = v(xo)6t . By the semigroup 
property of the flow, f t+St = f 6t+t , where 

Xst+t 
drv(x(T)) + f(xo) = 5tv(x(t)) + f(x 0 ) . 

Expanding both sides of f(f St (xo)) = f St (f'(xo)), keeping the leading term in 
dt, and using the definition of the Jacobian matrix (4.5), we observe that f(xo) 
transports the velocity vector at xq to the velocity vector at x(t) (see figure 4.1): 

v(x(t)) = J ! (x 0 )v(x Q ). (4.8) 




4.2 Computing the Jacobian 

As we started by assuming that we know the equations of motion, from (4.3) we 
also know stability matrix A, the instantaneous rate of shear of an infinitesimal 
neighborhood 5xj{t) of the trajectory x(t). What we do not know is the finite time 
deformation (4.5), so our next task is to relate the stability matrix A to Jacobian 
matrix J* . On the level of differential equations the relation follows by taking the 
time derivative of (4.5) and replacing 6x by (4.2) 

d dJ' 

— 5x(t) - — — 6xo - A Sx(t) - A J 6xo ■ 
dt dt 
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Hence the d 2 matrix elements of Jacobian matrix satisfy 'tangent linear equations,' 
the linearized equations (4.1) 

—f(x 0 ) = A(x) J'(x 0 ) , x = f(x 0 ) , initial condition J°(x 0 ) - 1 . (4.9) 
at 

For autonomous flows, the matrix of velocity gradients A(x) depends only on x, 
not time, while J' depends on both the state space position and time. Given a 
numerical routine for integrating the equations of motion, evaluation of the Jaco- 
bian matrix requires minimal additional programming effort; one simply extends 
the G?-dimensional integration routine and integrates concurrently with /(xo) the 
d 2 elements of J\xq). The qualifier 'simply' is perhaps too glib. Integration will 
work for short finite times, but for exponentially unstable flows one quickly runs 
into numerical over- and/or underflow problems. For high-dimensional flows the 
analytical expressions for elements of A might be so large that it fits on no com- 
puter. Further thought will have to go into implementation this calculation. 
F & chapter 26 

So now we know how to compute Jacobian matrix / given the stability matrix 
A, at least when the d 2 extra equations are not too expensive to compute. Mission 
accomplished. 

fast track: 
chapter 7, p. 1 25 

And yet... there are mopping up operations left to do. We persist until we de- 
rive the integral formula (4.19) for the Jacobian matrix, an analogue of the finite- 
time 'Green's function' or 'path integral' solutions of other linear problems. 

We are interested in smooth, differentiable flows. If a flow is smooth, in a suf- 
ficiently small neighborhood it is essentially linear. Hence the next section, which 
might seem an embarrassment (what is a section on linear flows doing in a book 
on nonlinear dynamics?), offers a firm stepping stone on the way to understanding 
nonlinear flows. Linear charts are the key tool of differential geometry, general 
relativity, etc., so we are in good company. If you know your eigenvalues and 
eigenvectors, you may prefer to fast forward here. 

fast track: 
sect. 4.4, p. 84 

4.3 A linear diversion 

Linear is good, nonlinear is bad. 
— Jean Bellissard 

Linear fields are the simplest vector fields, described by linear differential equa- 
tions which can be solved explicitly, with solutions that are good for all times. 
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The state space for linear differential equations is M - and the equations of 
motion (2.7) are written in terms of a vector x and a constant stability matrix A as 

x = v(x)=Ax. (4.10) 

Solving this equation means finding the state space trajectory 

x(t) = (Xi(t),X 2 (t),...,X d (t)) 

passing through a given initial point x®. If x(t) is a solution with x(0) = xq and 
y(t) another solution with y(0) - yo, then the linear combination ax(t) + by(t) with 
a, b e R is also a solution, but now starting at the point axo + by^. At any instant 
in time, the space of solutions is a cf-dimensional vector space, spanned by a basis 
of d linearly independent solutions. 

How do we solve the linear differential equation (4.10)? If instead of a matrix 
equation we have a scalar one, x - Ax, the solution is x(t) = i A xa . In order 
to solve the J-dimensional matrix case, it is helpful to rederive this solution by 
studying what happens for a short time step 8t. If at time t = 0 the position is x(0), 
then 

x{5t) - x{0) 

= Ax(0), (4.11) 

ot 

which we iterate m times to obtain Euler's formula for compounding interest 



x(t) « 1 1 + -A) x(0) « e a x(0) . (4.12) 
m 



The term in parentheses acts on the initial condition x(0) and evolves it to x(t) by 
taking m small time steps 5t = t/m. As m -» oo, the term in parentheses converges 
to e tA . Consider now the matrix version of equation (4.11): 



x(6t) - x(0) 

gt =Ax(0). (4.13) 



A representative point x is now a vector in W 1 acted on by the matrix A, as in 
(4. 10). Denoting by 1 the identity matrix, and repeating the steps (4. 1 1) and (4. 12) 
we obtain Euler's formula for the exponential of a matrix: 



x(t) = fx(0) , f = e tA = lim (l + —A) . (4.14) 

m-»oo \ m 



We will find this definition the exponential of a matrix helpful in the general case, 
where the matrix A - A(x(t)) varies along a trajectory. 
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As the system is a linear, a superposition of any two solutions to x(t) = Jx(0) 
is also a solution. One can take any d independent initial states, i^(0), x (2 \0), 
. . . ,.x (rf) (0), assemble them as columns of a matrix O(0), and formally write the 
solution for an arbitrary initial condition projected onto this basis, 

x(t) = 0(00(0)" 1 x(0) (4.15) 

where O(f) - [x (1 \t), x (2) (t), ■■■ , x (d) (t)]. O(r) is called the fundamental matrix of 
the system, and the Jacobian matrix / = O(?)O(0) _1 can thus be fashioned out of 
d trajectories {^\t)}. Numerically this works for sufficiently short times. 

Now that we have some feeling for the qualitative behavior of eigenvectors and 
eigenvalues of linear flows, we are ready to return to the nonlinear case. How do 
we compute the exponential (4.14)? 



example 4.2 TK^Sp 1 fast track: 

p. 90 S ect. 4.4, p. 84 

Henriette Roux: So, computing eigenvalues and eigenvectors seems like a good 
thing. But how do you really do it? 

A: Any text on numerics of matrices discusses how this is done; the keywords are 
Gram-Schmidt, and for high-dimensional flows 'Krylov subspace' and 'Arnoldi 
iteration'. Conceptually (but not for numerical purposes) we like the economical 
description of neighborhoods of equilibria and periodic orbits afforded by projec- 
tion operators. The requisite linear algebra is standard, but usually not phrased in 
language of projection operators. As this is a bit of sidetrack that you will find 
confusing at the first go, it is relegated to appendix C. 

Henriette Roux: In my 61,506-dimensional computation of a Navier-Stokes equi- 
librium I generated about 30 eigenvectors before I wanted to move on. How many 
of these eigenvectors are worth generating for a particular solution and why? 

A: A rule of the thumb is that you need all equilibrium eigenvalues / periodic orbit 
Floquet exponents with positive real parts, and at least those negative exponents 
whose magnitude is less or comparable to the largest expanding eigenvalue. More 
precisely; keep adding the next least contracting eigenvalue to the sum of the 
preceding ones as long as the sum is positive (Kaplan- Yorke criterion). Then, just 
to be conservative, double the number of eigenvalues you keep. You do not need to 
worry about the remaining (60 thousand!) eigen-directions for which the negative 
eigenvalues are of larger magnitude, as they always contract: nonlinear terms 
cannot mix them up in such a way that expansion in some directions overwhelms 
the strongly contracting ones. 



section 5.2.2 
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4.4 Stability of flows 

How do you determine the eigenvalues of the finite time local deformation J for 
a general nonlinear smooth flow? The Jacobian matrix is computed by integrating 
the equations of variations (4.2) 

x(f)=f(xo), 6x(xo,t) = J'(xo)Sx(xq,0) . (4.16) 

The equations are linear, so we should be able to integrate them-but in order to 
make sense of the answer, we derive this integral step by step. 

Consider the case of a general, non-stationary trajectory x{t). The exponential 
of a constant matrix can be defined either by its Taylor series expansion, or in 
terms of the Euler limit (4.14): 



—A k - lim (1 + —A] . (4.17) 

k\ m^oo \ ml 

k=0 

Taylor expanding is fine if A is a constant matrix. However, only the second, 
tax-accountant's discrete step definition of an exponential is appropriate for the 
task at hand, as for a dynamical system the local rate of neighborhood distortion 
A(x) depends on where we are along the trajectory. The linearized neighborhood 
is multiplicatively deformed along the flow, and the m discrete time-step approx- 
imation to / is therefore given by a generalization of the Euler product (4.17): 



J 1 = lim FT (1 + 6tA(x n )) = lim FT e StA{Xn) (4.18) 

m— *oo 1 1 m— >oo 1 1 

n=m n=m 

- lim e StAi ~ x «') e StA ( x >n-l) . . . e StA(xi) e 5tA(xi) 
m— >oo 



where 5t = (t - to)/m, and x„ = x(to + n6t). Indexing of the products indicates that 
the successive infinitesimal deformation are applied by multiplying from the left. 
The m — » oo limit of this procedure is the formal integral 



J-j(xo) = 



(4.19) 



where T stands for time-ordered integration, defined as the continuum limit of 
the successive left multiplications (4.18). This integral formula for / is the 
main conceptual result of this chapter, the finite time companion of the differential 
definition (4.9). The definition makes evident important properties of Jacobian 
matrices, such as that they are multiplicative along the flow, 

J t+t '{x) = J'(x') J\x), where x' - f{x Q ) , (4.20) 



exercise 4.5 
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an immediate consequence of time-ordered product structure of (1.18). However, 
in practice / is evaluated by integrating (4.9) along with the ODEs that define a 
particular flow. 



4.5 Stability of maps 



The transformation of an infinitesimal neighborhood of a trajectory under the iter- 
ation of a map follows from Taylor expanding the iterated mapping at finite time 
n to linear order, as in (4.4). The linearized neighborhood is transported by the 
Jacobian matrix evaluated at a discrete set of times n = 1,2,..., 



V X0) = — 



(4.21) 

x=x 0 



9 r,„ « V W) 



—fj(f( X )) = V 

° x i fr( d yk 



dfk(x) 
y=f(x) dxi 



If you prefer to think of a discrete time dynamics as a sequence of Poincare sec- 
tion returns, then (4.22) follows from (4.20): Jacobian matrices are multiplicative 
along the flow. 



example 4.10 »^-^^=> fast track: 

p. 96 [t3S? chapter 7, p. 125 



As in the finite time case (4.7), we denote by A* the &fh eigenvalue or multiplier 
of the finite time Jacobian matrix J" . There is really no difference from the con- 
tinuous time case, other than that now the Jacobian matrix is evaluated at integer 
times. 



example 4.9 
p. 95 

The formula for the linearization of nth iterate of a J-dimensional map 
/"(-to) = J(x„-i) ■ ■ ■ J(xOJ(xq) , xj - f J (x Q ) , (4.22) 

in terms of single time steps Jji = dfj/dxi follows from the chain rule for func- 
tional composition, 



exercise 6.4 
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Figure 4.2: If x(f) intersects the Poincare section 
P at time r, the nearby x(t) + Sx(t) trajectory inter- 
sects it time t + St later. As (W ■ v'St) = -(W ■ 
J 6x), the difference in arrival times is given by St = 
~{U' ■J6x)/(U' -v'). 




4.6 Stability of Poincare return maps 



(R. Paskauskas and P. Cvitanovic) 

We now relate the linear stability of the Poincare return map P : P — > P defined 
in sect. 3.1 to the stability of the continuous time flow in the full state space. 

The hypersurface P can be specified implicitly through a function U(x) that is 
zero whenever a point x is on the Poincare section. A nearby point x + Sxis in the 
hypersurface P if U(x + 5x) = 0, and the same is true for variations around the first 
return point x J = x(t), so expanding U(x') to linear order in variation 6x restricted 
to the Poincare section, and applying the chain rule leads to the condition 



Y dU(x') dx\ 
dxj dx: 



- 0. 



(4.23) 



v 



In what follows Uj - djU is the gradient of U defined in (3.3), unprimed quantities 
refer to the starting point x = xq e P, v = v(xrj), and the primed quantities to the 
first return: x' = x(t), V = v(x'), U' = U(x'). For brevity we shall also denote 
the full state space Jacobian matrix at the first return by / = J{xq). Both the first 
return x' and the time of flight to the next Poincare section t(x) depend on the 
starting point x, so the Jacobian matrix 



dx\ 

J(x)ij = — 
dx; 



(4.24) 



9 



with both initial and the final variation constrained to the Poincare section hyper- 
surface P is related to the continuous flow Jacobian matrix by 



dx\ 

dx; 



9 



dx ; dr dx lJ ' dx ,■ 



The return time variation dr/dx, figure 4.2, is eliminated by substituting this ex- 
pression into the constraint (4.23), 



o = diU' Jij + w -au')^-, 

dX; 
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yielding the projection of the full space cf-dimensional Jacobian matrix to the 
Poincare map (d- l)-dimensional Jacobian matrix: 



iv-ew),-'- <4 ' 25) 

Substituting (4.8) we verify that the initial velocity v(x) is a zero-eigenvector of/ 
Jv = 0 , (4.26) 



so the Poincare section eliminates variations parallel to v, and / is a rank (d- 1)- 
dimensional matrix, i.e., one less than the dimension of the continuous time flow. 



4.7 Neighborhood volume 



fa 



Consider a small state space volume AV = drx centered around the point xn at 

section G.d. 

time t = 0. The volume AV around the point x = x(t) time t later is remark 6 1 



, AV 
AV' = —AV = 



, dx' 




det- 


AV= < 


ox 



(4.27) 



so the |det/| is the ratio of the initial and the final volumes. The determinant 
det J\xq) - ]~[f = i A;(xo, t) is the product of the Jacobian matrix eigenvalues. We 
shall refer to this determinant as the Jacobian of the flow. The Jacobian is easily 
evaluated. Take the time derivative, use the / evolution equation (1.9) and the 
matrix identity In det / = tr In J: 



— lnAV(t) = — bidet/ = tr — In/ = tr-/ = trA = dm. 

dt dt dt J 



(Here, as elsewhere in this book, a repeated index implies summation.) Integrate 
both sides to obtain the time evolution of an infinitesimal volume 



exercise 4.1 



det J\xq) = exp 



f drtr 
Jo 



A(x(t)) 



f 

Jo 



= exp dr djVj(x(T)) 



(4.28) 



As the divergence 3,-v,- is a scalar quantity, the integral in the exponent (4.19) needs 
no time ordering. So all we need to do is evaluate the time average 



dm = lim - I dT V A u {x{t)) 

t^oo t Jo £J 

d d 

]a,<x 0 ,o = J>'Wo 



I In 

t 



(4.29) 
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along the trajectory. If the flow is not singular (for example, the trajectory does 
not run head-on into the Coulomb 1/r singularity), the stability matrix elements 
are bounded everywhere, |A ;; | < M , and so is the trace 2, A;/. The time integral 
in (4.29) thus grows at most linearly with t, d[V\ is bounded for all times, and 
numerical estimates of the t —> oo limit in (4.29) are not marred by any blowups. 



Even if we were to insist on extracting <9,v, from (4.18) by first multiplying 
Jacobian matrices along the flow, and then taking the logarithm, we can avoid ex- 
ponential blowups in / by using the multiplicative structure (4.20), det J (jco) = 
det J' (x r ) det J'(xq) to restart with J°(x') = 1 whenever the eigenvalues of /(xq) 
start getting out of hand. In numerical evaluations of stability exponents the sum 
rule (4.29) can serve as a helpful check on the accuracy of the computation. 

The divergence 3/V; characterizes the behavior of a state space volume in the 
infinitesimal neighborhood of the trajectory. If o)vj < 0, the flow is locally con- 
tracting, and the trajectory might be falling into an attractor. If b]vj(x) < 0 , for 
all x € M, the flow is globally contracting, and the dimension of the attractor is 
necessarily smaller than the dimension of state space At. If b}vj - 0, the flow 
preserves state space volume and det / = 1. A flow with this property is called 
incompressible. An important class of such flows are the Hamiltonian flows 
considered in sect. 7.3. 

But before we can get to that, Henriette Roux, the perfect student and always 
alert, pipes up. She does not like our definition of the Jacobian matrix in terms of 
the time-ordered exponential (4. 19). Depending on the signs of multipliers, the 
left hand side of (4.28) can be either positive or negative. But the right hand side 
is an exponential of a real number, and that can only be positive. What gives? As 
we shall see much later on in this text, in discussion of topological indices arising 
in semiclassical quantization, this is not at all a dumb question. 



Resume 

A neighborhood of a trajectory deforms as it is transported by a flow. In the 
linear approximation, the stability matrix A describes the shearing / compression 
/ expansion of an infinitesimal neighborhood in an infinitesimal time step. The 
deformation after a finite time t is described by the Jacobian matrix 



where T stands for the time-ordered integration, defined multiplicatively along 
the trajectory. For discrete time maps this is multiplication by time-step Jacobian 



example 4.8 
p. 95 



/(*>) = Te£ dTA(x(T)) , 
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matrix / along the n points xq, x\, xi, ■ ■ -, x n -\ on the trajectory of xq, 

J'\xq) = J{x n -\)J(Xn-2) ■ ■ ■ J{xi)J(xq) , 

with J(x) the single discrete time-step Jacobian matrix. In ChaosBook the stability 
multiplier denotes the kth eigenvalue of the finite time Jacobian matrix J(xq), 
f/® the real part of kth stability exponent, and (J® its phase, 

For complex eigenvalue pairs the 'angular velocity' co describes rotational motion 
in the plane spanned by the real and imaginary parts of the corresponding pair of 
complex eigenvectors. 

The eigenvalues and eigen-directions of the Jacobian matrix describe the de- 
formation of an initial infinitesimal cloud of neighboring trajectories into a dis- 
torted cloud a finite time t later. Nearby trajectories separate exponentially along 
unstable eigen-directions, approach each other along stable directions, and change 
slowly (algebraically) their distance along marginal, or center directions. The Ja- 
cobian matrix / is in general neither symmetric, nor diagonalizable by a rotation, 
nor do its (left or right) eigenvectors define an orthonormal coordinate frame. 
Furthermore, although the Jacobian matrices are multiplicative along the flow, in 
dimensions higher than one their eigenvalues in general are not. This lack of 
multiplicativity has important repercussions for both classical and quantum dy- 
namics. 



Commentary 

Remark 4.1 Linear flows. The subject of linear algebra generates innumerable tomes 
of its own; in sect. 4.3 we only sketch, and in appendix C recapitulate a few facts that our 
narrative relies on: a useful reference book is [ 1 ]. The basic facts are presented at length 
in many textbooks. Frequently cited linear algebra references are Golub and Van Loan [ 2], 
Coleman and Van Loan [3], and Watkins [4, 5]. The standard references that exhaus- 
tively enumerate and explain all possible cases are Hirsch and Smale [6] and Arnol'd [7]. 
A quick overview is given by Izhikevich [8]; for different notions of orbit stability see 
Holmes and Shea-Brown [9]. For ChaosBook purposes, we enjoyed the discussion in 
chapter 2 Meiss [ 0], chapter 1 of Perko [11] and chapters 3 and 5 of Glendinning [12], 
and liked the discussion of norms, least square problems, and differences between sin- 
gular value and eigenvalue decompositions in Trefethen and Bau [13]. Truesdell [2] and 
Gurtin [3] are excellent references for the continuum mechanics perspective on state space 
dynamics. We enjoyed Christov et al. [1] easy introduction to parallels between dynami- 
cal systems and continuum mechanics. 

The nomenclature tends to be a bit confusing. Jacobian matrix (4.5) is sometimes 
referred to as the fundamental solution matrix or simply fundamental matrix, a name in- 
herited from the theory of linear ODEs, or the Frechet derivative of the nonlinear mapping 
f'(x), or the 'tangent linear propagator' , or even, by Lorenz [?], the 'error matrix'. As 
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/ describes the deformation of an infinitesimal neighborhood at finite time t in the co- 
moving frame of x(t), in continuum mechanics it is called the deformation gradient, or 
transplacement gradient. It is often denoted Df, but for our needs (we shall have to sort 
through a plethora of related Jacobian matrices) matrix notation J is more economical. 
Single discrete time-step Jacobian Jji = dfj/dxi in (4.22) is referred to as the 'tangent 
map' by Skokos [15, 16]. 

In referring to the 'velocity gradients matrix' or 'velocity gradient tensor' A defined 
in (4.3) as the 'stability matrix' we follow Tabor [ -■]. It is the natural object for study 
of stability of equilibria, time-invariant point in state space; stability of trajectories is de- 
scribed by Jacobian matrices. Goldhirsch, Sulem, and Orszag [ 17] call it the 'Hessenberg 
matrix', and to the equations of variations (4.1) as 'stability equations.' Manos et al. [?] 
refer to (4.1) as the 'variational equations'. 

Sometimes A, which describes the instantaneous shear of the neighborhood of x(x o, t), 
is referred to as the 'Jacobian matrix,' a particularly unfortunate usage when one considers 
linearized stability of an equilibrium point (5.1). A is not a Jacobian matrix, just as a 
generator of SO(2) rotation is not a rotation; A is a generator of an infinitesimal time 
step deformation, J 6 ' ^ 1 + A5t . What Jacobi had in mind in his 1841 fundamental 
paper [ ] on the determinants today known as 'jacobians' were transformations between 
different coordinate frames. These are dimensionless quantities, while dimensionally A y 
is l/[time]. 

More unfortunate still is referring to the Jacobian matrix J' = exp(fA) as an 'evolution 
operator,' which here (see sect. 17.2) refers to something altogether different. In this book 
Jacobian matrix /' always refers to (4.5), the linearized deformation after a finite time t, 
either for a continuous time flow, or a discrete time mapping. 



4.8 Examples 

The reader is urged to study the examples collected here. To return back to the 
main text, click on [click to return] pointer on the margin. 



Example 4.1 Rossler and Lorenz flows, linearized: (continued from example 3.5) For 
the Rossler (2. 18) and Lorenz (2. 13) flows the stability matrices are, respectively 



( 0 
1 

z 



-1 -1 ^ 
a 0 
0 x — c j 



A lnr = 



p- z -1 

y x 



0 ^ 

x 

-b ) 



(4.30) 



(continued in example 4.5) 



click to return: p. 78 



Example 4.2 Jacobian matrix eigenvalues, diagonalizable case: Should we be 
so lucky that A - A D happens to be a diagonal matrix with eigenvalues [A (l \ A (2) , A {d) ), 
the exponential is simply 



l e' AW 



J' = e tA ° 



0 ^ 



(4.31) 
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Figure 4.3: Streamlines for several typical 2- 
dimensional flows: saddle (hyperbolic), in node (at- 
tracting), center (elliptic), in spiral. 




Next, suppose that A is diagonalizable and that U is a nonsingular matrix that brings it 
to a diagonal formA D = U~ l AU. Then J can also be brought to a diagonal form (insert 
factors 1 = UU~ l between the terms of the product (4. 14)): 

exercise 4.2 

f = e tA = Ue tAD U~ 1 . (4.32) 

The action of both A and J is very simple; the axes of orthogonal coordinate system 
where A is diagonal are also the eigen-directions of J', and under the flow the neigh- 
borhood is deformed by a multiplication by an eigenvalue factor for each coordinate 
axis. 

We recapitulate the basic facts of linear algebra in appendix C. The following 
2-dimensional example serves well to highlight the most important types of linear 
flows: 

Example 4.3 Linear stability of 2-dimensional flows: For a 2-dimensional flow the 
eigenvalues A {l) ,A (2) of A are either real, leading to a linear motion along their eigen- 
vectors, Xj(f) = xj(Q) sxp(tA <j) ), or a form a complex conjugate pair A (l) = fi + ioj,A {2) = 
[i-ito, leading to a circular or spiral motion in the [xu *i] plane. 

These two possibilities are refined further into sub-cases depending on the 
signs of the real part. In the case of real A {1) > 0, A {2) < 0, x\ grows exponentially 
with time, and x 2 contracts exponentially. This behavior, called a saddle, is sketched in 
figure 4.3, as are the remaining possibilities: in/out nodes, inward/outward spirals, and 
the center. The magnitude of out-spiral \x{t)\ diverges exponentially when fi > 0, and 
in-spiral contracts into (0, 0) when the ,u < 0, whereas the phase velocity co controls its 
oscillations. 

If eigenvalues A (V> = A {2) = A are degenerate, the matrix might have two linearly 
independent eigenvectors, or only one eigenvector. We distinguish two cases: (a) 
A can be brought to diagonal form, (b) A can be brought to Jordan form, which (in 
dimension 2 or higher) has zeros everywhere except for the repeating eigenvalues on 
the diagonal, and some 1 's directly above it. For every such Jordan [d a xd a ] block there 
is only one eigenvector per block. 

We sketch the full set of possibilities in figures 4.3 and 4.4, and work out in 

detail the most important cases in appendix C, example C.3. 

r ' r click to return: p. 83 



Example 4.4 In-out spirals. Consider an equilibrium whose stability exponents 
{A {l) ,A (2) } = [pi + icj,n - iu) form a complex conjugate pair. The corresponding com- 
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saddle out node in node 



Figure 4.4: Qualitatively distinct types of expo- 
nents (i (1) ,/t <2) ) of a [2x2] Jacobian matrix. 




plex eigenvectors can be replaced by their real and imaginary parts, {e (1) ,e {2) } 
{Re e (1) , Ime (1) ). The 2-dimensional real representation, 



C 7Mi !M! o) 



consists of the identity and the generator of SO(2) rotations in the {Re e (1) , Im e (1) } plane. 
Trajectories x(t) = J' x(0), where (omitting e (3) , e (4) , • • ■ eigen-directions) 

jt = e A q t = e¥ l^ut -sin<*\ 
\ sin cot cos ut j 

spiral in/out around (x,y) = (0,0), see figure 4.3, with the rotation period T, and con- 
traction/expansion radially by the multiplier A mdial , and by the multiplier Aj along the 
e 0) eigen-direction per a turn of the spiral: 



exercise C.1 



T = 2tt/<u, A radial = , Aj = e T " m . (4.34) 

We learn that the typical turnover time scale in the neighborhood of the equilibrium 
(x,y) = (0,0) is of order * T (and not, let us say, 1000 T, or 10- 2 T). Aj multipliers 
give us estimates of strange-set thickness in eigen-directions transverse to the rotation 
plane. 



Example 4.5 Stability of equilibria of the Rossler flow. (continued from ex- 
ample 4.1) The Rosier system (2.18) has two equilibrium points (2.19), the inner 
equilibrium (*_,)>_, z_), and the outer equilibrium point (x + ,y + ,z + ). Together with 
exponents (eigenvalues of the stability matrix), the two equilibria yield quite cfefa^ecP 
information about the flow. Figure 4.5 shows two trajectories which start in the neigh- 
borhood of the outer '+ ' equilibrium. Trajectories to the right of the equilibrium point '+ ' 
escape, and those to the left spiral toward the inner equilibrium point '- ', where they 
seem to wander chaotically for all times. The stable manifold of outer equilibrium point 
thus serves as the attraction basin boundary. Consider now the numerical values for 
eigenvalues of the two equilibria 

(H-\^- ± i w ( - 2) ) = ( -5.686, 0.0970 ± / 0.9951 ) 

(|/ < l 1) ,ju { + 2) + iojf) = (0.1929, -4.596 x 10~ 6 ± i 5.428 ) 

Outer equilibrium: The complex eigenvalue pair implies that neighborhood 

of the outer equilibrium point rotates with angular period T \ « |2^/w ( + 2) | = 1.1575. The 
multiplier by which a trajectory that starts near the '+' equilibrium point contracts in the 
stable manifold plane is the excruciatingly slow multiplier A^ ~ exp(// ( l 2) r + ) = 0.9999947 
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Figure 4.5: Two trajectories of the Rossler flow initi- 
ated in the neighborhood of the '+' or 'outer' equilib- 
rium point (2.19). (R. Paskauskas) 




per rotation. For each period the point of the stable manifold moves away along the 
unstable eigen-direction by factor A| w exp(jx+ ) T+) = 1.2497. Hence the slow spiraling 
on both sides of the '+ ' equilibrium point. 

Inner equilibrium: The p. (2) + iu> {2) complex eigenvalue pair tells us that neighbor- 
hood of the '-' equilibrium point rotates with angular period T « |2w/w®| = 6.313, 
slightly faster than the harmonic oscillator estimate in (2.15). The multiplier by which 
a trajectory that starts near the '-' equilibrium point spirals away per one rotation is 
^radial ~ expOu^T-) = 1-84. The fi_ eigenvalue is essentially the z expansion cor- 
recting parameter c introduced in (2. 1 7). For each Poincare section return, the trajec- 
tory is contracted into the stable manifold by the amazing factor of Ai « expQrjT-) = 

irr 15 - 6 (!). 

Suppose you start with a 1 mm interval pointing in the A] eigen-direction. After 
one Poincare return the interval is of order of 10~ 4 fermi, the furthest we will get into sub- 
nuclear structure in this book. Of course, from the mathematical point of view, the flow 
is reversible, and the Poincare return map is invertible. (continued in example 1 1.3) 

(R. Paskauskas) 



Example 4.6 Stability of Lorenz flow equilibria: (continued from example 4.1) A 
glance at figure 3.4 suggests that the flow is organized by its 3 equilibria, so lets have 
a closer look at their stable/unstable manifolds. 

The EQq equilibrium stability matrix (4.30) evaluated at x EQo = (0, 0, 0) is block- 
diagonal. The z-axis is an eigenvector with a contracting eigenvalue A {1) = -b. From^ ^ 
(4.41) it follows that all [x,y] areas shrink at rate -(cr + 1). Indeed, the [x,y] submatrix 

7 -i i < 4 - 36 > 



has a real expanding/contracting eigenvalue pairA <l ' 3) = -(o-+ 1)/2± ^j(cr - l) 2 /4 + pcr, 
with the right eigenvectors e {1) , e (3) in the [x,y] plane, given by (either) column of the 
projection operator 

A--A U) 1 1 / -cr-A^ cr \ 

P,. = — — = — I , ^ )fn , i + j e {1, 3} . (4.37) 

^(0-^0) A^-A^y p -l-A u) j 

EQ\,i equilibria have no symmetry, so their eigenvalues are given by the roots 
of a cubic equation, the secular determinant det(A - Al) = 0: 

A 3 + A 2 {cr + b + 1) + Ab(a- + p) + 2o-b(p - 1) = 0 . (4.38) 

For p > 24.74, EQ U2 have one stable real eigenvalue and one unstable complex con- 
jugate pair, leading to a spiral-out instability and the strange attractor depicted in fig- 
ure 2.5. 
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Figure 4.6: (a) A perspective view of the lin- 
earized Lorenz flow near EQ\ equilibrium, see fig- 
ure 3.4(a). The unstable eigenplane of EQ\ is 
spanned by Ree (1) and Ime' 1 '. The stable eigen- 
vector e <3) . (b) Lorenz flow near the EQo equi- 
librium: unstable eigenvector e (1) , stable eigen- 
vectors e <2) , e (3) . Trajectories initiated at distances 
1CT 8 • • • 1CT 12 , 10~ 13 away from the z-axis exit fi- 
nite distance from EQ 0 along the (e (l, ,e (2) ) eigen- 
vectors plane. Due to the strong expansion, the 
EQo equilibrium is, for all practical purposes, un- 
reachable, and the EQi — > EQ 0 heteroclinic con- 
nection never observed in simulations such as fig- 
ure 2.5. (E. Siminos; continued in figure 11.8.) 




(b) 



As all numerical plots of the Lorenz flow are here carried out for the Lorenz 
parameter choice <r = 10, b = 8/3, p = 28 , we note the values of these eigenvalues for 
future reference, 

EQo ■ (A m , A (2) , A {3) ) = ( 11.83,-2.666, -22.83) 

EQi : (// (1) ± /w^U®) = ( 0.094 ±i 10.19, -13.85), { } 

as well as the rotation period T EQl = 2n/to (1) about EQ\, and the associated expan- 
sion/contraction multipliers A® = exp(fj. U) T EQl ) per a spiral-out turn: 

T EQl =0.6163, (A (1) ,A (3) ) = ( 1.060,1. 957 xl0~ 4 ). (4.40) 

We learn that the typical turnover time scale in this problem is of order T w T EQl w 1 
(and not, let us say, 1000, or 10~ 2 ). Combined with the contraction rate (4.41), this tells 
us that the Lorenz flow strongly contracts state space volumes, by factor of ~ 10~ 4 per 
mean turnover time. 

In the EQi neighborhood the unstable manifold trajectories slowly spiral out, 
with very small radial per-turn expansion multiplier A m 1.06, and very strong con- 
traction multiplier A (3) 10~ 4 onto the unstable manifold, figure 4.6(a). This contraction 
confines, for all practical purposes, the Lorenz attractor to a 2-dimensional surface evi- 
dent in the section figure 3.4. 

In the x EQo = (0,0,0) equilibrium neighborhood the extremely strong A (3) » 
-23 contraction along the e (3) direction confines the hyperbolic dynamics near EQ 0 to 
the plane spanned by the unstable eigenvector e (1) , with A {1) 12, and the slowest 
contraction rate eigenvector e (2) along the z-axis, with A (2) -3. In this plane the strong 
expansion along e (1) overwhelms the slow A <2> -3 contraction down the z-axis, making 
it extremely unlikely for a random trajectory to approach EQo, figure 4.6(b). Thus 
linearization suffices to describe analytically the singular dip in the Poincare sections 
of figure 3.4, and the empirical scarcity of trajectories close to EQo. (continued in 
example 4.8) 

(E. Siminos and J. Halcrow) 



Example 4.7 Lorenz flow: Global portrait. (continued from example 4.6) As the 
EQi unstable manifold spirals out, the strip that starts out in the section above EQ | in 
figure 3.4 cuts across the z-axis invariant subspace. This strip necessarily contains a 
heteroclinic orbit that hits the z-axis head on, and in infinite time (but exponentially fast) 
descends all the way to EQo. 
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Figure 4.7: A unimodal map, together with fixed 
points 0, 1, 2-cycle 01 and 3-cycle Oil. 
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How? As in the neighborhood of the EQ 0 equilibrium the dynamics is linear 
(see figure 4.6(a)), there is no need to integrate numerically the final segment of the 
heteroclinic connection - it is sufficient to bring a trajectory a small distance away from 
EQ 0 , continue analytically to a small distance beyond EQ 0 , then resume the numerical 
integration. 

What happens next? Trajectories to the left of z-axis shoot off along the e (1) 
direction, and those to the right along -e ( ". As along the e (1) direction xy > 0, the 
nonlinear term in the z equation (2. 13) bends both branches of the EQq unstable man- 
ifold W"(EQ 0 ) upwards. Then ... - never mind. Best to postpone the completion of 
this narrative to example 9. 14, where the discrete symmetry of Lorenz flow will help us 
streamline the analysis. As we shall show, what we already know about the 3 equilib- 
ria and their stable/unstable manifolds suffices to completely pin down the topology of 
Lorenz flow, (continued in example 9. 14) 

(E. Siminos and J. Halcrow) 



Example 4.8 Lorenz flow state space contraction: (continued from exam- 
ple 4.6) It follows from (4.30) and (4.29) that Lorenz flow is volume contracting, 

3 

d iVi = ^ & (x, f) = -cr-b-l, (4.41) 



at a constant, coordinate- and p-independent rate, set by Lorenz to <9,v, = -13.66 . As 
for periodic orbits and for long time averages there is no contraction/expansion along 
the flow, = 0, and the sum of A {t) is constant by (4.4 1 ), there is only one independent 
exponent A i ' ) to compute, (continued in example 4. 7) 
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Example 4.9 Stability of a 1 -dimensional map: Consider the orbit {. . .,x-i,xo,x\,X2, ■ . ■} 

of a 1 -dimensional map x„ +1 = f(x n ). Since point x„ is carried into point x ll+l , in study- 
ing linear stability (and higher derivatives) of the map it is often convenient to deploy 
a local coordinate systems z a centered on the orbit points x a , together with a notation 
for the map, its derivative, and, by the chain rule, the derivative of the kth iterate f k 
evaluated at the point x a , 

X = X a + Za , fa(Za) = f(x a + Z a ) 
fa = f'iXa) 

A( X0 ,k) = f?=f a *-i-fa+lfa, * > 2 . (4.42) 
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Here a is the label of point x a , and the label a + l is a shorthand for the next point b on 
the orbit ofx a , x b = x a+1 = f(x a ). For example, a period-3 periodic point in figure 4.7 
might have label a = 01 1, and by x U o = /Oon) the next point label is b = 1 10 
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Example 4.10 Henon map Jacobian matrix: For the Henon map (3. 1 7) the Jaco- 
bian matrix for the nth iterate of the map is 



M n (x 0 ) = f[ ( 2 \ Xm o ) , x m = f?(x 0 ,y 0 ) • 



(4.43) 



The determinant of the Henon one time-step Jacobian matrix (4.43) is constant, 

detM = AiA 2 = -b (4.44) 



so in this case only one eigenvalue Ai = -b/A 2 needs to be determined. This is not 
an accident; a constant Jacobian was one of desiderata that led Henon to construct a 
map of this particular form. 
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Exercises 



4. 1 . Trace-log of a matrix. Prove that 

detM = e trlnM . 

for an arbitrary nonsingular finite dimensional matrix M, 
detM + 0. 

4.2. Stability, diagonal case. Verify the relation (4.32) 

J> = e tA = irV^U , A D = UAIT 1 . 

4.3. State space volume contraction. 



(a) Compute the Rossler flow volume contraction rate 
at the equilibria. 

(b) Study numerically the instantaneous <9,v, along a 
typical trajectory on the Rossler attractor; color- 
code the points on the trajectory by the sign (and 
perhaps the magnitude) of <9jV;. If you see regions 
of local expansion, explain them. 

(c) (optional) color-code the points on the trajectory 
by the sign (and perhaps the magnitude) of d , v, - 

dm. 

(d) Compute numerically the average contraction rate 
(4.29) along a typical trajectory on the Rossler at- 
tractor. Plot it as a function of time. 



(e) Argue on basis of your results that this attractor is 
of dimension smaller than the state space d — 3. 

(f) (optional) Start some trajectories on the escape 
side of the outer equilibrium, color-code the points 
on the trajectory. Is the flow volume contracting? 

(continued in exercise 20.1 1) 

4.4. Topology of the Rossler flow. (continuation of exer- 
cise 3.1) 

(a) Show that equation |det(A - Al)\ = 0 for Rossler 
flow in the notation of exercise 2.8 can be written 



A i +A 2 c{p f -e)+A(p ± le+\-c 2 ep f )+c VD = 

(b) Solve (4.45) for eigenvalues A* for each equilib- 
rium as an expansion in powers of e. Derive 

A\ - -c + ec/(c 2 + 1) + o(e) 
A- = ec 3 /[2(c 2 + l)] + o(e 2 ) 

l+e/[2(c 2 +l)]+ 0 (e) 
ce( l_ e) + o(e 3 ) (4-46) 

AX = -e 5 c 2 /2 + o(e 6 ) 



6»+ = VI + +o(6)) 
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Compare with exact eigenvalues. What are dy- 
namical implications of the extravagant value of 
/tj"? (continued as exercise 13.10) 

(R. Paskauskas) 

4.5. Time-ordered exponentials. Given a time dependent 
matrix V(t) check that the time-ordered exponential 

<U(t) = Te£ dTV(j) 
may be written as 



with the initial condition 1/(0) = 1. 

4.6. A contracting baker's map. Consider a contracting 
(or 'dissipative') baker's map, acting on a unit square 
[0, 1] 2 = [0,1] x [0,1], defined by 



00 r-t rti r'«i-\ 
<U(t) = V Ai dt 2 ■ ■ ■ dt m V(t 
Jo Jo Jo 



i)---V(t m ) 



and verify, by using this representation, that U{t) satis- 
fies the equation 

Hit) = v(tyu(t), 



y n +i 
y n +i 



x„/3 
2y„ 



y n < l/2 



XJ3 + 1/2 

2y„ - 1 



y„ > 1/2. 



This map shrinks strips by a factor of 1/3 in the x- 
direction, and then stretches (and folds) them by a factor 
of 2 in the y-direction. 

By how much does the state space volume contract for 
one iteration of the map? 
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Chapter 5 
Cycle stability 



We owe it to a book to withhold judgment until we reach 
page 100. 



— Henrietta McNutt, George Johnson's seventh- 
grade English teacher 




opological features of a dynamical system -singularities, periodic orbits, 



and the ways in which the orbits intertwine- are invariant under a general 



continuous change of coordinates. Equilibria and periodic orbits are flow- 
invariant sets, in the sense that the flow only shifts points along a periodic orbit, 
but the periodic orbit as the set of periodic points remains unchanged in time. Sur- 
prisingly, there also exist quantities that depend on the notion of metric distance 
between points, but nevertheless do not change value under a smooth change of 
coordinates. Local quantities such as the eigenvalues of equilibria and periodic 
orbits, and global quantities such as Lyapunov exponents, metric entropy, and 
fractal dimensions are examples of properties of dynamical systems independent 
of coordinate choice. 

We now turn to the first, local class of such invariants, linear stability of equi- 
libria and periodic orbits of flows and maps. This will give us metric information 
about local dynamics, as well as the key concept, the concept of a neighborhood 
of a point x: its size is determined by the number of expanding directions, and the 
rates of expansion along them: contracting directions play only a secondary role 
(see sect. 5.6). 

If you already know that the eigenvalues of periodic orbits are invariants of a 
flow, skip this chapter. 



As noted on page 41, a trajectory can be stationary, periodic or aperiodic. For 
chaotic systems almost all trajectories are aperiodic-nevertheless, equilibria and 
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periodic orbits turn out to be the key to unraveling chaotic dynamics. Here we 
note a few of the properties that make them so precious to a theorist. 



5.1 Equilibria 

For a start, consider the case where x q is an equilibrium point (2.9). Expanding 
around the equilibrium point Xq, using the fact that the stability matrix A = A(xq) 
in (4.2) is constant, and integrating, f(x) - x q + e At {x - x q ) + • • • , we verify that 
the simple formula (4.14) applies also to the Jacobian matrix of an equilibrium 
point, 

J\ = eV , f q = J\x q ) , A q = A(x q ) . (5. 1) 

As an equilibrium point is stationary, time plays no role, and the eigenvalues and 
the eigenvectors of stability matrix Aq evaluated at the equilibrium point Xq, 

A q ^> = if e U> , (5.2) 

describe the linearized neighborhood of the equilibrium point, with stability expo- 
nents Ap^ = fip ±ia)p independent of any particular coordinate choice. The eigen- 
vectors are also the eigenvectors of the Jacobian matrix, } e (;) - exp(tx£ ) e (i) . 

• If all fjy' < 0, then the equilibrium is stable, or a sink. 

• If some //j) < 0, and other f/^ > 0, the equilibrium is hyperbolic, or a 
saddle. 

• If all fjy' > 0, then the equilibrium is repelling, or a source. 

• If some f/y> = 0, think again (you have a symmetry or a bifurcation). 



5.2 Periodic orbits 

An obvious virtue of periodic orbits is that they are topological invariants: a 
fixed point remains a fixed point for any choice of coordinates, and similarly a 
periodic orbit remains periodic in any representation of the dynamics. Any re- 
parametrization of a dynamical system that preserves its topology has to preserve 
topological relations between periodic orbits, such as their relative inter-windings 
and knots. So the mere existence of periodic orbits suffices to partially organize 
the spatial layout of a non-wandering set. No less important, as we shall now 
show, is the fact that cycle eigenvalues are metric invariants: they determine the 
relative sizes of neighborhoods in a non-wandering set. 
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We start by noting that due to the multiplicative structure (1.20) of Jacobian 
matrices, the Jacobian matrix for the rth repeat of a prime cycle p of period T is 

J rT {x) = f{f (r ' l)T {x)) ■ ■ ■ J T (f(x))J T (x) = J p (x) r , (5.3) 

where J p (x) = J T (x) is the Jacobian matrix for a single traversal of the prime 
cycle p, x e M p is any point on the cycle, and f T {x) = x as f'(x) returns to x 
every multiple of the period T. Hence, it suffices to restrict our considerations to 
the stability of prime cycles. 

fast track: 
sect. 5.3, p. 104 

5.2.1 Floquet theory 

When dealing with periodic orbits, some of the quantities already introduced in 
chapter 4 inherit names from the Floquet theory of differential equations with 
time-periodic coefficients. Consider the equation of variations (4.2) evaluated on 
a periodic orbit p of period T, at point x(t) e M p , 

Sx = A(t) 8x , A(t) = A(t + T) , 

with A(t) = A(x(t)). The periodicity of the stability matrix implies that if 5x(t) 
is a solution, then also 6x(t + T) satisfies the same equation: moreover the two 
solutions are related by (4.5) 

5x(t + T) = J p (x) 6x(t) , xeM p . (5.4) 

Even though the Jacobian matrix J p (x) depends upon x (the 'starting' point of the 
periodic orbit), we shall show in sect. 5.3 that its eigenvalues do not, so we may 
write the eigenvalue equation as 

J p (x) e (j) (x) = Aj e (J) (x) , (5.5) 

where Ay are independent of x, and we refer to eigenvectors as 'covariant 
vectors,' or, for periodic orbits, as 'Floquet vectors'. 

Expand Sx in the (5.5) eigenbasis, 5x(t) = £ 5xj(t)e U) , e 0) = e O) (x(0)) . 
Taking into account (5.4), we get that 6xj(t) is multiplied by Ay per each period 

8x(t + T) = V 5xj(t + T) e 0) = Ay 6xj(t) e 0) . 

j j 
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Figure 5.1: For a prime cycle p, Floquet matrix 
J p returns an infinitesimal spherical neighborhood of 
xq 6 M p stretched into an ellipsoid, with overlap ra- 
tio along the eigendirection e <j) of J p (x) given by the 
Floquet multiplier |A ; |. These ratios are invariant un- 
der smooth nonlinear reparametrizations of state space 
coordinates, and are intrinsic property of cycle p. 




We can absorb this exponential growth / contraction by rewriting the coefficients 
6xj(t) as 6xj(t) = exp(/l (;) ?) uj(t) , uj(0) = 5xj(0) . Thus each solution of the 
equation of variations (4.2) may be expressed in the Floquet form, 

j 

with uj(t) periodic with period T. The exp(A^t) factor is not an eigenvalue of the 
Jacobian matrix f, it is only an interpolation between x and f T (x). The contin- 
uous time t in (5.6) does not imply that eigenvalues of the Jacobian matrix enjoy 
any multiplicative property for t + rT: exponents refer to a full traversal of 
the periodic orbit. Indeed, while uj(t) describes the variation of 5x{t) with respect 
to the stationary eigen-frame fixed by eigenvectors at the point x(0), the object of 
dynamical significance is the co-moving eigen-frame defined below in 0.12). 



5.2.2 Cycle stability 

The time-dependent T-periodic vector fields, such as the flow linearized around 
a periodic orbit, are described by Floquet theory. Hence from now on we shall 
refer to a Jacobian matrix evaluated on a periodic orbit p either as a [dxd] Floquet 
matrix J p or a [(d- 1) x (d- 1)] monodromy matrix M p , to its eigenvalues Aj as 
Floquet multipliers (5.5), and to /ll = fi p + ia)p as Floquet exponents. The 
stretching/contraction rates per unit time are given by the real parts of Floquet 
exponents 

nf = ^r^ p J[ . (5.7) 
1 p 

The factor l/T p in the definition of the Floquet exponents is motivated by its form 
for the linear dynamical systems, for example (4.31), as well as the fact that expo- 
nents so defined can be misinterpreted as Lyapunov exponents (??) evaluated on 
the prime cycle p. When Aj is real, we do care about a® = Aj/\Aj\ e {+1, -1} , 
the sign of the jth Floquet multiplier. If cr^ = -1 and \ Aj\ + 1, the corresponding 
eigen-direction is said to be inverse hyperbolic. Keeping track of this by case- 
by-case enumeration is an unnecessary nuisance, so most of our formulas will 



section 7.3 



invariants - 17mar2013 



ChaosBook.org version!4.4. 1, Apr21 2013 



CHAPTER 5. CYCLE STABILITY 



103 



Figure 5.2: An unstable periodic orbit repels every 
neighboring trajectory y! (f), except those on its center 
and stable manifolds. 




be stated in terms of the Floquet multipliers Ay rather than in the terms of the 
multiplier signs <t*A exponents and phases a/A 

In dynamics the expanding directions, IAJ > 1, have to be taken care of first, 
while the contracting directions |AJ < 1 tend to take care of themselves, hence we 
always order multipliers A* in order of decreasing magnitude |Ai| > IA2I > . . . > 
\A.d\ ■ Since |A ; | = e t/l(J) , this is the same as ordering by /J 1 ^ > // 2) > . . . > . We 
sort the Floquet multipliers {A Pi \, A Pi 2, ■ ■ ., A p/ /} of the Floquet matrix evaluated 
on the /?-cycle into three sets {<?, m, c] 



expanding: 


{A} e 


= (Apj 


: l A ^l > 1) 






= : 


nf > 0} 


marginal: 


{A},„ 


= (A/V 






[Mm 


= [$> : 


$ = 0} 


contracting: 


{A} c 


= ^PJ 


= Kj\ < U 






= (A? : 


lif < 0) . 



In what follows, the volume of expanding manifold will play an important role. 
We denote by A p (no jth eigenvalue index) the product of expanding Floquet 
multipliers 



A 



As J p is a real matrix, complex eigenvalues always come in complex conjugate 
pairs, A p j + i = A* ( ., so the product (5.9) is always real. 

A periodic orbit of a continuous-time flow, or of a map, or a fixed point of a 
map is 



p. 100 



• stable, a sink or a limit cycle if all |A ; | < 1 (real parts of all of its Floquet 
exponents, other than the vanishing longitudinal exponent for perturbations 
tangent to the cycle, see sect. 5.3.1, are strictly negative, 0 > /Z 1 ' > //^). 
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• hyperbolic or a saddle, unstable to perturbations outside its stable manifold 
if some |Ay| > 1, and other |Ay| < 1 (a set of p}^ > /j m j n > 0 is strictly 
positive, the rest is strictly negative). 

• elliptic, neutral or marginal if all |A;| - 1 (py> = 0). 

• partially hyperbolic, if /P> = 0 for a subset of exponents (other than the 
longitudinal one). 

• repelling, or a source, unstable to any perturbation if all \Aj\ > 1 (all Flo- 
quet exponents, other than the vanishing longitudinal exponent, are strictly 
positive, p^ > p {d) > 0). 

The region of system parameter values for which a periodic orbit p is stable is 
called the stability window of p. The set of initial points that are asymptotically 
attracted to M p as t — > +oo (for a fixed set of system parameter values) is called 
the basin of attraction of limit cycle p. Repelling and hyperbolic cycles are 
unstable to generic perturbations, and thus said to be unstable, see figure5.2. 

If all Floquet exponents (other than the vanishing longitudinal exponent) of 
all periodic orbits of a flow are strictly bounded away from zero, the flow is 
said to be hyperbolic. Otherwise the flow is said to be nonhyperbolic. A con- 
fined smooth flow or map is generically nonhyperbolic, with partial ellipticity or 
marginality expected only in presence of continuous symmetries, or for bifurca- 
tion parameter values. As we shall see in chapter 10, in presence of continuous 
symmetries equilibria and periodic orbits are not likely solutions, and their role 
is played by higher-dimensional, toroidal, relative equilibria and relative periodic 
orbits. For Hamiltonian flows the Sp(<i) symmetry (Liouville phase-space volume 
conservation, Poincare invariants) leads to a proliferation of elliptic and partially 
hyperbolic tori. 



example 5.1 
p. 111 



section 7.4 



section 7.5 



5.3 Floquet multipliers are invariant 



The 1-dimensional map Floquet multiplier (5.24) is a product of derivatives 
over all points around the cycle, and is therefore independent of which periodic 
point is chosen as the initial one. In higher dimensions the form of the Floquet 
matrix J p (xq) in (5.3) does depend on the choice of coordinates and the initial 
point xq £ AA p . Nevertheless, as we shall now show, the cycle Floquet multipliers 
are intrinsic property of a cycle in any dimension. Consider the ith eigenvalue, 
eigenvector pair (Ay, e^ ;) ) computed from J p evaluated at a periodic point x, 



J p {x) t u \x) = Aj e w (jc) , xeM 



<J)( 



(5.10) 
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Consider another point on the cycle at time t later, x" = f(x) whose Floquet 
matrix is J p {x'). By the semigroup property (4.20), J T+t = J t+T , and the Jacobian 
matrix at x' can be written either as 

J T+t (x) = J T {x) J\x) = J p (x) J\x) , 

or J\x) J p (x). Multiplying (5.10) by J\x), we find that the Floquet matrix evalu- 
ated at x' has the same Floquet multiplier, 



but with the eigenvector transported along the flow x —> x" to e^\x') = 
J'(x)&\x). Hence, in the spirit of the Floquet theory (5.6) one can define time- 
periodic eigenvectors (in a co-moving 'Lagrangian frame') 



e^OO = e' A 1 J\x) e 0) (0) , e ij) (t) = e (j \x(t)) , x(t) e M p . (5.12) 



J p evaluated anywhere along the cycle has the same set of Floquet multipliers 
{Ai, A2, , A f /_i}. As quantities such as tr J p (x), detJ p (x) depend only 

on the eigenvalues of J p (x) and not on the starting point x, in expressions such as 



We postpone the proof that the cycle Floquet multipliers are smooth conjugacy 
invariants of the flow to sect. 5.4; time-forward map (5.1 1) is the special case of 
this general property of smooth manifolds and their tangent spaces. 

5.3.1 Marginal eigenvalues 

The presence of marginal eigenvalues signals either a continuous symmetry of the 
flow (which one should immediately exploit to simplify the problem), or a non- 
hyperbolicity of a flow (a source of much pain, hard to avoid). In that case (typical 
of parameter values for which bifurcations occur) one has to go beyond linear 
stability, deal with Jordan type subspaces (see example 4.3), and sub-exponential 
growth rates, such as f . For flow-invariant solutions such as periodic orbits, the 
time evolution is itself a continuous symmetry, hence a periodic orbit of a flow 
always has a marginal Floquet multiplier, as we now show. 



The Jacobian matrix f(x) transports the velocity field v(x) by (4.8), v(x(t)) = 
J\xq) v(xo) . In general the velocity at point x(t) does not point in the same di- 
rection as the velocity at point xq, so this is not an eigenvalue condition for /; 
the Jacobian matrix computed for an arbitrary segment of an arbitrary trajectory 



J p (x) e°V) = A y - e°'V) , e°V) = A*) e 0) (jc) , 



(5.11) 





for any x e M p . 



(5.13) 



chapter 24 
exercise 5.1 
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Figure 5.3: Any two points along a periodic orbit 
p are mapped into themselves after one cycle period 
T, hence a longitudinal displacement 8x = v(xo)6t is 
mapped into itself by the cycle Jacobian matrix J p . 




has no invariant meaning. However, if the orbit is periodic, x(T) = x(0), after a 
complete period 

J p {x) v(x) = v(x) x€M p . (5.14) 

Two successive points on the cycle initially distance 5x - x" (0) - x(0) apart, are 
separated by the exactly same distance after a completed period 5x{T) - 8x, see 
figure 5.3, hence for a periodic orbit of a flow the velocity field v at any point 
along cycle is an eigenvector 4^\x) = v(x) of the Jacobian matrix J p with the unit 
Floquet multiplier, zero Floquet exponent 

A|| = l, ^ (ll) -0. (5.15) 

exercise B.3 

The continuous invariance that gives rise to this marginal Floquet multiplier is 
the invariance of a cycle (the set M p ) under a time translation of its points along 
the cycle. As we shall see in sect. 5.5, this marginal stability direction can be 
eliminated by cutting the cycle by a Poincare section and replacing the continuous 
flow Floquet matrix by the Floquet matrix of the Poincare return map. 

If the flow is governed by a time-independent Hamiltonian, the energy is con- 
served, and that leads to an additional marginal Floquet multiplier (we shall show 
in sect. 7.4 that due to the symplectic invariance (7.27) real eigenvalues come in 
pairs). Further marginal eigenvalues arise in presence of continuous symmetries, 
as discussed in chapter 10. 



5.4 Floquet multipliers are metric invariants 

In sect. 5.3 we established that for a given flow, the Floquet multipliers are intrin- 
sic to a given cycle, independent of the starting point along the cycle. Now we 
prove a much stronger statement: cycle Floquet multipliers are smooth conjugacy 
or metric invariants of the flow, the same in any representation of the dynamical 
system. That follows by elementary differential geometry considerations: 



fa 



If the same dynamicals is given by a map / in x coordinates, and a map g 
in the y = h(x) coordinates, then / and g (or any other good representation) are 
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related by a smooth conjugacy, a reparameterization and a coordinate transforma- 
tion g = h o/o/r 1 which maps nearby points of / into nearby points of g. As both 
/ and g are arbitrary representations of the dynamical system, the explicit form 
of the conjugacy h is of no interest, only the properties invariant under any trans- 
formation h are of general import. Furthermore, a good representation should not 
mutilate the data; the mapping h must be a smooth conjugacy which maps nearby 
points of / into nearby points of g. 

This smoothness guarantees that the cycles are not only topological invariants, 
but that their linearized neighborhoods are also metric invariants. For a fixed point 
f(x) = x of a 1 -dimensional map this follows from the chain rule for derivatives, 



f(y) = h\foh- 1 (y))f\h- l (y))j±- 

= h'(x)f'(x)-J—=f'(x). (5.16) 
h'{x) 



In d dimensions the relationship between the maps in different coordinate rep- 
resentations is again g o h = h o / . The chain rule now relates M , the Jacobian 
matrix of the map g, to the Jacobian matrix of map /: 

M'(y)ij = r(f(x)) ik M(x) kl r(x)il , (5.17) 

where 



dyi _i dxi 

T(x) ik = — and Y{x) jk = — . 
dx k dy k 



If x is an equilibrium point, x = fix), T is the matrix inverse ofr 1 , and (5.17) is 
a similarity transformation and thus preserves eigenvalues. It is easy to verify that 
in the case of period n p cycle M' iy) and M p {x) are again related by a similarity 
transformation. (Note, though, that this is not true for Af(x) with r + n p ). As 
stability of a flow can always be reduced to stability of a Poincare return map, a 
Floquet multiplier of any cycle, for a flow or a map in arbitrary dimension, is a 
metric invariant of the dynamical system. 

The /th Floquet (multiplier, eigenvector) pair (A, 1 , e^) are computed from M 
evaluated at a periodic point x, M(x)d'\x) = A,e®(x), x e M p . Multiplying 
by T(jc) from the left, and inserting 1 = r(^:) _1 r(^), we find that the monodromy 
matrix evaluated at y = h(x) has the same Floquet multiplier, 

M' p (y)tP(yY=AitF>(yY, (5.18) 

but with the eigenvector e (,) (^) mapped to e w (y)' = T(x) e^(x). 



exercise B.3 
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5.5 Stability of Poincare map cycles 

(R. Paskauskas and P. Cvitanovic) 

If a continuous flow periodic orbit p pierces the Poincare section P once, the 
section point is a fixed point of the Poincare return map P with stability $-.25) 

Jij = ( 6ik ~^u)) Jkj ' (5 " 19) 

with all primes dropped, as the initial and the final points coincide, i = f T (x) = x. 
If the periodic orbit p pierces the Poincare section n times, the same observation 
applies to the nth iterate of P. 

We have already established in (4.26) that the velocity v(x) is a zero eigen- 
vector of the Poincare section Floquet matrix, Jv = 0. Consider next (A Q ,,e^), 
the full state space ath (eigenvalue, eigenvector) pair 0.10), evaluated at a peri- 
odic point on a Poincare section, 

J(x) e (a) (x) = A a e {a ' ( jc) , x e T> . (5 .20) 

Multiplying (5.19) by e^ ff) and inserting (5.20), we find that the full state space 
Floquet matrix and the Poincare section Floquet matrix J have the same Floquet 
multiplier 

J(x) e {a \x) = A a e {a) (x) , x e 9 , (5.21) 

where e (a) is a projection of the full state space eigenvector onto the Poincare 
section: 

($ ( % = (s ik -^y^) k . (5.22) 

Hence, J p evaluated on any Poincare section point along the cycle p has the same 
set of Floquet multipliers {Ai , A2, • • • Aj} as the full state space Floquet matrix J p , 
except for the marginal unit Floquet multiplier (5.15). 

As established in (4.26), due to the continuous symmetry (time invariance) J p 
is a rank d- 1 matrix. We shall refer to the rank [(d- 1 -AOx (d- 1 - AO] submatrix 
with N-l continuous symmetries quotiented out as the monodromy matrix M p 
(from Greek mono- - alone, single, and dromo - run, racecourse, meaning a 
single run around the stadium). Quotienting continuous symmetries is discussed 
in chapter 10 below. 
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5.6 There goes the neighborhood 

In what follows, our task will be to determine the size of a neighborhood of x(t), 
and that is why we care about the Floquet multipliers, and especially the unstable 
(expanding) ones. 

Nearby points aligned along the stable (contracting) directions remain in the 
neighborhood of the trajectory x(t) = f(xo); the ones to keep an eye on are the 
points which leave the neighborhood along the unstable directions: all chaos arises 
from flights along these these directions. The sub-volume |Al Vo | = rj; Ax, of the 
set of points which get no further away from /(xo) than L, the typical size of the 
system, is fixed by the condition that Ax/A; = 0(L) in each expanding direction 
i. Hence the neighborhood size scales as |At ro l x 0(L e )/\A p \ <x l/|A p | where A p 
is the product of expanding Floquet multipliers (5.9) only; contracting ones play 
a secondary role. Discussion of sect. 1.5.1, figure 1.9, and figure 5.1 illustrate 
intersection of initial volume with its return, and chapters 12 and 18 illustrate the 
key role that the unstable directions play in systematically partitioning the state 
space of a given dynamical system. The contracting directions are so secondary 
that even infinitely many of them (for example, the infinity of contracting eigen- 
directions of the spatiotemporally chaotic dynamics described by a PDE will not 
matter. 

So the dynamically important information is carried by the expanding sub- 
volume, not the total volume computed so easily in (4.29). That is also the reason 
why the dissipative and the Hamiltonian chaotic flows are much more alike than 
one would have naively expected for 'compressible' vs. 'incompressible' flows. 
In hyperbolic systems what matters are the expanding directions. Whether the 
contracting eigenvalues are inverses of the expanding ones or not is of secondary 
importance. As long as the number of unstable directions is finite, the same theory 
applies both to the finite-dimensional ODEs and infinite-dimensional PDEs. 



Resume 

Periodic orbits play a central role in any invariant characterization of the dynam- 
ics, because (a) their existence and inter-relations are a topological, coordinate- 
independent property of the dynamics, and (b) their Floquet multipliers form an 
infinite set of metric invariants: The Floquet multipliers of a periodic orbit remain 
invariant under any smooth nonlinear change of coordinates / ^Ao/o/f 1 . Let 
us summarize the linearized flow notation used throughout the ChaosBook. 

Differential formulation, flows: Equations 

x — v , Sx - A6x 

govern the dynamics in the tangent bundle (x, 5x) € TA1 obtained by adjoining the 
J-dimensional tangent space 5x e TM X to every point x e M in the <i-dimension- 
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al state space M c Mf 1 . The stability matrix A - dv/dx describes the instantaneous 
rate of shearing of the infinitesimal neighborhood of x(t) by the flow. 

Finite time formulation, maps: A discrete sets of trajectory points {xq,x\, ■■■ , 
x n , ■ ■ •} € M can be generated by composing finite-time maps, either given as 
x n+\ = f(x n ), or obtained by integrating the dynamical equations 

ptn+l 

x n +i = f(*n) -x n + I drv(x(T)) , At,, = t n+ i - t„ , (5.23) 

for a discrete sequence of times {to, t\, ■ ■ ■ , t„, ■ ■ •}, specified by some criterion such 
as strobing or Poincare sections. In the discrete time formulation the dynamics in 
the tangent bundle (x, 5x) e TM is governed by 

X n +1 = /On) , Sx„+\ = J(X„)5X„ , J(x„) = J A '"(x n ) , 

where J(x n ) = dx n+ i/dx n = J dr exp (At) is the 1-time step Jacobian matrix. 

Stability of invariant solutions: The linear stability of an equilibrium v(j^) = 0 
is described by the eigenvalues and eigenvectors {/P\e ( ^} of the stability matrix 
A evaluated at the equilibrium point, and the linear stability of a periodic orbit 

f T (x) = x,xe M p , 

J p (x) e u \x) = Aj e (j \x) , Ay = cr U) e A(J>T , 

by its Floquet multipliers, vectors and exponents {Aj, e^}, where A^' = fjy' ± 
iuft) For every continuous symmetry there is a marginal eigen-direction, with 
Aj = 1, Ay* = 0. With all 1 + ,/V continuous symmetries quotiented out (Poincare 
sections for time, slices for continuous symmetries of dynamics, see sect. 10.4) 
linear stability of a periodic orbit (and, more generally, of a partially hyperbolic 
torus) is described by the [(d-l-N) x (d-l-N)] monodromy matrix, all of whose 
Floquet multipliers |A ; | + 1 are generically strictly hyperbolic, 

M p (x) e u \x) = Aj e U) (x) , xe M p /G . 

We shall show in chapter 1 1 that extending the linearized stability hyperbolic 
eigen-directions into stable and unstable manifolds yields important global infor- 
mation about the topological organization of state space. What matters most are 
the expanding directions. The physically important information is carried by the 
unstable manifold, and the expanding sub-volume characterized by the product of 
expanding Floquet multipliers of J p . As long as the number of unstable directions 
is finite, the theory can be applied to flows of arbitrarily high dimension. 



in depth: 

appendix C, p. 806 



fast track: 
chapter 9, p. 152 
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Commentary 



Remark 5.1 Periodic orbits vs. 'cycles'. Throughout this text, the terms 'periodic 
orbit' and 'cycle' (which has many other uses in mathematics) are used interchangeably; 
while 'periodic orbit' is more precise, 'pseudo-cycle' is easier on the ear than 'pseudo- 
periodic-orbit.' In Soviet times obscure abbreviations were a rage, but here we shy away 
from acronyms such as UPOs (Unstable Periodic Orbits). We refer to unstable periodic 
orbits simply as 'periodic orbits', and the stable ones 'limit cycles'. Lost in the mists of 
time is the excitement experienced by the first physicist to discover that there are periodic 
orbits other than the limit cycles reached by mindless computation forward in time; but 
once one understands that there are at most several stable limit cycles (SPOs?) as opposed 
to the Smale horseshoe infinities of unstable cycles (UPOs?), what is gained by prefix 'U'? 
A bit like calling all bicycles 'unstable bicycles' . 

Remark 5.2 Periodic orbits and Floquet theory. Study of time-dependent and T- 
periodic vector fields is a classical subject in the theory of differential equations [ 1, 2]. In 
physics literature Floquet exponents often assume different names according to the context 
where the theory is applied: they are called Bloch phases in the discussion of Schrodinger 
equation with a periodic potential [ ], or quasi-momenta in the quantum theory of time- 
periodic Hamiltonians. For further reading on periodic orbits, consult Moehlis and K. 
Josic [?] Scholarpedia.org article. 



5.7 Examples 

The reader is urged to study the examples collected here. To return back to the 
main text, click on [click to return] pointer on the margin. 

Example 5.1 Stability of cycles of 1 -dimensional maps: The stability of a prime 
cycle p of a 1 -dimensional map follows from the chain rule (4.42) for stability of the n p th 
iterate of the map 



A p is a property of the cycle, not the initial periodic point, as taking any periodic point 
in the p cycle as the initial one yields the same A p . 

A critical point x c is a value of x for which the mapping f(x) has vanishing 
derivative, f'(x c ) = 0. A periodic orbit of a 1 -dimensional map is stable if 

N = \f'(x„X(xn p -i)---f'(x 2 )f'(x 1 )\ < 1 , 

and superstable if the orbit includes a critical point, so that the above product vanishes. 
For a stable periodic orbit of period n the slope A p of the nth iterate f"(x) evaluated 
on a periodic point x (fixed point of the nth iterate) lies between -1 and 1. If \a p \ > 1, 
p -cycle is unstable. 




(5.24) 
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Example 5.2 Stability of cycles for maps: No matter what method one uses to 
determine unstable cycles, the theory to be developed here requires that their Floquet 
multipliers be evaluated as well. For maps a Floquet matrix is easily evaluated by 
picking any periodic point as a starting point, running once around a prime cycle, and 
multiplying the individual periodic point Jacobian matrices according to (4.22). For 
example, the Floquet matrix M p for a prime cycle p of length n p of the Henon map 
(3.17) is given by (4.43), 



and the Floquet matrix M p for a 2-dimensional billiard prime cycle p of length n p 



follows from (8.11) of chapter 8 below. The decreasing order in the indices of the 
products in above formulas is a reminder that the successive time steps correspond 
to multiplication from the left, M p (x{) = M(x np ) ■ ■ -M(x\). We shall compute Floquet 
multipliers of Henon map cycles once we learn how to find their periodic orbits, see 




b 
0 



) 




0 

1 



) 



exercise 13.13. 



click to return: p. 104 
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Exercises 

5.1. A limit cycle with analytic Floquet exponent. 

There are only two examples of nonlinear flows for 
which the Floquet multipliers can be evaluated ana- 
lytically. Both are cheats. One example is the 2- 
dimensional flow 

q = p + q(l-q 2 -p 2 ) 

P = -q + pQ- -q 2 -p 1 )- 

Determine all periodic solutions of this flow, and deter- 
mine analytically their Floquet exponents. Hint: go to 
polar coordinates (q, p) = (r cos 9, r sin 9). G. Bard 



Ermentrout 

5.2. The other example of a limit cycle with analytic Flo- 
quet exponent. What is the other example of a 
nonlinear flow for which the Floquet multipliers can be 
evaluated analytically? Hint: email G.B. Ermentrout. 

5.3. Yet another example of a limit cycle with analytic 
Floquet exponent. Prove G.B. Ermentrout wrong 
by solving a third example (or more) of a nonlinear flow 
for which the Floquet multipliers can be evaluated ana- 
lytically. 
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Chapter 6 

Lyapunov exponents 



Let us apply the newly acquired tools to the fundamental diagnostics in dy- 
namics: Is a given system 'chaotic'? And if so, how chaotic? If all points 
in a neighborhood of a trajectory converge toward the same trajectory, the 
attractor is a fixed point or a limit cycle. However, if the attractor is strange, any 
two trajectories x(t) - f(xo) and x(t) + 5x(t) = f(xo + 5xq) that start out very 
close to each other separate exponentially with time, and in a finite time their 
separation attains the size of the accessible state space. 



example 2.3 

section 1.3.1 
remark 6.1 



This sensitivity to initial conditions can be quantified as 



\6x(t)\ 



e M \6x 0 \ 



(6.1) 



where A, the mean rate of separation of trajectories of the system, is called the 
leading Lyapunov exponent. In the limit of infinite time the Lyapunov exponent 
is a global measure of the rate at which nearby trajectories diverge, averaged over 
the strange attractor. As it so often goes with easy ideas, it turns out that Lyapunov 
exponents are not natural for study of dynamics, and we would have passed them 
over in silence, were it not for so much literature that talks about them. So in a 
textbook we are duty bound to explain what all the excitement is about. But we 
do round the chapter of with a remark longer than the chapter itself. 



6.1 Stretch, strain and twirl 



Diagonalizing the matrix: that's the key to the whole thing. 
— Governor Arnold Schwarzenegger 



In general the Jacobian matrix f is neither diagonal, nor diagonalizable, nor con- 
stant along the trajectory. What is a geometrical meaning of the mapping of a 
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f(*>) 

/ /; 

Figure 6.1: The linearized flow maps a swarm 
of initial points in an infinitesimal spherical neigh- I x(t) + J8x 

borhood of squared radius Sx 2 at x 0 into an ellip- I r 
soid 6x T (J T J)6x at x(t) finite time t later, rotated 
and stretched/compressed along the principal axes by 
streches (a - ;) . 



X 0 + 5x ( 



neighborhood by /? Here the continuum mechanics insights are helpful, in par- 
ticular the polar decomposition which affords a visualization of the linearization 
of a flow as a mapping of the initial ball into an ellipsoid (figure 6.1). 

First, a few definitions: A symmetric [dxd] matrix Q is positive definite, 
Q > 0, if x T Qx > 0 for any nonzero vector x e Q is negative definite, 
Q < 0, if x T Qx < 0 for any nonzero vector x. Alternatively, Q is a positive 
(negative) definite matrix if all its eigenvalues are positive (negative). A matrix 
R is orthogonal if R T R = 1, and proper orthogonal if detR = +1. Here the 
superscript T denotes the transpose. For example, (x\, • • • , Xd) is a row vector, 
(xi, - ■ ■ , Xii) T is a column vector. 

By the polar decomposition theorem, a deformation J can be factored into a 
rotation R and a right / left stretch tensor U / V, 

J = RU - VR , (6.2) 

where R is a proper-orthogonal matrix and U, V are symmetric positive definite 
matrices with strictly positive real eigenvalues {o~\, o~2, • • • , Cd) called principal 
stretches (singular values, Hankel singular values), and with orthonormal eigen- 
vector bases, 

t/M® = 0~ (M® , {U^M 2 \---,U^} 

Vv {i) = cr iV (i) , {v (1) ,v (2) ,---,v (rf )}. (6.3) 

(Ti > 1 for stretching and 0 < cr, < 1 for compression along the direction w» 
or v®. are the principal axes of strain at the initial point xo; {v^} are the 

principal axes of strain at the present placement x. From a geometric point of 
view, / maps the unit sphere into an ellipsoid, figure 6.1; the principal stretches 
are then the lengths of the semiaxes of this ellipsoid. The rotation matrix R carries 
the initial axes of strain into the present ones, V = RUR T . The eigenvalues of the 



right Cauchy-Green strain tensor: J J - U 
left Cauchy-Green strain tensor: J J T - V 2 (6.4) 

are {o~ 2 -}, the squares of principal stretches. 



remark 6.2 



remark 6.1 
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Figure 6.2: A long-time numerical calculation of the 
leading Lyapunov exponent requires rescaling the dis- 
tance in order to keep the nearby trajectory separation 
within the linearized flow range. 




example 6.2 
p. 122 



6.2 Lyapunov exponents 



(J. Mathiesen and P. Cvitanovic) 



The mean growth rate of the distance || 8x{i) || / || 8x$ || between neighboring 
trajectories (6.1) is given by the leading Lyapunov exponent which can be esti- 
mated for long (but not too long) time t as 



(For notational brevity we shall often suppress the dependence of quantities such 
as A - A(xo), 8x(t) - 8x(x$, t) on the initial point xq). One can take (6.5) as is, take 
a small initial separation Sxq, track the distance between two nearby trajectories 
until || 6x(ti) || gets significantly bigger, then record t\A\ - ln(|| 8x(t\) || / || 8xq ||), 
rescale 5x(t\) by factor 6xo/Sx(ti), and continue add infinitum, as in figure 6.2, 
with the leading Lyapunov exponent given by 



Deciding what is a safe 'linear range', the distance beyond which the separation 
vector 8x{t) should be rescaled is a dark art. 

We can start out with a small 8x and try to estimate the leading Lyapunov ex- 
ponent A from (6.6), but now that we have quantified the notion of linear stability 
in chapter 4, we can do better. The problem with measuring the growth rate of the 
distance between two points is that as the points separate, the measurement is less 
and less a local measurement. In the study of experimental time series this might 
be the only option, but if we have equations of motion, a better way is to measure 
the growth rate of vectors transverse to a given orbit. 





(6.6) 
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Given the equations of motion, for infinitesimal 6x we know the dxj(t)/6xj(0) 
ratio exactly, as this is by definition the Jacobian matrix 



5xi(t) dx t {t) t 

hm J i; {xn) , 

sxffSHO Sxj(0) dxjip) ,J 



so the leading Lyapunov exponent can be computed from the linearization (t.16) 

1 II J'(x 0 )6x 0 || 1 t \ 

A(x 0 ) = lim - In 11 , 11 - lim - In (h T f T fh) . (6.7) 

t->°° t || 6X 0 || r^co 2t V ' 

In this formula the scale of the initial separation drops out, only its orientation 
given by the initial orientation unit vector h = 6xq/ \\ 5xq || matters. If one does not 
care about the orientation of the separation vector between a trajectory and its per- 

II II 2 

turbation, but only its magnitude, one can interpret || J\xo)8xq || = 5xQ T (J tT J') 6xo , 
as the error correlation matrix. In the continuum mechanics language, the right 
Cauchy-Green strain tensor J T J (6.4) is the natural object to describe how lin- 
earized neighborhoods deform. In the theory of dynamical systems the stretches 
of continuum mechanics are called the finite-time Lyapunov or characteristic ex- 
ponents, 

A(x Q , ft; i) = - In || fh || = — In (h T J tT fh) . (6.8) 

They depend on the initial point xq and on the direction of the unit vector h, 
|| h || = 1 at the initial time. If this vector is aligned along the ith principal stretch, 
h = d l) , then the corresponding finite-time Lyapunov exponent (rate of stretching) 
is given by 



Aj(x Q ; t) = A(x 0 , u (j) ; t) = - In cr y (x 0 ; t). (6.9) 



We do not need to compute the strain tensor eigenbasis to determine the leading 

Lyapunov exponent, 

A(x 0 ,h) = lim - In II fh II - lim — In fft T 7 fT /ft) , (6.10) 
f-»oo t r^°° 2t v ' 



as expanding the initial orientation in the strain tensor eigenbasis |5.3), h = 2(ft 
u^ l) )u^ , we have 



i T J tT fn = J(ft • u^foj = (h ■ iPfo\ (l + O(o\lo\)) , 

i=l 
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Figure 6.3: A numerical computation of the loga- 
rithm of the stretch h T (J' T J')h in formula (6. 10) for the 
Rossler flow (2.18), plotted as a function of the Rossler 
time units. The slope is the leading Lyapunov exponent 
A ~ 0.09. The exponent is positive, so numerics lends 
credence to the hypothesis that the Rossler attractor is 
chaotic. The big unexplained jump illustrates perils of 
Lyapunov exponents numerics. (J. Mathiesen) 



with stretches ordered by decreasing magnitude, en > cr 2 > 0-3 • • •. For long 
times the largest stretch dominates exponentially (6.10), provided the orientation 
h of the initial separation was not chosen perpendicular to the dominant expanding 
eigen-direction Furthermore, for long times fh is dominated by the largest 
stability multiplier Ai , so the leading Lyapunov exponent is 

A(x 0 ) = lim - (in II n ■ e (1) II + In |Ai(jtb, 01 + OiiT 7 *-* 1 '***)] 

= ilnlAiOd),/)!, (6-11) 

where Ai(jco, t) is the leading eigenvalue of /(xq). The leading Lyapunov expo- 
nent now follows from the Jacobian matrix by numerical integration of (1.9). The 
equations can be integrated accurately for a finite time, hence the infinite time limit 
of (6.7) can be only estimated from a finite set of evaluations of \ ln(n T J tT J'n) as 
function of time, such as figure 6.3 for the Rossler flow (2. 18). 

As the local expansion and contraction rates vary along the flow, the tempo- 
ral dependence exhibits small and large humps. The sudden fall to a low value 
in figure 6.3 is caused by a close passage to a folding point of the attractor, an 
illustration of why numerical evaluation of the Lyapunov exponents, and proving 
the very existence of a strange attractor is a difficult problem. The approximately 
monotone part of the curve can be used (at your own peril) to estimate the leading 
Lyapunov exponent by a straight line fit. 

As we can already see, we are courting difficulties if we try to calculate the 
Lyapunov exponent by using the definition (6.11) directly. First of all, the state 
space is dense with atypical trajectories; for example, if xj happens to lie on a 
periodic orbit p, A would be simply In \A p \/T p , a local property of cycle p, not a 
global property of the dynamical system. Furthermore, even if xj happens to be a 
'generic' state space point, it is still not obvious that In |A(jq, t)\/t should be con- 
verging to anything in particular. In a Hamiltonian system with coexisting elliptic 
islands and chaotic regions, a chaotic trajectory gets captured in the neighborhood 
of an elliptic island every so often and can stay there for arbitrarily long time; as 
there the orbit is nearly stable, during such episode In |A(^>, t)\/t can dip arbitrar- 
ily close to 0 + . For state space volume non-preserving flows the trajectory can 
traverse locally contracting regions, and ln|A(jo, can occasionally go nega- 
tive; even worse, one never knows whether the asymptotic attractor is periodic or 

'chaotic', so any finite estimate of A might be dead wrong. 

J & & exercise 6.4 
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Resume 

Let us summarize the 'stability' chapters 4 to 6. A neighborhood of a trajectory 
deforms as it is transported by a flow. In the linear approximation, the stabil- 
ity matrix A describes the shearing / compression / expansion of an infinitesimal 
neighborhood in an infinitesimal time step. The deformation after a finite time t is 
described by the Jacobian matrix /. 

Floquet multipliers and eigen-vectors are property of finite-time, compact in- 
variant solutions, such as periodic orbits and relative periodic orbits; they are 
explained in chapter 5. Stability exponents [1] are the corresponding long-time 
limit. 

Finite-time Lyapunov exponents and the associated principal axes are defined 
in (6.8). Oseledec Lyapunov exponents are the t — > oo limit of these. 

Commentary 

Remark 6.1 Lyapunov exponents are unnatural. Eigenvectors / eigenvalues are suited 
to study of iterated forms of a matrix, such as Jacobian matrix / ' or exponential exp(fA), 
and are thus a natural tool for study of dynamics. Principal vectors are not, they are suited 
to study of the matrix J' itself. The polar (singular value) decomposition is convenient 
for numerical work (any matrix, square or rectangular, can be brought to such form), 
as a way of estimating the effective rank of matrix J by separating the large, significant 
singular values from the small, negligible singular values. 

That is the first problem with Lyapunov exponents: stretches {cr j) are not related to 
the Jacobian matrix J' eigenvalues {A^} in any simple way. The eigenvectors {u <J ^} of 
strain tensor J T J that determine the orientation of the principal axes, are distinct from 
the Jacobian matrix eigenvectors {e^'}. The strain tensor J T J satisfies no multiplicative 
semigroup property such as (4.20); unlike the Jacobian matrix (5.3), the strain tensor 
J Tr J r for the rth repeat of a prime cycle p is not given by a power of J T J for the single 
traversal of the prime cycle p. Under time evolution the covariant vectors map forward 
as — > Je ( ^ (transport of the velocity vector (4.8) is an example). In contrast, the 
principal axes have to be recomputed from the scratch for each time t. 

Lorenz [2, 3, 4] pioneered the use of singular vectors in chaotic dynamics. We 
found the Goldhirsch, Sulem and Orszag [ 1 ] exposition very clear, and we also enjoyed 
Hoovers [5] pedagogical introduction to computation of Lyapunov spectra by the method 
of Lagrange multipliers. Greene and Kim [6] discuss singular values vs. Jacobian matrix 
eigenvalues. While they conclude that "singular values, rather than eigenvalues, are the 
appropriate quantities to consider when studying chaotic systems," we beg to differ: their 
Fig. 3, which illustrates various semiaxes of the ellipsoid in the case of Lorenz attractor, as 
well as the figures in ref. [7], are a persuasive argument for not using singular values. The 
covariant vectors are tangent to the attractor, while the principal axes of strain point away 
from it. It is the perturbations within the attractor that describe the long-time dynamics 
within the attractor; these perturbations lie within the subspace spanned by the leading 
covariant vectors. 

If Lyapunov exponents are not dynamical, why are they invoked so frequently? One 
reason is fear of mathematics: the monumental and therefore rarely read Oseledec [ 8, 9] 
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Multiplicative Ergodic Theorem states that the limits (6.7-??) exist for almost all points 
xq and all vectors h, and that there are at most d distinct Lyapunov exponents Aj(xo) 
as h ranges over the tangent space. To intimidate the reader further we note in passing 
that "moreover there is a fibration of the tangent space T X M, L l {x) c L 2 (x) c ■ ■ ■ c 
U(x) - T X M, such that if h e L'(x) \ L i_1 (x) the limit (6.7) equals /t;(x)." Oseledec proof 
is important mathematics, but the method is not helpful in elucidating dynamics. The 
other reason is physical and practical: Lorenz [2, 3, 4] asked how does a cloud of initial 
points x(Q) + 6x(Q) distributed as a Gaussian with covariance matrix Q(Q) - (dx(0)6x(0) T ) 
evolves in time. For linearized flow with initial isotropic distribution Q(Q) = el the answer 
is given by the left Cauchy-Green strain tensor, 



The second problem with Lyapunov exponents is deeper: the intuitive definition ( 6.5) 
depends on the notion of distance || 5x{t) || between two state space points. The Euclidean 
(or L 2 ) distance is natural in the theory of 3D continuous media, but what the norm should 
be in for other state spaces is far from clear, especially in high dimensions and for PDEs. 
As we have shown in sect. 5.3, Floquet multipliers are invariant under all local smooth 
nonlinear coordinate transformation, intrinsic to the flow, the Floquet eigenvectors are 
independent of the definition of the norm [7]. In contrast, the stretches {<r 7 }, and the 
right/left principal axes depend on the choice of the norm. 

There is probably no name more liberally and more confusingly used in dynamical 
systems literature than that of Lyapunov (AKA Liapunov). Singular values / principal 
axes of strain tensor J T J (objects natural to the theory of deformations) and their long- 
time limits can indeed be traced back to the thesis of Lyapunov [ 10, 8], and justly deserve 
sobriquet 'Lyapunov'. Oseledec [8] refers to them as 'Liapunov characteristic numbers', 
and Eckmann and Ruelle [II] as 'characteristic exponents'. The natural objects in dy- 
namics are the linearized flow Jacobian matrix J', and its eigenvalues and eigenvectors 
(stability exponents and covariant vectors). Why should they also be called 'Lyapunov'? 
The covariant vectors are misnamed in recent papers as 'covariant Lyapunov vectors' or 
'Lyapunov vectors', even though they are not the eigenvectors that correspond to the Lya- 
punov exponents. That's just confusing, for no good reason - Lyapunov has nothing to 
do with linear stability described by the Jacobian matrix J, as far as we understand his 
paper [10] is about J T J and the associated principal axes. To emphasize the distinction, 
the Jacobian matrix eigenvectors {e <j) } are in recent literature called 'covariant' or 'co- 
variant Lyapunov vectors', or 'stationary Lyapunov basis' [ 12]. However, Trevisan [7] 
refers to covariant vectors as 'Lyapunov vectors', and Radons [ 13] calls them 'Lyapunov 
modes' , motivated by thinking of these eigenvectors as a generalization of 'normal modes' 
of mechanical systems, whereas by z'th 'Lyapunov mode' Takeuchi and Chate [ ] mean 
{A j, e^}, the set of the z'th stability exponent and the associated covariant vector. Kuni- 
hiro et al. [15] call the eigenvalues of stability matrix (4.3), evaluated at a given instant 
in time, the 'local Lyapunov exponents', and they refer to the set of stability exponents 
(4.7) for a finite time Jacobian matrix as the 'intermediate Lyapunov exponent', "aver- 
aged" over a finite time period. The list goes on: there is 'Lyapunov equation' of control 
theory, which is the linearization of the 'Lyapunov function', and the entirely unrelated 
'Lyapunov orbit' of celestial mechanics. 

In short: Lyapunov exponents are an abomination. We are doubtful of their utility 
as means of predicting any observables of physical significance, but that is the minority 
position - in the literature one encounters many provocative speculations, especially in the 
context of foundations of statistical mechanics ('hydrodynamic' modes) and the existence 
of a Lyapunov spectrum in the thermodynamic limit of spatiotemporal chaotic systems. 

Lyapunov - 21apr2013 ChaosBook.org versionl4.4.1, Apr 21 2013 




(6.12) 



CHAPTER 6. LYAPUNOV EXPONENTS 



121 



Remark 6.2 Matrix decompositions of the Jacobian matrix. A 'polar decomposi- 
tion' of a matrix or linear operator is a generalization of the factorization of complex 
number into the polar form, z = r exp{<p). Matrix polar decomposition is explained in 
refs. [16, 17, 18, 19]. One can go one step further than the polar decomposition ( 6.2) into 
a product of a rotation and a symmetric matrix by diagonalizing the symmetric matrix by 
a second rotation, and thus express any matrix with real elements in the singular value 
decomposition (SVD) form 



where D is diagonal and real, and Ri, R2 are orthogonal matrices, unique up to permuta- 
tions of rows and columns. The diagonal elements {<ti, ct 2 , . . . , crj) of D are the singular 
values of J. 

Though singular values decomposition provides geometrical insights into how tan- 
gent dynamics acts, many popular algorithms for asymptotic stability analysis (computing 
Lyapunov spectrum) employ another standard matrix decomposition: the QR scheme [ 20], 
through which a nonsingular matrix J is (uniquely) written as a product of an orthogonal 
and an upper triangular matrix J = QR. This can be thought as a Gram-Schmidt decom- 
position of the column vectors of J. The geometric meaning of QR decomposition is that 
the volume of the cZ-dimensional parallelepiped spanned by the column vectors of J has a 
volume coinciding with the product of the diagonal elements of the triangular matrix R, 
whose role is thus pivotal in algorithms computing Lyapunov spectra [21]. 

Remark 6.3 Numerical evaluation of Lyapunov exponents. There are volumes of 
literature on numerical computation of the Lyapunov exponents, see for example refs. [ 22, 
11, 23, 24]. For early numerical methods to compute Lyapunov vectors, see refs. [25, 
26]. The drawback of the Gram-Schmidt method is that the vectors so constructed are 
orthogonal by fiat, whereas the stable / unstable eigenvectors of the Jacobian matrix are 
in general not orthogonal. Hence the Gram-Schmidt vectors are not covariant, i.e., the 
linearized dynamics does not transport them into the eigenvectors of the Jacobian matrix 
computed further downstream. For computation of covariant vectors, see refs. [ 27, 28]. 

6.3 Examples 

The reader is urged to study the examples collected here. To return back to the 
main text, click on [click to return] pointer on the margin. 

Example 6.1 Lyapunov exponent. Given a 1 -dimensional map, consider observable 
A(x) = ln\f'(x)\ and integrated observable 



J = R l DR 2 T , 



(6.13) 




The Lyapunov exponent is the average rate of the expansion 
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See sect. 6.2 for further details. 



Example 6.2 Singular values and geometry of deformations: Suppose we are 
in three dimensions, and the Jacobian matrix J is not singular, so that the diagonal 
elements of D in (6.13) satisfy ct\ > <r 2 > cr 3 > 0. Consider how J maps the unit 
ball S = {x e R 3 \x 2 = 1). V is orthogonal (rotation/reflection), so V T S is still the unit 
sphere: then D maps S onto ellipsoids = {y e R 3 \ y\l<j\ +y\lo\ + y\lcr\ = 1} whose 
principal axes directions - y coordinates - are determined by V. Finally the ellipsoid is 
further rotated by the orthogonal matrix U. The local directions of stretching and their 
images under J are called the right-hand and left-hand singular vectors for J and are 
given by the columns in V and U respectively: it is easy to check that J\<k = o- k u k , if 
Vk, Uk are the k-th columns of V and U. 

click to return: p. 116 



Exercises 

6. 1 . Principal stretches. Consider dx = f(xo + dxo) - 
f(xo), and show that dx = Mdxo+ higher order terms 
when \\dxo\\ «c 1. (Hint: use Taylor's theorem for 
a vector function.) Here, \\dxo\\ = V^o ■ dxo is the 
norm induced by the usual Euclidean dot (inner) prod- 
uct. Then let dxo = {d€)e, and show that \\dxo\\ = dt and 
||iix|| = (Tidl. (Christov et al. [1]) 

6.2. Eigenvalues of the Cauchy-Green strain tensor. 

Show that Kj = erf using the definition of C, the polar 
decomposition theorem, and the properties of eigenval- 
ues. (Christov et al. [1]) 

6.3. ??. Derive (??) using the Rayleigh-Ritz theorem from 
linear algebra [24]. Hint: write out the norm in the def- 
inition of A as a dot product and re-arrange terms. Also, 
you are allowed to bring the max inside any logarithm or 
square root because these functions are monotonically 
increasing. (Christov et al. [1]) 

6.4. How unstable is the Henon attractor? 

(a) Evaluate numerically the Lyapunov exponent A by 
iterating some 100,000 times or so the Henon map 



x' 




1 - ax 1 + y 


y 




bx 



fora= 1.4,fc = 0.3. 

(b) Would you describe the result as a 'strange attrac- 
tor'? Why? 



(c) How robust is the Lyapunov exponent for the 
Henon attractor? Evaluate numerically the Lya- 
punov exponent by iterating the Henon map for 
a = 1.39945219, b = 0.3. How much do you now 
trust your result for part (a) of this exercise? 

(d) Re-examine this computation by plotting the iter- 
ates, and erasing the plotted points every 1000 it- 
erates or so. Keep at it until the 'strange' attractor 
vanishes like the smile of the Chesire cat. What 
replaces it? Do a few numerical experiments to 
estimate the length of typical transient before the 
dynamics settles into this long-time attractor. 

(e) Use your Newton search routine to confirm exis- 
tence of this attractor. Compute its Lyapunov ex- 
ponent, compare with your numerical result from 
above. What is the itinerary of the attractor. 

(f) Would you describe the result as a 'strange attrac- 
tor'? Do you still have confidence in claims such 
as the one made for the part (b) of this exercise? 

6.5. Rossler attractor Lyapunov exponents. 

(a) Evaluate numerically the expanding Lyapunov ex- 
ponent A e of the Rossler attractor (2.18). 

(b) Plot your own version of figure 6.3. Do not worry 
if it looks different, as long as you understand why 
your plot looks the way it does. (Remember the 
nonuniform contraction/expansion of figure 4.3.) 

(c) Give your best estimate of A e . The literature gives 
surprisingly inaccurate estimates - see whether 
you can do better. 



exerLyapunov - 18mar2013 



ChaosBook.org versionl4.4. 1, Apr 21 2013 



REFERENCES 



123 



(d) Estimate the contracting Lyapunov exponent A c . 
Even though it is much smaller than A e , a glance 



at the stability matrix (4.30) suggests that you can 
probably get it by integrating the infinitesimal vol- 
ume along a long-time trajectory, as in (4.29). 
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Chapter 7 



Hamiltonian dynamics 



Conservative mechanical systems have equations of mo- 
tion that are symplectic and can be expressed in Hamilto- 
nian form. The generic properties within the class of sym- 
plectic vector fields are quite different from those within 
the class of all smooth vector fields: the system always 
has a first integral ("energy") and a preserved volume, and 
equilibrium points can never be asymptotically stable in 
their energy level. 



— John Guckenheimer 




ou might think that the strangeness of contracting flows, flows such as the 



Rossler flow of figure 2.6 is of concern only to chemists or biomedical 



engineers or the weathermen; physicists do Hamiltonian dynamics, right? 
Now, that's full of chaos, too! While it is easier to visualize aperiodic dynam- 
ics when a flow is contracting onto a lower-dimensional attracting set, there are 
plenty examples of chaotic flows that do preserve the full symplectic invariance of 
Hamiltonian dynamics. The whole story started with Poincare's restricted 3-body 
problem, a realization that chaos rules also in general (non-Hamiltonian) flows 
came much later. 

Here we briefly review parts of classical dynamics that we will need later 
on; symplectic invariance, canonical transformations, and stability of Hamiltonian 
flows. If your eventual destination are applications such as chaos in quantum 
and/or semiconductor systems, read this chapter. If you work in neuroscience 
or fluid dynamics, skip this chapter, continue reading with the billiard dynamics 
of chapter 8 which requires no incantations of symplectic pairs or loxodromic 
quartets. 



fast track: 
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7.1 Hamiltonian flows 

(P. Cvitanovic and L.V. Vela-Arevalo) 
An important class of flows are Hamiltonian flows, given by a Hamiltonian 

appendix C 

H(q, p) together with the Hamilton's equations of motion 

v ^ ^' b M remark 2.1 

• dH dH nn 
op, <% 

with the d = 2D phase-space coordinates x split into the configuration space 
coordinates and the conjugate momenta of a Hamiltonian system with D degrees 
of freedom (dof): 

* = (q,p), q = (qi,q 2 ,...,q D ), P = (pi,P2 Pd) • (7.2) 

The equations of motion (7.1) for a time-independent, D-dof Hamiltonian can be 
written compactly as 

id = (DijHjix) , Hj(x) - t-H(x) , (7.3) 

OXj 

where x = (q,p) e At is a phase-space point, and the a derivative of (•) with 
respect to xj is denoted by comma-index notation (•),/, 



• °i 0 ) ' 



is an antisymmetric [dxd] matrix, and I is the [DxD] unit matrix. 

The energy, or the value of the time-independent Hamiltonian function at the 
state space point x = (q, p) is constant along the trajectory x(t), 



d dH dH 

-H(q(t),p(t)) = —Q t (f) + —p i (f) 
dt oqi a pi 

dHdH dHdH n 

dq t dpi dpi dq t 



so the trajectories lie on surfaces of constant energy, or level sets of the Hamilto- 
nian {{q, p) : H(q,p) = E}. For 1-dof Hamiltonian systems this is basically the 
whole story. 
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Figure 7.1: Phase plane of the unforced, undamped 
Duffing oscillator. The trajectories lie on level sets of 
the Hamiltonian (7.6). 




Figure 7.2: A typical collinear helium trajectory in 
the [n, ri\ plane; the trajectory enters along the n-axis 
and then, like almost every other trajectory, after a few 
bounces escapes to infinity, in this case along the r2- 
axis. In this example the energy is set to H = E = -1, 
and the trajectory is bounded by the kinetic energy = 0 
line. 




Example 7.1 Unforced undamped Duffing oscillator: When the damping term 
is removed from the Duffing oscillator (2.8), the system can be written in Hamiltonian 
form, 



2 2 4 

H(q,p) = + 



(7.6) 



This is a 1-dof Hamiltonian system, with a 2-dimensional state space, the plane (q,p). 
The Hamilton's equations (7.1) are 



q = Pi 



q-q 



(7.7) 



For 1-dof systems, the 'surfaces' of constant energy (7.5) are curves that foliate the 
phase plane (q, p), and the dynamics is very simple: the curves of constant energy are 
the trajectories, as shown in figure 7.1. 



Thus all 1-dof systems are integrable, in the sense that the entire phase plane 
is foliated by curves of constant energy, either periodic, as is the case for the 
harmonic oscillator (a 'bound state'), or open (a 'scattering trajectory'). Add one 
more degree of freedom, and chaos breaks loose. 



example B.1 



Example 7.2 Collinear helium: In the quantum chaos part of ChaosBook.org 
we shall apply the periodic orbit theory to the quantization of helium. In particular, we 
will study collinear helium, a doubly charged nucleus with two electrons arranged on a 
line, an electron on each side of the nucleus. The Hamiltonian for this system is 



1,1,22 1 

H = -V\ + ~Pl + — — 

2 2 ri T2 T\ + n 



(7.8) 
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Collinear helium has 2 dof, and thus a 4-dimensional phase space M, which energy 
conservation foliates by 3-dimensional constant energy hypersurfaces. In order to vi- 
sualize it, we often project the dynamics onto the 2-dimensional configuration plane, 
the Oi,r 2 ), r i ^ 0 quadrant, figure 7.2. It looks messy, and, indeed, it will turn out to 
be no less chaotic than a pinball bouncing between three disks. As always, a Poincare 
section will be more informative than this rather arbitrary projection of the flow. The 
difference is that in such projection we see the flow from an arbitrary perspective, with 
trajectories crisscrossing. In a Poincare section the flow is decomposed into intrinsic 
coordinates, a pair along the marginal stability time and energy directions, and the rest 
transverse, revealing the phase-space structure of the flow. 

Note an important property of Hamiltonian flows: if the Hamilton equations 
(7.1) are rewritten in the 2D phase-space form Jq = v,(x), the divergence of the 
velocity field v vanishes, namely the flow is incompressible, V-v = c£v,- = (OiHjj = 
0. The symplectic invariance requirements are actually more stringent than just 
the phase-space volume conservation, as we shall see in sect. 7.3. 

Throughout ChaosBook we reserve the term 'phase space' to Hamiltonian 
flows. A 'state space' is the stage on which any flow takes place. 'Phase space' 
is a special but important case, a state space with symplectic structure, preserved 
by the flow. For us the distinction is necessary, as ChaosBook covers dissipative, 
mechanical, stochastic and quantum systems, all as one happy family. 



7.2 Symplectic group 

Either you're used to this stuff... or you have to get used 
to it. 

— Maciej Zworski 
A matrix transformation g is called symplectic, 

g T cog = co , (7.9) 

if it preserves the symplectic bilinear form (x\x) = x T (ox, where g T denotes the 
transpose of g, and co is a non-singular [2D x 2D] antisymmetric matrix which 
satisfies 

co T = -co, to 2 = -l. (7.10) 

While these are defining requirements for any symplectic bilinear form, co is often 
conventionally taken to be of form (7.4). 

Example 7.3 Symplectic form for D = 2: For two degrees of freedom the phase 
space is 4-dimensional, x = (qi,q2,pi,px) , and the symplectic 2-form is 



remark 7.3 



co - 



( 0 


0 


1 


0 > 


0 


0 


0 


1 


-1 


0 


0 


0 


I 0 


-1 


0 


0 J 



(7.11) 
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The symplectic bilinear form (x (l) \x (2) ) is the sum over the areas of the parallelepipeds 
spanned pairwise by components of the two vectors, 

<^>|* (2) > = W 2) = {q^pf - qfpf) + {qfpf - qfpf) . (7.12) 

It is this sum over oriented areas (not the Euclidean distance between the two vectors, 
\x {2) - x w \) that is preserved by the symplectic transformations. 



If g is symplectic, so is its inverse g~ l , and if gi and g2 are symplectic, so 
is their product g2gi- Symplectic matrices form a Lie group called the symplec- 
tic group Sp(d). Use of the symplectic group necessitates a few remarks about 
Lie groups in general, a topic that we study in more depth in chapter 10. A Lie 
group is a group whose elements g(<p) depend smoothly on a finite number N of 
parameters <p a . In calculations one has to write these matrices in a specific basis, 
and for infinitesimal transformations they take form (repeated indices are summed 
throughout this chapter, and the dot product refers to a sum over Lie algebra gen- 
erators): 



g(5<p) - \ + 5(p - T , 6<peR N , \6<p\ « 1 , (7.13) 



where {Ti,T2 • • • , TV}, the generators of infinitesimal transformations, are a set 
of /V linearly independent [dxd] matrices which act linearly on the J-dimensional 
phase space At The infinitesimal statement of symplectic invariance follows by 
substituting (7.13) into (7.9) and keeping the terms linear in 8<p, 

T a T w + wT fl = 0. (7.14) 

This is the defining property for infinitesimal generators of symplectic transfor- 
mations. Matrices that satisfy (7.14) are sometimes called Hamiltonian matrices. 
A linear combination of Hamiltonian matrices is a Hamiltonian matrix, so Hamil- 
tonian matrices form a linear vector space, the symplectic Lie algebra sp(d). By 
the antisymmetry of a>, 

(coT a ) T = iuT a . (7.15) 

is a symmetric matrix. Its number of independent elements gives the dimen- 
sion (the number of independent continuous parameters) of the symplectic group 
Sp(<0, 

N = d(d+ l)/2 = D(2D+ 1). (7.16) 

The lowest-dimensional symplectic group Sp(2), of dimension N = 3, is isomor- 
phic to SU(2) and SO(3). The first interesting case is Sp(3) whose dimension is 
N = 10. 
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It is easily checked that the exponential of a Hamiltonian matrix 



g = et- J (7.17) 



is a symplectic matrix; Lie group elements are related to the Lie algebra elements 
by exponentiation. 



7.3 Stability of Hamiltonian flows 



Hamiltonian flows offer an illustration of the ways in which an invariance of equa- 
tions of motion can affect the dynamics. In the case at hand, the symplectic in- 
variance will reduce the number of independent Floquet multipliers by a factor of 
2 or 4. 



7.3.1 Canonical transformations 

The evolution of / (4.5) is determined by the stability matrix A, (4.9): 

jf(x) = A(x)j\x) , Aij(x) = (o ik H kj (x) , (7.18) 

where the symmetric matrix of second derivatives of the Hamiltonian, = 
dkd n H, is called the Hessian matrix. From (7.18) and the symmetry of H^ n it 
follows that for Hamiltonian flows (7.3) 

A T to + coA = 0. (7.19) 

This is the defining property (7.14) for infinitesimal generators of symplectic (or 
canonical) transformations. 

Consider now a smooth nonlinear coordinate change form y = hi(x) (see 
sect. 2.3 for a discussion), and define a 'Kamiltonian' function K(x) = H(h(x)). 
Under which conditions does K generate a Hamiltonian flow? In what follows we 
will use the notation dj = d/dyj, s,-j = dhi/dxj. By employing the chain rule we 
have that 

Kj = Hjsj. (7.20) 

(Here, as elsewhere in this book, a repeated index implies summation.) By virtue 
of (7.1), diH = -(jJimjm, so that, again by employing the chain rule, we obtain 

COjjdjK = -OJijSjjCObnS^nXn (7.21) 
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Figure 7.3: Stability exponents of a Hamiltonian equi- 
librium point, 2-dof. 



complex saddle 



d egen er ate sad d I e 



generic center 



saddle-center 



degenerate center 



The right hand side simplifies to j% (yielding Hamiltonian structure) only if 



(7.22) 



or, in compact notation, 



a>(dh) T oj(dh) = 1 



(7.23) 



which is equivalent to the requirement (7.9) that dh is symplectic. h is then called 
a canonical transformation. We care about canonical transformations for two 
reasons. First (and this is a dark art), if the canonical transformation h is very 
cleverly chosen, the flow in new coordinates might be considerably simpler than 
the original flow. Second, Hamiltonian flows themselves are a prime example of 
canonical transformations. 



example B.1 



Dream student Henriette Roux: "I hate these s, nfl . Can't you use a more sensible 
notation?" A: "Be my guest." 



Example 7.4 Hamiltonian flows are canonical: For Hamiltonian flows it follows 
from (7. 19) that f t (J T u>J) = 0, and since at the initial time J°(x 0 ) = 1, Jacobian matrix 
is a symplectic transformation (7.9). This equality is valid for all times, so a Hamilto- 
nian flow f'(x) is a canonical transformation, with the linearization d x f{x) a symplectic 
transformation (7.9): For notational brevity here we have suppressed the dependence 
on time and the initial point, J = J'(x 0 ). By elementary properties of determinants it fol- 
lows from (7.9) that Hamiltonian flows are phase-space volume preserving, |det/| = 1 . 
The initial condition (4.9) for J is J° = 1, so one always has 

det/ = +l. (7.24) 



7.3.2 Stability of equilibria of Hamiltonian flows 



For an equilibrium point x q the stability matrix A is constant. Its eigenvalues 
describe the linear stability of the equilibrium point. A is the matrix (7. 19) with 
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real matrix elements, so its eigenvalues (the Floquet exponents of (5.1)) are either 
real or come in complex pairs. In the case of Hamiltonian flows, it follows from 
(7.19) that the characteristic polynomial of A for an equilibrium satisfies 

det (A - Al) = det(w" 1 (A - Al)co) = det (-oAco - Al) 

= det (A T + Al) = det (A + Al) . (7.25) 

That is, the symplectic invariance implies in addition that if A is an eigenvalue, 
then -A, A* and -A* are also eigenvalues. Distinct symmetry classes of the Floquet 
exponents of an equilibrium point in a 2-dof system are displayed in figure 7. 3. 
It is worth noting that while the linear stability of equilibria in a Hamiltonian 
system always respects this symmetry, the nonlinear stability can be completely 
different. 



7.4 Symplectic maps 

So far we have considered only the continuous time Hamiltonian flows. As dis- 
cussed in sect. 4.4 for finite time evolution mappings, and in sect. 4.5 the iterated 
discrete time mappings, the stability of maps is characterized by eigenvalues of 
their Jacobian matrices, or 'multipliers.' A multiplier A = A(jt&, t) associated to 
a trajectory is an eigenvalue of the Jacobian matrix /. As / is symplectic, (7.9) 
implies that 

r l = -ojJ T aj, (7.26) 

so the characteristic polynomial is reflexive, namely it satisfies 

det (J - Al) = det (/ T - Al) = det (-coJ T co - Al) 

= det(/ _1 - Al) = det (.T 1 ) det (1 - AT) 
= A 2£) det(/- A~ ! l). (7.27) 

Hence if A is an eigenvalue of J, so are 1/A, A* and 1/A*. Real eigenvalues 
always come paired as A, 1/A. The Liouville conservation of phase-space vol- 
umes (7.24) is an immediate consequence of this pairing up of eigenvalues. The 
complex eigenvalues come in pairs A, A*, |A| = 1, or in loxodromic quartets A, 
1/A, A* and 1/A*. These possibilities are illustrated in figure 7.4. 

Example 7.5 Hamiltonian Henon map, reversibility: By (4.44) the Henon 

map (3. 17) for b = -1 value is the simplest 2-dimensional orientation preserving area- 
preserving map, often studied to better understand topology and symmetries of Poincare 
sections of 2 dof Hamiltonian flows. We find it convenient to multiply (3.18) by a and 



section 5.1 
exercise 7.4 
exercise 7.5 
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Figure 7.4: Stability of a symplectic map in ] 
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absorb the a factor into x in order to bring the Henon map for the b = - 1 parameter 
value into the form 

Xj+i + Xi-\ — a — xf , i = 1, n p , (7.28) 
The 2-dimensional Henon map for b - -1 parameter value 

x n+ i = a-x 2 n -y„ 

y n +i = x„. (7.29) 

is Hamiltonian (symplectic) in the sense that it preserves area in the [x,y] plane. 

For definitiveness, in numerical calculations in examples to follow we shall fix 
(arbitrarily) the stretching parameter value to a = 6, a value large enough to guarantee 
that all roots of 0 = f(x) - x (periodic points) are real. 

6X6CCIS6 o.b 



Example 7.6 2-dimensional symplectic maps: In the 2-dimensional case the 
eigenvalues (5.8) depend only on trM' 

Ai,2 = \ (trM' + V(trM'-2)(trM' + 2)) . (7.30) 

Greene's residue criterion states that the orbit is (i) elliptic if the stability residue |tr Af'| - 
2 < 0, with complex eigenvalues Ai = e' 6 ', A 2 = A* = e~' m ' . If |tr Af| - 2 > 0, A is real, 
and the trajectory is either 

(ii) hyperbolic Ai = e A ' , A 2 = e"'",or (7.31) 
(iii) inverse hyperbolic Ai=-e /! ', A 2 = -e~ M . (7.32) 



Example 7.7 Standard map. Given a smooth function g(x), the map 

x n+\ — x n + Jn+l 

y n +\ = y n + g(x„) (7.33) 
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Figure 7.5: Phase portrait for the standard map 
for (a) k = 0: symbols denote periodic orbits, full 
lines represent quasiperiodic orbits, (b) k = 0.3, 
k = 0.85 and k = 1.4: each plot consists of 20 
random initial conditions, each iterated 400 times. 




is an area-preserving map. The corresponding nth iterate Jacobian matrix (4.21) is 

-flCi}^ I). (734, 



k=n 

The map preserves areas, detM = 1, and one can easily check that M is symplectic. 
In particular, one can consider x on the unit circle, and y as the conjugate angular 
momentum, with a function g periodic with period 1 . The phase space of the map is 
thus the cylinder Si x R (S i stands for the 1 -torus, which is fancy way to say "circle"): 
by taking (7.33) mod 1 the map can be reduced on the 2-torus SY 

The standard map corresponds to the choice g(x) = k/2n sin(2;rx). When k-Q, 
y n +\ - y n - yo, so that angular momentum is conserved, and the angle x rotates with 
uniform velocity 

x n +i = x„ + y 0 = x 0 + (« + l)y 0 mod 1 . 

The choice of yo determines the nature of the motion (in the sense of sect. 2.1.1): for 
yo = 0 we have that every point on the yo = 0 line is stationary, foryo = p/q the motion 
is periodic, and for irrational yo any choice of xo leads to a quasiperiodic motion (see 
figure 7.5 (a)). 

Despite the simple structure of the standard map, a complete description of its 
dynamics for arbitrary values of the nonlinear parameter k is fairly complex: this can 
be appreciated by looking at phase portraits of the map for different k values: when 
k is very small the phase space looks very much like a slightly distorted version of 
figure 7.5 (a), while, when k is sufficiently large, single trajectories wander erratically on 
a large fraction of the phase space, as in figure 7.5 (b). 

This gives a glimpse of the typical scenario of transition to chaos for Hamilto- 
nian systems. 

Note that the map (7.33) provides a stroboscopic view of the flow generated by 
a (time-dependent) Hamiltonian 

H(x,y;t) = l -y 2 +G(x)6 l (t) (7.35) 
where 6i denotes the periodic delta function 

oo 

Si(t) = ^ 6(t-m) (7.36) 

and 

G'(x) = -g{x). (7.37) 

Important features of this map, including transition to global chaos (destruction 
of the last invariant torus), may be tackled by detailed investigation of the stability of 
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periodic orbits. A family of periodic orbits of period Q already present in the k - 0 
rotation maps can be labeled by its winding number P/Q The Greene residue describes 
the stability of a P/Q-cycle: 

Rp/Q = ^(2-trMpfQ). (7.38) 

If Rp/q € (0, 1) the orbit is elliptic, for R P/Q > 1 the orbit is hyperbolic orbits, and for 
Rp/q < 0 inverse hyperbolic. 

For k = 0 all points on the y<j - P/Q line are periodic with period Q, winding 
number P/Q and marginal stability R P / Q = 0. As soon as k > 0, only a 2Q of such 
orbits survive, according to Poincare-Birkhoff theorem: half of them elliptic, and half 
hyperbolic. If we further vary k in such a way that the residue of the elliptic Q-cycle 
goes through 1, a bifurcation takes place, and two or more periodic orbits of higher 
period are generated. 



7.5 Poincare invariants 



Let C be a region in phase space and V(0) its volume. Denoting the flow of the 
Hamiltonian system by f(x), the volume of C after a time t is V(t) = f(C), and 
using (7.24) we derive the Liouville theorem: 



V(t) 



df\x') 



- I dx - I 

Jf'(C) Jc 

J det (J)dx = J dx' = 



dx 
V(0), 



(7.39) 



Hamiltonian flows preserve phase-space volumes. 

The symplectic structure of Hamilton's equations buys us much more than 
the 'incompressibility,' or the phase-space volume conservation. Consider the 
symplectic product of two infinitesimal vectors 



(5x\8x) = 5x T u>Sx = SpiSqi - SqjSpi 



{oriented area in the (<#, /?,-) plane) . 



(7.40) 



7=1 



Time t later we have 



(5x'\6x) = 6x T J T ojJSx = 6x T u>6x. 



This has the following geometrical meaning. Imagine that there is a reference 
phase-space point. Take two other points infinitesimally close, with the vectors 5x 
and 5x describing their displacements relative to the reference point. Under the 
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dynamics, the three points are mapped to three new points which are still infinites- 
imally close to one another. The meaning of the above expression is that the area 
of the parallelepiped spanned by the three final points is the same as that spanned 
by the initial points. The integral (Stokes theorem) version of this infinitesimal 
area invariance states that for Hamiltonian flows the sum of D oriented areas 'Vj 
bounded by D loops £YVj, one per each (qi,pi) plane, is conserved: 



One can show that also the 4, 6, • • • , 2D phase-space volumes are preserved. The 
phase space is 2D-dimensional, but as there are D coordinate combinations con- 
served by the flow, morally a Hamiltonian flow is D-dimensional. Hence for 
Hamiltonian flows the key notion of dimensionality is D, the number of the de- 
grees of freedom (dof), rather than the phase-space dimensionality d = 2D. 

Dream student Henriette Roux: "Would it kill you to draw some pictures here?" 
A: "Be my guest." 



Resume 

Physicists do Lagrangians and Hamiltonians. Many know of no world other 
than the perfect world of quantum mechanics and quantum field theory in which 
the energy and much else is conserved. From the dynamical point of view, a 
Hamiltonian flow is just a flow, but a flow with a symmetry: the stability matrix 
Ajj = QJik H,kj(x) °f a Hamiltonian flow Xj = a)ijHj(x) satisfies A T u) + a>A = 0. Its 
integral along the trajectory, the linearization of the flow / that we call the 'Jaco- 
bian matrix,' is symplectic, and a Hamiltonian flow is thus a canonical transforma- 
tion in the sense that the Hamiltonian time evolution i = f{x) is a transformation 
whose linearization (Jacobian matrix) J = dx" /dx preserves the symplectic form, 
J T u>J = a> . This implies that A are in the symplectic algebra sp(2D), and that the 
2D-dimensional Hamiltonian phase-space flow preserves D oriented infinitesimal 
volumes, or Poincare invariants. The Liouville phase-space volume conservation 
is one consequence of this invariance. 

While symplectic invariance enforces |A| = 1 for complex eigenvalue pairs 
and precludes existence of attracting equilibria and limit cycles typical of dissipa- 
tive flows, for hyperbolic equilibria and periodic orbits |A| > 1, and the pairing 
requirement only enforces a particular value on the 1/A contracting direction. 
Hence the description of chaotic dynamics as a sequence of saddle visitations is 
the same for the Hamiltonian and dissipative systems. You might find symplec- 
ticity beautiful. Once you understand that every time you have a symmetry, you 
should use it, you might curse the day [26] you learned to say 'symplectic'. 




(7.41) 




in depth: 

appendix C.4, p. 817 
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Commentary 

In theory there is no difference between theory and prac- 
tice. In practice there is. 
— Yogi Berra 

Remark 7.1 Hamiltonian dynamics, sources. If you are reading this book, in theory 
you already know everything that is in this chapter. In practice you do not. Try this: 
Put your right hand on your heart and say: "I understand why nature prefers symplectic 
geometry." Honest? 

Where does the skew-symmetric a> come from? Newton / = ma law for a motion in 
a potential is m'q - —dV . Rewrite this as a pair of first order ODEs, q = p/m,p = —dV , 
define the total energy H(q, p) - p 2 /2m + V{q) , and voila, the equation of motion take on 
the symplectic form (7.3). What makes this important is the fact that the evolution in time 
(and more generally any canonical transformation) preserves this symplectic structure, as 
shown in sect. 7.3.1. Another way to put it: a gradient flow i = —dV(x) contracts a state 
space volume into a fixed point. When that happens, V(x) is a 'Lyapunov function', and 
the equilibrium x = 0 is 'Lyapunov asymptotically stable'. In contrast, the '-' sign in the 
symplectic action on (q, p) coordinates, p = —dV induces a rotation, and conservation of 
phase-space areas: for a symplectic flow there can be no volume contraction. 

Out there there are centuries of accumulated literature on Hamilton, Lagrange, Jacobi 
etc. formulation of mechanics, some of it excellent. In context of what we will need here, 
we make a very subjective recommendation-we enjoyed reading Percival and Richards [ 3] 
and Ozorio de Almeida [4]. Exposition of sect. 7.2 follows Dragt [ I 5]. There are two con- 
ventions in literature for what the integer argument of Sp(- ■ ■) stands for: either Sp(D) or 
Sp(d) (used, for example, in refs. [15, 17]), where D = dof, and d = 2D. As explained 
in Chapter 13 of ref. [17], symplectic groups are the 'negative dimensional,' d — > —d 
sisters of the orthogonal groups, so only the second notation makes sense in the grander 
scheme of things. Mathematicians can even make sense of the d =odd-dimensional case, 
see Proctor [18, 19], by dropping the requirement that a> is non-degenerate, and defining 
a symplectic group Sp(A1, oS) acting on a vector space M as a subgroup of Gl(M) which 
preserves a skew-symmetric bilinear form a> of maximal possible rank. The odd sym- 
plectic groups Sp(2D + 1) are not semisimple. If you care about group theory for its own 
sake (the dynamical systems symmetry reduction techniques of chapter 10 are still too 
primitive to be applicable to Quantum Field Theory), chapter 14 of ref. [ 1 7] is fun, too. 

Referring to the Sp(d) Lie algebra elements as 'Hamiltonian matrices' as one some- 
times does [15, 20] conflicts with what is meant by a 'Hamiltonian matrix' in quantum 
mechanics: the quantum Hamiltonian sandwiched between vectors taken from any com- 
plete set of quantum states. We are not sure where this name comes from; Dragt cites 
refs. [21, 22], and chapter 17 of his own book in progress [ 16]. Fulton and Harris [21] 
use it. Certainly Van Loan [23] uses in 1981, and Taussky in 1972. Might go all the way 
back to Sylvester? 

Dream student Henriette Roux wants to know: "Dynamics equals a Hamiltonian plus a 
bracket. Why don't you just say it?" A: "It is true that in the tunnel vision of atomic 
mechanicians the world is Hamiltonian. But it is much more wondrous than that. This 
chapter starts with Newton 1687: force equals acceleration, and we always replace a 
higher order time derivative with a set of first order equations. If there are constraints, or 
fully relativistic Quantum Field Theory is your thing, the tool of choice is to recast New- 
ton equations as a Lagrangian 1788 variational principle. If you still live in material but 
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non-relativistic world and have not gotten beyond Heisenberg 1925, you will find Hamil- 
ton's 1 827 principal function handy. The question is not whether the world is Hamiltonian 
- it is not - but why it is so often profitably formulated this way. For Maupertuis 1744 vari- 
ational principle was a proof of God's existence; for Lagrange who made it mathematics, 
it was just a trick. Our sect. 33.1.1 "Semiclassical evolution" is an attempt to get inside 17 
year old Hamilton's head, but it is quite certain that he did not get to it the way we think 
about it today. He got to the 'Hamiltonian' by studying optics, where the symplectic struc- 
ture emerges as the leading WKB approximation to wave optics; higher order corrections 
destroy it again. In dynamical systems theory, the densities of trajectories are transported 
by Liouville evolution operators, as explained here in sect. 16.6. Evolution in time is a 
one-parameter Lie group, and Lie groups act on functions infinitesimally by derivatives. 
If the evolution preserves additional symmetries, these derivatives have to respect them, 
and so 'brackets' emerge as a statement of symplectic invariance of the flow. Dynamics 
with a symplectic structure are just a special case of how dynamics moves densities of tra- 
jectories around. Newton is deep, Poisson brackets are technology and thus they appear 
naturally only by the time we get to chapter 16. Any narrative is of necessity linear, and 
putting Poisson ahead of Newton [ ] would be a disservice to you, the student. But if you 
insist: Dragt and Habib [24, 15] offer a concise discussion of symplectic Lie operators 
and their relation to Poisson brackets. " 

Remark 7.2 Symplectic. The term symplectic -Greek for twining or plaiting together- 
was introduced into mathematics by Hermann Weyl. 'Canonical' lineage is church- 
doctrinal: Greek 'kanon,' referring to a reed used for measurement, came to mean in 
Latin a rule or a standard. 

Remark 7.3 The sign convention of to. The overall sign of a>, the symplectic invariant 
in (7.3), is set by the convention that the Hamilton's principal function (for energy con- 
serving flows) is given by R(q, q',t) = J pjdq, - Et. With this sign convention the action 
along a classical path is minimal, and the kinetic energy of a free particle is positive. Any 
finite-dimensional symplectic vector space has a Darboux basis such that at takes form 
(7.9). Dragt [15] convention for phase-space variables is as in (7.2). He calls the dynam- 
ical trajectory xo — » x(xq, t) the 'transfer map,' something that we will avoid here, as it 
conflicts with the well established use of 'transfer matrices' in statistical mechanics. 

Remark 7.4 Loxodromic quartets. For symplectic flows, real eigenvalues always 
come paired as A, 1/A, and complex eigenvalues come either in A, A* pairs, |A| = 1, or 
A, 1/A, A*, 1/A* loxodromic quartets. As most maps studied in introductory nonlinear 
dynamics are 2d, you have perhaps never seen a loxodromic quartet. How likely are 
we to run into such things in higher dimensions? According to a very extensive study of 
periodic orbits of a driven billiard with a four dimensional phase space, carried in ref. [ 28], 
the three kinds of eigenvalues occur with about the same likelihood. 

Remark 7.5 Standard map. Standard maps model free rotors under the influence 
of short periodic pulses, as can be physically implemented, for instance, by pulsed op- 
tical lattices in cold atoms physics. On the theoretical side, standard maps illustrate a 
number of important features: small k values provide an example of KAM perturbative 
regime (see ref. [11]), while larger fc's illustrate deterministic chaotic transport [9, 10], 
and the transition to global chaos presents remarkable universality features [5, 12, 7]. 
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The quantum counterpart of this model has been widely investigated, as the first example 
where phenomena like quantum dynamical localization have been observed [13]. Stabil- 
ity residue was introduced by Greene [12]. For some hands-on experience of the standard 
map, download Meiss simulation code [14]. 
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Exercises 



7.1. Complex nonlinear Schrodinger equation. Con- 
sider the complex nonlinear Schrodinger equation in one 
spatial dimension [26]: 



a 2 

dx 1 



(a) 



0 


0 


1 


0 


0 


0 


0 


1 


-1 


0 


0 


—c 


0 


-1 


c 


0 



Show that the function if/ : R — > C defining the 
traveling wave solution <p{x, f) = i//(x-ct) for c > 0 
satisfies a second-order complex differential equa- 
tion equivalent to a Hamiltonian system in K 4 rel- 
ative to the noncanonical symplectic form whose 
matrix is given by 



w c — 



(b) Analyze the equilibria of the resulting Ha- 
miltonian system in R 4 and determine their linear 
stability properties. 

(c) Let if/{s) = e' cs l 2 a{s) for a real function a(s) and 
determine a second order equation for a{s). Show 
that the resulting equation is Hamiltonian and has 
heteroclinic orbits for/? < 0. Find them. 

(d) Find 'soliton' solutions for the complex nonlinear 
Schrodinger equation. 

(Luz V. Vela-Arevalo) 

7.2. Symplectic vs. Hamiltonian matrices. In the 

language of group theory, symplectic matrices form the 
symplectic Lie group Sp(d), while the Hamiltonian ma- 
trices form the symplectic Lie algebra sp(d), or the al- 
gebra of generators of infinitesimal symplectic transfor- 
mations. This exercise illustrates the relation between 
the two: 

(a) Show that if a constant matrix A satisfy the 
Hamiltonian matrix condition (7.14), then J(t) = 
exp(fA) , (el, satisfies the symplectic condition 
(7.9), i.e., J(f) is a symplectic matrix. 

(b) Show that if matrices T fl satisfy the Hamiltonian 
matrix condition (7.14), then g{<f>) = exp(<p ■ T), 
<p € R N , satisfies the symplectic condition (7.9), 
i.e., g(cp) is a symplectic matrix. 

(A few hints: (i) expand exp(A) , A = <f> ■ T , as a power 
series in A. Or, (ii) use the linearized evolution equation 
(7.18). ) 

7.3. When is a linear transformation canonical? 



(a) Let A be a [n x n] invertible matrix. Show that 
the map cf> : R 2 " — > R 2 " given by (q, p) h-> 
(Aq, (A~') T p) is a canonical transformation. 

(b) If R is a rotation in R 3 , show that the map (q, p) i-> 
(R q, R p) is a canonical transformation. 

(Luz V. Vela-Arevalo) 

7.4. Determinants of symplectic matrices. Show that 
the determinant of a symplectic matrix is +1, by going 
through the following steps: 

(a) use (7.27) to prove that for eigenvalue pairs each 
member has the same multiplicity (the same holds 
for quartet members), 

(b) prove that the joint multiplicity of A — ± 1 is even, 

(c) show that the multiplicities of A — 1 and A - -I 
cannot be both odd. (Hint: write 

P(A) = (A - \) lm+l (A + V) ll+l Q(A) 

and show that Q(i) = 0). 

7.5. Cherry's example. What follows refs. [25, 27] is 
mostly a reading exercise, about a Hamiltonian system 
that is linearly stable but nonlinearly unstable. Consider 
the Hamiltonian system on R 4 given by 



H 



\(<i\ + p\) - (ql + p\) + \pi(p\ - q\) - qmp\- 



(a) Show that this system has an equilibrium at the 
origin, which is linearly stable. (The linearized 
system consists of two uncoupled oscillators with 
frequencies in ratios 2:1). 

(b) Convince yourself that the following is a family of 
solutions parameterize by a constant r: 



/-cosO-t) 
q\ = - V2— , 

t — T 

-sin(f - t) 



n= V2- 



t - T 



11 = 
P2 = 



cos 2(t - t) 

t-T 

sin2(f - r) 
t-r 



These solutions clearly blow up in finite time; 
however they start at t — 0 at a distance V3/r 
from the origin, so by choosing t large, we can 
find solutions starting arbitrarily close to the ori- 
gin, yet going to infinity in a finite time, so the 
origin is nonlinearly unstable. 

(Luz V. Vela-Arevalo) 
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Chapter 8 



Billiards 




he dynamics that we have the best intuitive grasp on, and find easiest to grap- 



ple with both numerically and conceptually, is the dynamics of billiards. 



For billiards, discrete time is altogether natural; a particle moving through 
a billiard suffers a sequence of instantaneous kicks, and executes simple motion 
in between, so there is no need to contrive a Poincare section. We have already 
used this system in sect. 1.3 as the intuitively most accessible example of chaos. 
Here we define billiard dynamics more precisely, anticipating the applications to 
come. 



8.1 Billiard dynamics 

A billiard is denned by a connected region Q c R D , with boundary dQ c R D_1 
separating Q from its complement R D \ Q. The region Q can consist of one com- 
pact, finite volume component (in which case the billiard phase space is bounded, 
as for the stadium billiard of figure 8.1), or can be infinite in extent, with its 
complement R° \ Q consisting of one or several finite or infinite volume compo- 
nents (in which case the phase space is open, as for the 3-disk pinball game in 
figure 1.1). In what follows we shall most often restrict our attention to planar 
billiards. 

A point particle of mass m and momentum p n = mv n moves freely within the 
billiard, along a straight line, until it encounters the boundary. There it reflects 
specularly (specular = mirrorlike), with no change in the tangential component 
of momentum, and instantaneous reversal of the momentum component normal to 
the boundary, 



with h the unit vector normal to the boundary dQ at the collision point. The angle 
of incidence equals the angle of reflection, as illustrated in figure8.2. A billiard is 



p = p - 2{p ■ h)h , 



(8.1) 
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Figure 8.1: The stadium billiard is a 2- 
dimensional domain bounded by two semi-circles 
of radius d = 1 connected by two straight walls of 
length 2a. At the points where the straight walls 
meet the semi-circles, the curvature of the border 
changes discontinuously; these are the only sin- 
gular points of the flow. The length a is the only 
parameter. 



Figure 8.2: (a) A planar billiard trajectory is fixed 
by specifying the perimeter length parametrized 
by s and the outgoing trajectory angle 0, both mea- 
sured counterclockwise with respect to the out- 
ward normal h. (b) The Birkhoff phase-space co- 
ordinate pair (s,p) fully specifies the trajectory, 
where p = |/?| sin 0 is the momentum component 
tangential to the boundary As the pinball kinetic 
energy is conserved in elastic scattering, the pin- 
ball mass and the magnitude of the pinball mo- 
mentum are customarily set to m = \p\ = 1. 




-6 -4 -2 0 
(b) s 



a Hamiltonian system with a 2D-dimensional phase space x = (q, p) and potential 
V(q) = 0 for q e Q, V(q) = oo for q € dQ. 

A billiard flow has a natural Poincare section defined by Birkhoff coordinates 
s,„ the arc length position of the nth bounce measured along the billiard boundary, 
and p n - |p|sin0„, the momentum component parallel to the boundary, where 
(f>„ is the angle between the outgoing trajectory and the normal to the boundary. 
We measure both the arc length s, and the parallel momentum p counterclockwise 
relative to the outward normal (see figure 8.2 as well as figure 3.9 (a)). In D = 2, 
the Poincare section is a cylinder (topologically an annulus), figure 8.3, where 
the parallel momentum p ranges for -\p\ to \p\, and the s coordinate is cyclic 
along each connected component of dQ. The volume in the full phase space is 
preserved by the Liouville theorem (7.39). The Birkhoff coordinates x - (s,p) e 
P, are the natural choice, because with them the Poincare return map preserves 
the phase-space volume of the (s, p) parameterized Poincare section (a perfectly 
good coordinate set (s, <p) does not do that). 

Without loss of generality we set m = \v\ - \p\ = 1. Poincare section condition 
eliminates one dimension, and the energy conservation \p\ = 1 eliminates another, 
so the Poincare section return map P is (2D - 2)-dimensional. 



remark 2.1 



exercise 8.6 



exercise 8.6 
section 8.2 



The dynamics is given by the Poincare return map 



P : (S n ,p n ) H-> {s n+ uPn+\) 



(8.2) 
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Figure 8.3: In D = 2 the billiard Poincare section 
is a cylinder, with the parallel momentum p ranging 
over p e { — 1,1}, and with the s coordinate is cyclic 
along each connected component of dQ. The rectangle 
figure 8.2(b) is such cylinder unfolded, with periodic 
boundary conditions glueing together the left and the 
right edge of the rectangle. 




from the nth collision to the (n + l)st collision. The discrete time dynamics map 
P is equivalent to the Hamiltonian flow (7.1) in the sense that both describe the 
same full trajectory. Let t n denote the instant of nth collision. Then the position 
of the pinball e Q at time t n + r < t n +i is given by 2D - 2 Poincare section 
coordinates (s n , p n ) € V together with r, the distance reached by the pinball along 
the «th section of its trajectory (as we have set the pinball speed to 1 , the time of 
flight equals the distance traversed). 



Example 8.1 3-disk game of pinball: In the case of bounces off a circular disk, 
the position coordinate s - rO is given by angle 6 e [0, 2n]. For example, for the 3-disk 
game of pinball of figure 1.6 and figure 3.9(a) we have two types of collisions: 

©xsrcis© o.i 

( <b' = — d> + 2 arcsin n , 
P 0 :< „ back-reflection (8.3) 

I P' =~P+ f SU10' 

( (/>' = (f> - 2 arcsin p + 2n/3 „ „ , ,. , „ „, 

Pi -A , a .._ reflect to 3rd disk. (8.4) 

Here a = radius of a disk, and R = center-to-center separation. Actually, as in this 
example we are computing intersections of circles and straight lines, nothing more 
than high-school geometry is required. There is no need to compute arcsin - one only 
needs to compute one square root per each reflection, and the simulations can be very 

exercise 8.2 

Trajectory of the pinball in the 3-disk billiard is generated by a series ofPo s and 
P\ 's. At each step one has to check whether the trajectory intersects the desired disk 
(and no disk in-between). With minor modifications, the above formulas are valid for 
any smooth billiard as long as we replace a by the local curvature of the boundary at 
the point of collision. 



8.2 Stability of billiards 



We turn next to the question of local stability of discrete time billiard systems. In- 
finitesimal equations of variations (4.2) do not apply, but the multiplicative struc- 
ture (4.20) of the finite-time Jacobian matrices does. As they are more physical 
than most maps studied by dynamicists, let us work out the billiard stability in 
some detail. 
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On the face of it, a plane billiard phase space is 4-dimensional. However, one 
dimension can be eliminated by energy conservation, and the other by the fact that 
the magnitude of the speed is constant. We shall now show how going to a local 
frame of motion leads to a [2x2] Jacobian matrix. 

Consider a 2-dimensional billiard with phase-space coordinates x = (q,q2, Pi, pi)- 
Let t n be the instant of the nth collision of the pinball with the billiard boundary, 
and t* = t n ± e, e positive and infinitesimal. With the mass and the speed equal to 
1, the momentum direction can be specified by angle 9: x = ($, 172, sin (9, cos (9). 
Now parametrize the 2-dimensional neighborhood of a trajectory segment by 
8x = (8z, 59), where 

8z = Sqi cos 9 - 8q2 sin 9 , (8.5) 

89 is the variation in the direction of the pinball motion. Due to energy conserva- 
tion, there is no need to keep track of 8q\ , variation along the flow, as that remains 
constant. {8q\,8q2) is the coordinate variation transverse to the kth segment of the 
flow. From the Hamilton's equations of motion for a free particle, dq/dt = pj, 
dpildt - 0, we obtain the equations of motion (4.1) for the linearized neighbor- 
hood 

4-5(9 = 0, —5z = 5e. (8.6) 
dt dt 

Let 89„ = 89(t„) and 8z n = 8z{t„) be the local coordinates immediately after the 
nth collision, and 89^ = 89{t~), 8z„ - 8z(t~) immediately before. Integrating the 
free flight from t + _^ to t~ we obtain 

5z~ = 8z n -\ + T n 89„-\ , T n = t n - ?„_i 

89- = 89 n ^, (8.7) 
and the Jacobian matrix (4. 19) for the nth free flight segment is 



I o T \ I 



At incidence angle <p n (the angle between the outgoing particle and the outgoing 
normal to the billiard edge), the incoming transverse variation 8z^ projects onto an 
arc on the billiard boundary of length 8z„ I cos <p n . The corresponding incidence 
angle variation 8<p n = 8z~ n lp n cos <p n , p n = local radius of curvature, increases the 
angular spread to 

8z n = -8z n 

89 n = -66- 1 —r5z n , (8.9) 

Pn COS (p n 
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Figure 8.4: Defocusing of a beam of nearby trajecto- 
ries at a billiard collision. (A. Wirzba) 



so the Jacobian matrix associated with the reflection is 



I I ? ), r n = 2 -—. (8.10) 

r " 1 / Pn COS (p n 



The full Jacobian matrix for n p consecutive bounces describes a beam of tra- 
jectories defocused by Mj along the free flight (the t„ terms below) and defo- 
cused/refocused at reflections by Mr (the r n terms below) 



exercise 8.4 



M p 



<-»'"fl(o ?)(,' ?)• < 8 -'» 



where t„ is the flight time of the kth free-flight segment of the orbit, r n = 2/p n cos <f>„ 
is the defocusing due to the kth reflection, and p,, is the radius of curvature of 
the billiard boundary at the nth scattering point (for our 3-disk game of pinball, 
p = 1). As the billiard dynamics is phase-space volume preserving, detM = 1, 
and the eigenvalues are given by (7.30). 

This is an example of the Jacobian matrix chain rule (4.22) for discrete time 
systems (the Henon map stability (4.43) is another example). Stability of every 
flight segment or reflection taken alone is a shear with two unit eigenvalues, 



( J T * j , det M R = det ( j J | 



but acting in concert in the interwoven sequence (8.11) they can lead to a hyper- 
bolic deformation of the infinitesimal neighborhood of a billiard trajectory. 

As a concrete application, consider the 3-disk pinball system of sect. 1 .3. An- 
alytic expressions for the lengths and eigenvalues of 0, 1 and 10 cycles follow 
from elementary geometrical considerations. Longer cycles require numerical exercjse 138 
evaluation by methods such as those described in chapter 13. exercise 8 3 

chapter 13 



exercise 13.7 
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Resume 

A particulary natural application of the Poincare section method is the reduction 
of a billiard flow to a boundary-to-boundary return map. 



Commentary 

Remark 8.1 Billiards. The 3-disk game of pinball is to chaotic dynamics what a 
pendulum is to integrable systems; the simplest physical example that captures the essence 
of chaos. Another contender for the title of the 'harmonic oscillator of chaos' is the baker's 
map which is used as the red thread through Ott's introduction to chaotic dynamics [ 11]. 
The baker's map is the simplest reversible dynamical system which is hyperbolic and 
has positive entropy. We will not have much use for the baker's map here, as due to its 
piecewise linearity it is so nongeneric that it misses all of the subtleties of cycle expansions 
curvature corrections that will be central to this neatise. 

chapter 20 

That the 3-disk game of pinball is a quintessential example of deterministic chaos 
appears to have been first noted by B. Eckhardt [1]. The model was studied in depth 
classically, semiclassically and quantum mechanically by P. Gaspard and S.A. Rice [3], 
and used by P. Cvitanovic and B. Eckhardt [4] to demonstrate applicability of cycle ex- 
pansions to quantum mechanical problems. It has been used to study the higher order h 
corrections to the Gutzwiller quantization by P. Gaspard and D. Alonso Ramirez [5], con- 
struct semiclassical evolution operators and entire spectral determinants by P. Cvitanovic 
and G. Vattay [6], and incorporate the diffraction effects into the periodic orbit theory 
by G. Vattay, A. Wirzba and PE. Rosenqvist [ ]. Gaspard's monograph [8], which we 
warmly recommend, utilizes the 3-disk system in much more depth than will be attained 
here. For further links check ChaosBook.org. 

A pinball game does miss a number of important aspects of chaotic dynamics: generic 
bifurcations in smooth flows, the interplay between regions of stability and regions of 
chaos, intermittency phenomena, and the renormalization theory of the 'border of order' 
between these regions. To study these we shall have to face up to much harder challenge, 
dynamics of smooth flows. 

Nevertheless, pinball scattering is relevant to smooth potentials. The game of pinball 
may be thought of as the infinite potential wall limit of a smooth potential, and pinball 
symbolic dynamics can serve as a covering symbolic dynamics in smooth potentials. One 
may start with the infinite wall limit and adiabatically relax an unstable cycle onto the 
corresponding one for the potential under investigation. If things go well, the cycle will sect on 29 
remain unstable and isolated, no new orbits (unaccounted for by the pinball symbolic 
dynamics) will be born, and the lost orbits will be accounted for by a set of pruning rules. 
The validity of this adiabatic approach has to be checked carefully in each application, as 
things can easily go wrong; for example, near a bifurcation the same naive symbol string 
assignments can refer to a whole island of distinct periodic orbits. 

Remark 8.2 Stability analysis. The chapter 1 of Gaspard monograph [8] is rec- 
ommended reading if you are interested in Hamiltonian flows, and billiards in particular. 
A. Wirzba has generalized the stability analysis of sect. 8.2 to scattering off 3-dimensional 
spheres (follow the links in ChaosBook.org/extras). A clear discussion of linear sta- 
bility for the general ^-dimensional case is given in Gaspard [ 8], sect. 1.4. 
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Exercises 

8.1. A pinball simulator. Implement the disk — > disk 
maps to compute a trajectory of a pinball for a given 
starting point, and a given R.a = (center-to-center dis- 
tance): (disk radius) ratio for a 3-disk system. As this 
requires only computation of intersections of lines and 
circles together with specular reflections, implementa- 
tion should be within reach of a high-school student. 
Please start working on this program now; it will be con- 
tinually expanded in chapters to come, incorporating the 
Jacobian calculations, Newton root-finding, and so on. 
Fast code will use elementary geometry (only one 
-sf^ per iteration, rest are multiplications) and eschew 
trigonometric functions. Provide a graphic display of 
the trajectories and of the Poincare section iterates. To 
be able to compare with the numerical results of coming 
chapters, work with R.a = 6 and/or 2.5 values. Draw the 
correct versions of figure 1 .9 or figure 12.3 for R:a = 2.5 
and/or 6. 

8.2. Trapped orbits. Shoot 100,000 trajectories from one 
of the disks, and trace out the strips of figure 1 .9 for vari- 
ous R.a by color coding the initial points in the Poincare 
section by the number of bounces preceding their es- 
cape. Try also R:a = 6:1, though that might be too thin 
and require some magnification. The initial conditions 
can be randomly chosen, but need not - actually a clearer 
picture is obtained by systematic scan through regions of 
interest. 

8.3. Pinball stability. Add to your exercise 8.1 pinball 
simulator a routine that computes the [2x2] Jacobian 
matrix. To be able to compare with the numerical re- 
sults of coming chapters, work with R.a = 6 and/or 2.5 
values. 

8.4. Stadium billiard. Consider the Bunimovich sta- 
dium [9, 10] defined in figure 8.1. The Jacobian matrix 
associated with the reflection is given by (8. 10). Here we 



take pk = — 1 for the semicircle sections of the bound- 
ary, and cos <pk remains constant for all bounces in a ro- 
tation sequence. The time of flight between two semi- 
circle bounces is = 2 cos (pk- The Jacobian matrix 
of one semicircle reflection folowed by the flight to the 
next bounce is 

J ~ ( L) \ 0 1 j\ -2/cos0 t 1 J 

( _,J -3 2COS0* \ 
~ ( l) \ 2/cos0 t 1 J ■ 

A free flight must always be followed by k — 1, 2, 3, ■ ■ • 
bounces along a semicircle, hence the natural symbolic 
dynamics for this problem is nary, with the correspond- 
ing Jacobian matrix given by shear (ie. the eigenvalues 
remain equal to 1 throughout the whole rotation), and k 
bounces inside a circle lead to 

* = <-'>'( i3U <813> 
The Jacobian matrix of a cycle p of length n p is given 

by 

''- w)S *D(o ?)U ?)•*'*> 

Adopt your pinball simulator to the stadium billiard. 

8.5. A test of your pinball simulator. Test your exer- 
cise 8.3 pinball simulator by computing numerically 
cycle stabilities by tracking distances to nearby orbits. 
Compare your result with the exact analytic formulas of 
exercise 13.7 and 13.8. 

8.6. Birkhoff coordinates. Prove that the Birkhoff coor- 
dinates are phase-space volume preserving. 
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Chapter 9 



World in a mirror 



A detour of a thousand pages starts with a single misstep. 
— Chairman Miaw 



Dynamical systems often come equipped with symmetries, such as the re- 
flection and rotation symmetries of various potentials. In this chapter we 
study quotienting of discrete symmetries, and in the next chapter we study 
symmetry reduction for continuous symmetries. We look at individual orbits, and 
the ways they are interrelated by symmetries. This sets the stage for a discussion 
of how symmetries affect global densities of trajectories, and the factorization of 
spectral determinants to be undertaken in chapter 21. 

As we shall show here and in chapter 2 1 , discrete symmetries simplify the dy- 
namics in a rather beautiful way: If dynamics is invariant under a set of discrete 
symmetries G, the state space M is tiled by a set of symmetry-related tiles, and 
the dynamics can be reduced to dynamics within one such tile, the fundamental 
domain M/G. In presence of a symmetry the notion of a prime periodic orbit 
has to be reexamined: a set of symmetry-related full state space cycles is replaced 
by often much shorter relative periodic orbit, the shortest segment of the full state 
space cycle which tiles the cycle and all of its copies under the action of the group. 
Furthermore, the group operations that relate distinct tiles do double duty as letters 
of an alphabet which assigns symbolic itineraries to trajectories. 

Familiarity with basic group-theoretic notions is assumed, with details rele- 
gated to appendix 1. 1 . We find the abstract notions easier to digest by working 
out the examples interspersed throughout this chapter. The erudite reader might 
prefer to skip the lengthy group-theoretic overture and go directly to Q = Di 
example 9.12, example 9.14, and C3 V = D3 example 9.1, backtrack as needed. 
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Figure 9.1: The symmetries of three disks on an equi- 
lateral triangle. A fundamental domain is indicated by 
the shaded wedge. 



a 23~5 M 




9.1 Discrete symmetries 



Normal is just a setting on a washing machine. 
— Borgette, Borgo's daughter 




We show that a symmetry equates multiplets of equivalent orbits, or 'stratifies' the 
state space into equivalence classes, each class a 'group orbit'. We start by defin- 
ing a finite (discrete) group, its state space representations, and what we mean by 
a symmetry (invariance or equivariance) of a dynamical system. As is always the 
problem with 'gruppenpest' (read appendix A.6) way too many abstract notions 
have to be defined before an intelligent conversation can take place. Perhaps best 
to skim through this section on the first reading, then return to it later as needed. 

Definition: A group consists of a set of elements 



and a group multiplication rule gj o g { (often abbreviated as gjgi), satisfying 

1. Closure: If gi, gj € G, then gj o^eG 

2. Associativity: g k o ( gj o gi ) = (g k o gj ) o gi 

3. Identity e: g o e = e o g = g for all g e G 

4. Inverse g~ l : For every g e G, there exists a unique element h = g~ l € G 
such that 

hog-goh-e. 

If the group is finite, the number of elements, |G| = n, is called the order of the 

g rou P- , , H 

example 1.1 

example 1. 2 

Example 9.1 C 3v = D 3 symmetry of the 3-disk game of pintail: If the thre&WiftPte '-3 
radius disks in figure 9. 1 are equidistantly spaced, our game of pinball has a sixfold 



G - {e,g 2 , ...,g n , ••• 



(9.1) 
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symmetry. The symmetry group of relabeling the 3 disks is the permutation group S3; 
however, it is more instructive to think of this group geometrically as C% v , also known 
as the dihedral group 

D 3 = [e, o-i 2) fTi 3 ,a- 23 , C 1/3 , C 2/3 } , (9.2) 

the group of order \G\ = 6 consisting of the identity element e, three reflections across 
symmetry axes {cr 12 , 0-23, 0-13}, and two rotations by 2n/3 and 4-n /3 denoted {C 1/3 , C 2/3 }. 
(continued in example 9.6) 



Definition: Coordinate transformations. Consider a map x" = f(x), x,x' € 
At. An active coordinate transformation Mx corresponds to a non-singular [dxd] 
matrix M that maps the vector x e At onto another vector Mx e At. The corre- 
sponding passive coordinate transformation f(x) —> M~ l f(x) changes the coor- 
dinate system with respect to which the vector f(x) € At is measured. Together, 
a passive and active coordinate transformations yield the map in the transformed 
coordinates: 



f(x) = M-'f(Mx). (9.3) 



Example 9.2 Discrete groups of order 2 on R . Three types of discrete group of 
order 2 can arise by linear action on our 3-dimensional Euclidian space R 3 : 

reflections: cr{x, y, z) = (x, y, -z) 

rotations: C U2 (x,y,z) = (-x,-y,z) (9.4) 
inversions: P(x, y, z) = (-x, -y, -z) . 

cr is a reflection (or an inversion) through the [x,y] plane. C l/1 is [x,y]-plane, constant z 
rotation by n about thez-axis (or an inversion thorough thez-axis). P is an inversion (or 
parity operation) through the point (0, 0, 0). Singly each operation generates a group 
of order 2: D x = {e,o-}, C 2 = {e,C 1/2 }, andD x = {e,P}. Together, they form the dihedral 
group Dn = {e, cr, C 1/2 , P) of order 4. (continued in example 9.3) 



Definition: Matrix group. The set of [dx d] -dimensional real non-singular ma- 
trices A,B,C,... € GL(d) acting in a <i-dimensional vector space V € forms 
the general linear group GL(d) under matrix multiplication. The product of matri- 
ces A and B gives the matrix C, Cx = B(Ax) = (BA)x e V, for all x e V . The unit 
matrix 1 is the identity element which leaves all vectors in V unchanged. Every 
matrix in the group has a unique inverse. 
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Definition: Matrix representation. Linear action of a group element g on 
states x € M is given by a finite non-singular [d x d] matrix g, the matrix rep- 
resentation of element g e G. We shall denote by 'g' both the abstract group 
element and its matrix representation. 

However, when dealing simultaneously with several representations of the 
same group action, notation Dj(g), j a representation label, is preferable (see ap- 
pendix 1.1). A linear or matrix representation D(G) of the abstract group G acting 
on a representation space V is a group of matrices D(G) such that 

1. Any g e G is mapped to a matrix D(g) e D(G). 

2. The group product g2 ° gi is mapped onto the matrix product D(g 2 ° gi) — 
D(g 2 )D( gl ). 

3. The associativity follows from the associativity of matrix multiplication, 

D(g 3 o (g 2 o gi)) - D(g 3 )(D(g 2 )D( gl )) = (D(g 3 )(D(g 2 ))D( gl ). 

4. The identity element e € G is mapped onto the unit matrix D(e) = 1 and 
the inverse element g~ l e G is mapped onto the inverse matrix Dig" 1 ) = 
[D(g)T l = D~ l (g). 

Example 9.3 Discrete operations on R 3 . (continued from example 9.2) The matrix 
representation of reflections, rotations and inversions defined by (9.4) is 





■ 1 


0 


o ) 




< -1 


0 


0 ] 




< -1 


0 


0 ^ 


cr 


0 


1 


0 


, c 1/2 = 


0 


-1 


0 


, p = 


0 


-1 


0 




I o 


0 


-1 ) 




I o 


0 


1 > 




I o 


0 


-1 J 



with detC 1/2 = 1, deter = detP = -1; that is why we refer to C 1/2 as a rotation, andcr, P 
as inversions. As g 1 - e in all three cases, these are groups of order 2. (continued in 
example 9.5) 

If the coordinate transformation g belongs to a linear non-singular represen- 
tation of a discrete finite group G, for any element g e G there exists a number 
m < \G\ such that 

g m =gogo...og = e -» |detg| = l. (9.6) 

V ' 

m times 

As the modulus of its determinant is unity, det g is an with root of 1 . Hence all 
finite groups have unitary representations. 

Definition: Symmetry of a dynamical system. A group G is a symmetry of the 
dynamics if for every solution f(x) € M and g € G, gf(x) is also a solution. 

Another way to state this: A dynamical system (At,/) is invariant (or G- 
equivariant) under a symmetry group G if the time evolution / : At —> At (a 
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discrete time map /, or the continuous flow f map from the <i-dimensional man- 
ifold M into itself) commutes with all actions of G, 



f{gx) = gf{x). (9.7) 

In the language of physicists: The 'law of motion' is invariant, i.e., retains its form 
in any symmetry-group related coordinate frame (9.3), 

f(x) = g- 1 f(gx), (9.8) 

for x e M and any finite non-singular [dxd] matrix representation g of element 
g e G. As these are true any state x, one can state this more compactly as / o g - 

g ° /. or / = g' 1 ofog. 

Why 'equivariant?' A scalar function h(x) is said to be G-invariant if h(x) = 
h(gx) for all g e G. The group actions map the solution / : M. — > At into different 
(but equivalent) solutions gf(x), hence the invariance condition f(x) = g~ l f{gx) 
appropriate to vectors (and, more generally, tensors). The full set of such solu- 
tions is G-invariant, but the flow that generates them is said to be G-equivariant. 
It is obvious from the context, but for verbal emphasis applied mathematicians 
like to distinguish the two cases by in/equi-vanant. The distinction is helpful in 
distinguishing the dynamics written in the original, equivariant coordinates from 
the dynamics rewritten in terms of invariant coordinates, see sects. 9.5 and 10.4. 

exercise 9.7 
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Figure 9.3: The 3-disk pinball cycles: (a) 12, 13, 
23, 123; the clockwise 132 not drawn, (b) Cy- 
cle 1232; the symmetry related 1213 and 1323 not 
dra wn, (c) 12323; 12123, 12132, 12313, 13131 
and 13232 not drawn, (d) The fundamental do- 
main, i.e., the l/6th wedge indicated in (a), con- 
sisting of a section of a disk, two segments of sym- 
metry axes acting as straight mirror walls, and the 
escape gap to the left. The above 14 full-space cy- 
cles restricted to the fundamental domain and re- 
coded in binary reduce to the two fixed points 0, 
1, 2-cycle 10, and 5-cycle 00111 (not drawn). See 
figure 9.9 for the 001 cycle. 




\ / / 


f 


J\/1232 






r / \ 1 

/ \ 

/ \ 

/ \ 

\ \ 





(c) 




Example 9.4 A reflection symmetric Id map. Consider a 1d map f with reflection 
symmetry f(-x) = -f(x), such as the bimodal 'sawtooth' map of figure 9.2, piecewise- 
linear on the state space M = [-1, 1], a compact 1 -dimensional line interval, split into 
three regions M = Ml u Mq U Mr. Denote the reflection operation by crx = -x. The 
2-element group G = {e,cr} goes by many names, such as Z 2 or Co. Here we shall 
refer to it as D., dihedral group generated by a single reflection. The G-equivariance 
of the map implies that if {x„} is a trajectory, than also {crx,,} is a symmetry-equivalent 
trajectory because crx n+ i = crf(x n ) = f(crx n ) (continued in example 9. 12) 



Example 9.5 Equivariance of the Lorenz flow, (continued from example 9.3) The 
velocity field in Lorenz equations (2.13) 



X 




cr(y - x) 


y 




px-y - xz 


z 




xy — bz 



is equivariant under the action of cyclic group C2 = {e, C 1/2 } acting on R 3 by a n rotation 
about the z axis, 

C l,2 (x,y,z) = (-x,-y,z). (9.9) 
(continued in example 9. 14) 



Example 9.6 3-disk game of pinball - symmetry-related orbits: (continued from 
example 9.1) Applying an element (identity, rotation by ±2nl3, or one of the three 
possible reflections) of this symmetry group to a trajectory yields another trajectory. 
For instance, cr 23 , the flip across the s ymmetr y ax is goin g through disk 1 interchanges 
the symbols 2 and 3; it maps the cycle 12123 into 13132, figure 9.3 (c). Cycles 12, 23, 
and 13 in figure 9.3(a) are related to each other by rotation by +2n/3, or, equivalently, 
by a relabeling of the disks, (continued in example 9.8) 
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Example 9.7 Discrete symmetries of the plane Couette flow. The plane Couette 
flow is a fluid flow bounded by two countermoving planes, in a cell periodic in stream- 
wise and spanwise directions. The Navier-Stokes equations for the plane Couette flow 
have two discrete symmetries: reflection through the (streamwise , wall-normal) plane, 
and rotation by n in the (streamwise , wall-normal) plane. That is why the system has 
equilibrium and periodic orbit solutions, (as opposed to relative equilibrium and relative 
periodic orbit solutions discussed in chapter 10). They belong to discrete symmetry 
subspaces. (continued in example 10.4) 



9.1.1 Subgroups, cosets, classes 

Inspection of figure 9.3 indicates that various 3-disk orbits are the same up to a 
symmetry transformation. Here we set up some abstract group-theoretic notions 
needed to describe such relations. The reader might prefer to skip to sect. 9.2, 
backtrack as needed. 



Definition: Subgroup. A set of group elements H = [e, bi,b^, . . . , bh) Q G 
closed under group multiplication forms a subgroup. 

Definition: Coset. Let H = {e, b2, £3, ■ - ■ , b^\ Q G be a subgroup of order h = 
\H\. The set of h elements {c, cb2, cb^, . . . , cb/,}, c e G but not in H, is called left 
coset cH. For a given subgroup H the group elements are partitioned into H and 
m — 1 cosets, where m = \G\/\H\. The cosets cannot be subgroups, since they do 
not include the identity element. We learn that a nontrival subgroup can exist only 
if \G\, the order of the group, is divisible by \H\, the order of the subgroup, i.e., 
only if \G\ is not a prime number. 

Example 9.8 Subgroups, cosets of D 3 : (continued from example 9.6) The 

3-disks symmetry group, the D 3 dihedral group (9.2) has six subgroups 

{e), {e,o- l2 }, {e,o- 13 }, {e,o- 23 }, {e,C l/ \C 2/ \ D 3 . (9.10) 

The left cosets of subgroup Di = {e,o- 12 } are {<t 13 ,C 1/3 }, {cr 23 ,C 2/3 }. The coset of 
subgroup C 3 = {e, C 1/3 , C 2/3 } is {cr 12 ,<r 13, o"2 3 ). The significance of t he co set is that if a 
solution has a symmetry H, for example the symmetry of a 3-cycle 123 is C 3 , then all 
elements in a coset act on it the same way, for example {o- l2 , o- L3 , <t 23 }123 = 132. 

The nontrivial subgroups of D 3 are Di = [e, cr), consisting of the identity and 
any one of the reflections, of order 2, and C 3 = [e, C 1/3 , C 2/3 }, of order 3, so possible 
cycle multiplicities are \G\/\G P \ = 1, 2, 3 or 6. Only the fixed point at the origin has 
full symmetry G p = G. Such equilibria exist for smooth potentials, but not for the 3- 
disk billiard. Examples of other multiplicities are given in figure 9.3 and figure 9.7. 
(continued in example 9.9) 

Next we need a notion that will, for example, identify the three 3-disk 2-cycles 
in figure 9.3 as belonging to the same class. 
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Definition: Class. An element b € G is conjugate to a if & - cac where c is 
some other group element. If b and c are both conjugate to a, they are conjugate 
to each other. Application of all conjugations separates the set of group elements 
into mutually not-conjugate subsets called classes, types or conjugacy classes. 
The identity e is always in the class {e} of its own. This is the only class which is 

oxo rc i s 

a subgroup, all other classes lack the identity element. 

Example 9.9 Z> 3 symmetry - classes: (continued from example 9.8) The three 
classes of the 3-disk symmetry group D 3 = {e, C 1/3 , C 2/ \ <r, <xC 1/3 , crC 2/3 }, are fhe /den- 
tity, any one of the reflections, and the two rotations, 



In other words, the group actions either flip or rotate, (continued in example 9. 13) 

Physical importance of classes is clear from (9.8), the way coordinate trans- 
formations act on mappings: action of elements of a class (say reflections, or 
rotations) is equivalent up to a redefinition of the coordinate frame. 

Definition: Invariant subgroup. A subgroup H c G is an invariant subgroup 
or normal divisor if it consists of complete classes. Class is complete if no conju- 
gation takes an element of the class out of H. 

Think of action of H within each coset as identifying its \H\ elements as equiv- 
alent. This leads to the notion of the factor group or quotient group G/H of G, 
with respect to the invariant subgroup H. H thus divides G into H and m — 1 
cosets, each of order \H\. The order of G/H is m = \G\/\H\, and its multiplication 
table can be worked out from the G multiplication table class by class, with the 
subgroup H playing the role of identity. G/H is homeomorphic to G, with \H\ 
elements in a class of G represented by a single element in G/H. 

9.1.2 Orbits, quotient space 

So far we have discussed the structure of a group as an abstract entity. Now we 
switch gears and describe the action of the group on the state space. This is the key 
step; if a set of solutions is equivalent by symmetry (a circle, let's say), we would 
like to represent it by a single solution (cut the circle at a point, or rewrite the 
dynamics in a 'reduced state space,' where the circle of solutions is represented 
by a single point). 



Definition: Orbit. The subset M Xo c M traversed by the infinite-time trajec- 
tory of a given point xq is called the orbit (or time orbit, or solution) x(t) = /(xq). 
An orbit is a dynamically invariant notion: it refers to the set of all states that can 




(9.11) 



section 2.1 
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be reached in time from xq, thus as a set it is invariant under time evolution. The 
full state space M is foliated into a union of such orbits. We label a generic orbit 
M Xo by any point belonging to it, xq = x(0) for example. 

A generic orbit might be ergodic, unstable and essentially uncontrollable. The 
ChaosBook strategy is to populate the state space by a hierarchy of orbits which 
are compact invariant sets (equilibria, periodic orbits, invariant tori, . . .), each 
computable in a finite time. They are a set of zero Lebesgue measure, but dense 
on the non-wandering set, and are to a generic orbit what fractions are to normal 
numbers on the unit interval. We label orbits confined to compact invariant sets by 
whatever alphabet we find convenient in a given context: point EQ = xeq = Meq 
for an equilibrium, 1 -dimensional loop p = M p for a prime periodic orbit p, etc. 
(note also discussion on page 203, and the distinction between trajectory and orbit 
made in sect. 2.1; a trajectory is a finite-time segment of an orbit). 

Definition: Group orbit or the G-orbit of the point x e M is the set 



of all state space points into which x is mapped under the action of G. If G is a 
symmetry, intrinsic properties of an equilibrium (such as stability eigenvalues) or 
a cycle p (period, Floquet multipliers) evaluated anywhere along its G-orbit are 
the same. 

A symmetry thus reduces the number of inequivalent solutions Mp. So we 
also need to describe the symmetry of a solution, as opposed to (9-8), the sym- 
metry of the system. We start by defining the notions of reduced state space, of 
isotropy of a state space point, and of the symmetry of an orbit. 

Definition: Reduced state space. The action of group G partitions the state 
space At into a union of group orbits. This set of group orbits, denoted M/G, has 
many names: reduced state space, quotient space or any of the names listed on 
page 193. 

Reduction of the dynamical state space is discussed in sect. 9.4 for discrete 
symmetries, and in sect. 10.4 for continuous symmetries. 

Definition: Fixed-point subspace. Mh is the set of all state space points left 
H-fixed, point-wise invariant under subgroup or 'centralizer' H c G action 



Points in state space subspace Mg which are fixed points of the full group action 
are called invariant points, 



M x = {gx\geG) 



(9.12) 



M H = Fix (H) = {x e M : h x = x for all h e H} . 



(9.13) 



Mg - Fix (G) = {x e M : g x - x for all g e G} . 



(9.14) 
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Definition: Flow invariant subspace. A typical point in fixed-point subspace 
Mh moves with time, but, due to equivariance (9.7), its trajectory x(t) = f(x) 
remains within /(Mh) Q Mh for all times, 



i.e., it belongs to a flow invariant subspace. This suggests a systematic approach 
to seeking compact invariant solutions. The larger the symmetry subgroup, the 
smaller Mh, easing the numerical searches, so start with the largest subgroups H 
first. 

We can often decompose the state space into smaller subspaces, with group 
acting within each 'chunk' separately: 

Definition: Invariant subspace. M a c M is an invariant subspace if 



{0} and M are always invariant subspaces. So is any Fix{H) which is point-wise 
invariant under action of G. 



Definition: Irreducible subspace. A space M a whose only invariant subspaces 
are {0} and M a is called irreducible. 



9.2 Symmetries of solutions 

The solutions of an equivariant system can satisfy all of the system's symmetries, a 
subgroup of them, or have no symmetry at all. For a generic ergodic orbit f(x) the 
trajectory and any of its images under action of g e G are distinct with probability 
one, f\x) n gf (x) = 0 for all t, f. For example, a typical turbulent trajectory 
of pipe flow has no symmetry beyond the identity, so its symmetry group is the 
trivial {e). For compact invariant sets, such as fixed points and periodic orbits the 
situation is very different. For example, the symmetry of the laminar solution of 
the plane Couette flow is the full symmetry of its Navier-Stokes equations. In 
between we find solutions whose symmetries are subgroups of the full symmetry 
of dynamics. 

Definition: Isotropy subgroup. The maximal set of group actions which maps 
a state space point x into itself, 



hf t {x) = f{hx)=f t {x), heH, 



(9.15) 



{M a ■ gx e M a for all g e G and x e M a ) ■ 



(9.16) 



G x = {g € G : gx = x) , 



(9.17) 
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is called the isotropy group or little group of x. 

A solution usually exhibits less symmetry than the equations of motion. The 
symmetry of a solution is thus a subgroup of the symmetry group of dynamics. 
We thus also need a notion of set-wise invariance, as opposed to the point-wise 
invariance under G x . 



exercise 9.2 



Definition: Symmetry of a solution, -symmetric cycle. We shall refer to the 
subset of nontrivial group actions G p cGon state space points within a compact 
set M p , which leave no point fixed but leave the set invariant, as the symmetry G p 
of the solution M p , 

G p = {ge G p : gx e M p , gxtxforgte], (9.18) 

and reserve the notion of 'isotropy' of a set h\ for the subgroup G p that leaves 
each point in it fixed. 

A cycle p is G p - symmetric {set-wise symmetric, self-dual) if the action of 
elements of G p on the set of periodic points M p reproduces the set. g € G p acts 
as a shift in time, mapping the periodic point x e A\, into another periodic point. 

Example 9.10 D \ -symmetric cycles: For D\ the period of a set-wise symmetric 
cycle is even (n s = 2n- s ), and the mirror image of the x s periodic point is reached by 
traversing the relative periodic orbit segment s of length n- s , f" s (x s ) - crx s , see fig- 
ure 9.4 (b). 



Definition: Conjugate symmetry subgroups. The splitting of a group G into 
a symmetry group G p of orbit M p and m — 1 cosets cG p relates the orbit M p to 
m - 1 other distinct orbits cM p . All of them have equivalent symmetry subgroups, 
or, more precisely, the points on the same group orbit have conjugate symmetry 
subgroups (or conjugate stabilizers): 

G C p = cG pC - x , (9.19) 

i.e., if G p is the symmetry of orbit M p , elements of the coset space g e G/G p 
generate the m p - 1 distinct copies of M p , so for discrete groups the multiplicity 
of orbit p is m p - |G|/|G P |. 



exercise 9.4 



Definition: G p -fixed orbits: An equilibrium x q or a compact solution p is point- 
wise or Gp-fixed if it lies in the invariant points subspace Fix^Gp), gx - x for all 
g e G p , and x — x q or x 6 M p . A solution that is G-invariant under all group G 
operations has multiplicity 1. Stability of such solutions will have to be examined 
with care, as they lie on the boundaries of domains related by the action of the 
symmetry group. 
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Figure 9.4: The Di-equivariant bimodal sawtooth 
map of figure 9.2 has three types of periodic or- 
bits: (a) D i -fixed fixed point C, asymmetric fixed 
points pair (L, R}. (b) Di-symmetric (setwise in- 
variant) 2-cycle LR. (c) Asymmetric 2-cycles pair 
{LC,CR}. (continued in figure 9.8) (Y. Lan) 
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Example 9.11 Di-invariant cycles: In the example at hand there is only one G- 
invariant (point-wise invariant) orbit, the fixed point C at the origin, see figure 9.4 (a). As 
reflection symmetry is the only discrete symmetry that a map of the interval can have, 
this example completes the group-theoretic analysis of 1 -dimensional maps. We shall 
continue analysis of this system in example 9. 16, and work out the symbolic dynamics 
of such reflection symmetric systems in example 12.5. 



In the literature the symmetry group of a solution is often called stabilizer 
or isotropy subgroup. Saying that G p is the symmetry of the solution p, or that 
the orbit M p is 'G p -invariant,' accomplishes as much without confusing you with 
all these names (see remark 9.1). In what follows we say "the symmetry of the 
periodic orbit p is C2 = {e,R}," rather than bandy about 'stabilizers' and such. 

The key concept in the classification of dynamical orbits is their symmetry. 
We note three types of solutions: (i) fully asymmetric solutions a, (ii) subgroup 
Gs set-wise invariant cycles s built by repeats of relative cycle segments s, and 
(iii) isotropy subgroup Geq -invariant equilibria or point- wise Gp-fixed cycles b. 



Definition: Asymmetric orbits. An equilibrium or periodic orbit is not sym- 
metric if {x a } n {gx a } = 0 for any g € G, where {x a } is the set of periodic points 
belonging to the cycle a. Thus g e G generate \G\ distinct orbits with the same 
number of points and the same stability properties. 



Example 9.12 Group Di - a reflection symmetric Id map: Consider the bimodal 
'sawtooth' map of example 9.4, with the state space M = [-1, 1] split into three regions 
M = {M L , M c , Mr) which we label with a 3-letter alphabet L(eft), C (enter), andR(ight). 
The symbolic dynamics is complete ternary dynamics, with any sequence of letters 
Ji = {L, C, R) corresponding to an admissible trajectory ('complete' means no additional 
grammar rules required, see example 11.6 below). The Di-equivariance of the map, 
Di = {e, a-}, implies that if{x„) is a trajectory, so is {(rx n ). 

Fix(G), the set of points invariant under group action of D u M n crM, is just 
this fixed point x = 0, the reflection symmetry point. If a is an asymmetric cycle, 0- maps 
it into the reflected cycle crc^ with the same period and the_ same stability properties, 
see the fixed points pair {L, R) and the 2-cycles pair {LC, CR) in figure 9.4 (c). 



The next illustration brings in the non-abelian, noncommutative group struc- 
ture: for the 3-disk game of pinball of sect. 1.3, example 9.1 and example 9.17, 
the symmetry group has elements that do not commute. 



exercise 9.5 
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Example 9.13 3-disk game of pinball - cycle symmetries: (continued from exam- 
ple 9.9) The Ci subgroup G p = {e, C 1/3 , C 2/3 } invariance is exemplified by the two cy- 
cles 123 and 132 which are invariant under rotations by 2n/3 and 4-n /3, but are mapped 
into each other by any reflection, figure 9. 7 (a), and have multiplicity \G\/\G P \ = 2. 

The C v type of a subgroup is exem plifi ed by th e sym metries of p = 1213. This 
cycle is invariant under reflection o- 23 {1213} = 1312 = 1213, so the invarian t subgroup 
is Gp = {e, o- 2 3), with multiplicity is trip = \G\/\G P \ = 3; the cycles in this class, 1213, 1232 
and 1323, are related by 2n 13 rotations, figure 9.7(b). 

A cycle of no symmetry, such as 12123, has G p = {e} and contributes in all six 
copies (the remaining cycles in the class are 12132, 12313, 12323, 13132 and 13232;, 
figure 9.7(c). 

Besides the above spatial symmetries, for Hamiltonian syste ms cycles may 
be related by time rever sal symmetry. An example are the cycles 121212313 and 
313212121 = 121213132 which have the same periods and stabilities, but are related 
by no space symmetry, see figure 9. 7. (continued in example 9. 1 7) 



Consider next perhaps the simplest 3 -dimensional flow with a symmetry, the 
iconic flow of Lorenz. The example is long but worth working throug: the symmetry- 
reduced dynamics is much simpler than the original Lorenz flow. 

J t- b exercise 9.7 

exercise 9.8 

Example 9.14 Desymmetrization of Lorenz flow: (continuation of example 9^f rc Lofehz 
equation (2.13) is equivariant under (9.9), the action of order-2 group C2 = {e,C l/2 }, 
where C 1/2 is [x,y]-plane, half-cycle rotation by n about the z-axis: 



(x,y,z) -> C U2 (x,y,z) = (-x,-y,z). 



(9.20) 



(C 1/2 ) 2 = 1 condition decomposes the state space into two linearly irreducible sub- 
spaces M = M + ®M~, the z-axis M + and the [x,y] plane M~, with projection operators 
onto the two subspaces given by (see sect. ??j 



1 



(1 + C 1/2 ) 



0 0 0 ^ 
0 0 0 
0 0 1 



1 



(1-C 1/2 ) 



1 1 0 0 \ 
0 1 0 

0 0 0, 



(9.21) 



As the flow is Co-invariant, so is its linearization x = Ax. Evaluated at EQ 0 , A com- 
mutes with C 1/2 , and, as we have already seen in example 4.6, the EQo stability matrix 
decomposes into [x, y] and z blocks. 

The 1 -dimensional M + subspace is the fixed-point subspace, with the z-axis 
points left point-wise invariant under the group action 



M + = Fix(C 2 ) = {x e M : gx = x for g e \e,C 1/2 }} 



(9.22) 



(here x = (x,y,z) is a 3-dimensional vector, not the coordinate x). A C2-fixed point x(t) 
in Fix (C2) moves with time, but according to (9. 15) remains within x(t) e Fix (C2) for all 
times; the subspace M + = Fix(C2) is flow invariant. In case at hand this jargon is a bit 
of an overkill: clearly for(x,y,z) = (0,0,z) the full state space Lorenz equation (2.13) is 
reduced to the exponential contraction to the EQ Q equilibrium, 



z = -bz. 



(9.23) 



However, for higher-dimensional flows the flow-invariant subspaces can be high-dim- 
ensional, with interesting dynamics of their own. Even in this simple case this subspace 
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Figure 9.5: Lorenz attractor of figure 3.4, the full state 
space coordinates [x,y, z], with the unstable manifold 
orbits W"(EQo). (Green) is a continuation of the unsta- 
ble e (1) of EQo, and (brown) is its ^-rotated symmetric 
partner. Compare with figure 9.6. (E. Siminos) 




Figure 9.6: (a) Lorenz attractor plotted in [x,y, z], 
the doubled-polar angle coordinates (9.24), with 
points related by ^-rotation in the [x, y] plane iden- 
tified. Stable eigenvectors of EQo: e <3) and e <2 \ 
along the z axis (9.23). Unstable manifold orbit 
W"(EQ 0 ) (green) is a continuation of the unstable 
e' 1 ' of EQq. (b) Blow-up of the region near EQi: 
The unstable eigenplane of EQ\ defined by Re e (2) 
and Im e <2) , the stable eigenvector e (3) . The descent 
of the EQo unstable manifold (green) defines the 
innermost edge of the strange attractor. As it is 
clear from (a), it also defines its outermost edge. 
(E. Siminos) 





(b) Im 



plays an important role as a topological obstruction: the orbits can neither enter it nor 
exit it, so the number of windings of a trajectory around it provides a natural, topological 
symbolic dynamics. 

The Mr subspace is, however, not flow-invariant, as the nonlinear terms z = 
xy-bz in the Lorenz equation (2. 13) send all initial conditions within Mr = (x(0),y(0), 0) 
into the full, z(t) + 0 state space M/M + . 

By taking as a Poincare section any C 1/2 -equivariant, non-self-intersecting sur- 
face that contains the z axis, the state space is divided into a half-space fundamental 
domain M = Ml Co and its 180° rotation C l/2 M. An example is afforded by the V plane 
section of the Lorenz flow in figure 3.4. Take the fundamental domain M to be the half- 
space between the viewer and V. Then the full Lorenz flow is captured by re-injecting 
back into M any trajectory that exits it, by a rotation ofn around the z axis. 

As any such C l 12 -invariant section does the job, a choice of a 'fundamental 
domain' is here largely mater of taste. For purposes of visualization it is convenient 
to make the double-cover nature of the full state space by M explicit, through any 
state space redefinition that maps a pair of points related by symmetry into a single 
point. In case at hand, this can be easily accomplished by expressing (x,y) in polar 
coordinates (x,y) = (rcos6>, rsinfl), and then plotting the flow in the 'doubled-polar 
angle representation:' 



section 9.5 
se 9.8 



(Jc, y, z) = (r cos 26, r sin 26, z) = ((x 2 - y 2 )/r, 2xy/r, z) , ^SSJf 

as in figure 9.6(a). In contrast to the original G-equivariant coordinates [x,y,z], the 
Lorenz flow expressed in the new coordinates [Jc, y, z] is G-invariant, see example 9. 18. 
In this representation the M = M/C2 fundamental domain flow is a smooth, continuous 
flow, with (any choice of) the fundamental domain stretched out to seamlessly cover the 
entire [x,y] plane, (continued in example 11.4) 
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(E. Siminos and J. Halcrow) 

Note: nonlinear coordinate transformations such as the doubled-polar angle 
representation (9.24) are not required to implement the symmetry quotienting 
Wl/G.We deploy them only as a visualization aid that might help the reader dis- 
entangle 2-dimensional projections of higher-dimensional flows. All numerical 
calculations can still be carried in the initial, full state space formulation of a flow, 
with symmetry-related points identified by linear symmetry transformations. 



in depth: 

appendix I, p. 860 



9.3 Relative periodic orbits 



So far we have demonstrated that symmetry relates classes of orbits. Now we 
show that a symmetry reduces computation of periodic orbits to repeats of shorter, 
'relative periodic orbit' segments. 

Equivariance of a flow under a symmetry means that the symmetry image of 
a cycle is again a cycle, with the same period and stability. The new orbit may be 
topologically distinct (in which case it contributes to the multiplicity of the cycle) 
or it may be the same cycle. 

A cycle p is G p -symmetric under symmetry operation g e G p if the operation 
acts on it as a shift in time, advancing a cycle point to a cycle point on the sym- 
metry related segment. The cycle p can thus be subdivided into trip repeats of a 
relative periodic orbit segment, 'prime' in the sense that the full state space cycle 
is built from its repeats. Thus in presence of a symmetry the notion of a periodic 
orbit is replaced by the notion of the shortest segment of the full state space cycle 
which tiles the cycle under the action of the group. In what follows we refer to this 
segment as a relative periodic orbit. In the literature this is sometimes referred to 
as a short periodic orbit, or, for finite symmetry groups, as a pre-periodic orbit. 

Relative periodic orbits (or equivariant periodic orbits) are orbits x{t) in state 
space M which exactly recur 

x(t)=gx(t + T) (9.25) 

for the shortest fixed relative period T and a fixed group action g € G p . Parameters 
of this group action are referred to as 'phases' or 'shifts.' For a discrete group 
g" 1 = e for some finite m, by (9.6), so the corresponding full state space orbit is 
periodic with period mT. 

The period of the full orbit is given by the m p x (period of the relative periodic 
orbit), T p = \Gp\Tp, and the ith Floquet multiplier A p j is given by K: p . of the 
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Figure 9.7: Cycle 12121 2313 has multipl icity 6; 
show n here is 12 1313132 = rx 23 121212313. How- 
ever, 121231313 which has the same stability and 
period is related to 121313132 by time reversal, 
but not by any C3,, symmetry. 




relative periodic orbit. The elements of the quotient space b e G/G p generate the 
copies bp, so the multiplicity of the full state space cycle p is nip = \G\/\G p \. 

Example 9.15 Relative periodic orbits of Lorenz flow: (continuation of exam- 
ple 9. 14) The relation between the full state space periodic orbits, and the fundamen- 
tal domain (9.24) reduced relative periodic orbits of the Lorenz flow: an asymmetric full 
state space cycle pair p, Rp maps into a single cycle p in the fundamental domain, and 
any self-dual cycle p - Rp - pRp is a repeat of a relative periodic orbit p. 



9.4 Dynamics reduced to fundamental domain 



I submit my total lack of apprehension of fundamental 
concepts. 

— John F. Gibson 



So far we have used symmetry to effect a reduction in the number of independent 
cycles, by separating them into classes, and slicing them into 'prime' relative orbit 
segments. The next step achieves much more: it replaces each class by a single 
(typically shorter) prime cycle segment. 



1. Discrete symmetry tessellates the state space into dynamically equivalent 
domains, and thus induces a natural partition of state space: If the dynamics 
is invariant under a discrete symmetry, the state space M can be completely 
tiled by a fundamental domain M and its symmetry images M a = aM, 
Mb = bM, . . . under the action of the symmetry group G = \e, a, b, . . .}, 

At = M u M a u M b ■ ■ ■ u M\ G \ ■ (9.26) 

2. Discrete symmetriy can be used to restrict all computations to the funda- 
mental domain M = M/G, the reduced state space quotient of the full state 
space M by the group actions of G. 

We can use the invariance condition (9.7) to move the starting point x into 
the fundamental domain x = ax, and then use the relation d~ l b = h~ l to 
also relate the endpoint y e Mb to its image in the fundamental domain M. 
While the global trajectory runs over the full space M, the restricted trajec- 
tory is brought back into the fundamental domain M any time it exits into 
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Figure 9.8: The bimodal Ulam sawtooth map of 
figure 9.4 with the Di symmetry f(—x) = —f(x) 
restricted to the fundamental domain. f(x) is in- 
dicated by the thin line, and fundamental domain 
map f(x) by the thick line, (a) Boundary fixed 
point C is the fixed point 0. The asymmetric fixed 
point pair {L,R} is reduced to the fixed point 2, 
and the full state space symmetric 2-cycle LR is 
reduced to the fixed point 1 . (b) The asymmetric 
2-cycle pair [LC,CR] is reduced to 2-cycle 01. (c) 
All fundamental domain fixed points and 2-cycles. 
(Y. Lan) 



an adjoining tile; the two trajectories are related by the symmetry operation 
h which maps the global endpoint into its fundamental domain image. 

Cycle multiplicities induced by the symmetry are removed by reduction 
of the full dynamics to the dynamics on a fundamental domain. Each 
symmetry-related set of global cycles p corresponds to precisely one fun- 
damental domain (or relative) cycle p. 

Conversely, each fundamental domain cycle p traces out a segment of the 
global cycle p, with the end point of the cycle p mapped into the irreducible 
segment of p with the group element hp. A relative periodic orbit segment 
in the full state space is thus a periodic orbit in the fundamental domain. 

The group elements G = {e,g2, ■ ■ ■ ,g\G\) which map the fundamental do- 
main M into its copies gM, serve also as letters of a symbolic dynamics 
alphabet. 

For a symmetry reduction in presence of continuous symmetries, see sect. 10.4. 

exercise 9.6 

Example 9.16 Group Di and reduction to the fundamental domain. Consider 
again the reflection-symmetric bimodal Ulam sawtooth map f{-x) = - f{x) of exam- 
ple 9.12, with symmetry group Di = {e,cr}. The state space M = [-1,1] can be tiled by 
half -line M = [0, 1], and crM = [-1,0], its image under a reflection across x = 0 point. 
The dynamics can then be restricted to the fundamental domain x k e M- [0, 1]; every 
time a trajectory leaves this interval, it is mapped back using cr. 

In figure 9.8 the fundamental domain map f(x) is obtained by reflecting x < 0 
segments of the global map f(x) into the upper right quadrant, f is also bimodal and 
piecewise-linear, with M = [0, 1] split into three regions M = {Mo, M\, Mi) which we 
label with a 3-letter alphabet & = [0, 1, 2). The symbolic dynamics is again complete 
ternary dynamics, with any sequence of letters {0, 1, 2} admissible. 

However, the interpretation of the 'desymmetrized' dynamics is quite different 
- the multiplicity of every periodic orbit is now 1, and relative periodic segments of the 
full state space dynamics are all periodic orbits in the fundamental domain. Consider 
figure 9.8: 

In (a) the boundary fixed point C is also the fixed point 0. 

The asymmetric fixed point pair {L,R} is reduced to the fixed point 2, and the 
full state space symmetric 2-cycle LR is reduced to the fixed point I. The asymmetric 
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Figure 9.9: (a) The pair of full-space 9-cycles, the 
counter-clockwise 121232313 and the clockwise 
131323212 correspond to (b) one fundamental do- 
main 3-cycle 001. 



(b) 



(a) 




2-cycle pair {LC,CR} is reduced to the 2-cycle 01. Finally, the symmetric 4-cycle LCRC 
is reduced to the 2-cycle 02. This completes the conversion from the full state space 
for all fundamental domain fixed points and 2-cycles, figure 9.8(c). 



Example 9.17 3-disk game of pinball in the fundamental domain 

If the dynamics is equivariant under interchanges of disks, the absolute disk 
labels e, = l,2,---,N can be replaced by the symmetry-invariant relative disk-^disk 
increments g it where g, is the discrete group element that maps disk i-l into disk i. For 
3-disk system g t is either reflection cr back to initial disk (symbol V) or 2n/3 rotation 
by C to the next disk (symbol '1 '). An immediate gain arising from symmetry invariant 
relabeling is that N-disk symbolic dynamics becomes (N- 1 )-nary with no restrictions 
on the admissible sequences. 

An irreducible segment corresponds to a periodic orbit in the fundamental do- 
main, a one-sixth slice of the full 3-disk system, with the symmetry axes acting as 
reflecting mirrors (see figure 9.3(d)). A set of orbits related in the full space by dis- 
crete symmetries maps onto a single fundamental domain orbit. The reduction to 
the fundamental domain desymmetrizes the dynamics and removes all global discrete 
symmetry-induced degeneracies: rotationally symmetric global orbits (such as the 3- 
cycles 123 and 132) have multiplicity 2, reflection symmetric ones (such as the 2-cycles 
12, 13 and 23) have multiplicity 3, and global orbits with no symmetry are 6-fold degen- 
erate. Table 12.2 lists some of the shortest binary symbols strings, together with the 
corresponding full 3-disk symbol sequences and orbit symmetries. Some examples of 
such orbits are shown in figures 9. 7 and 9.9. (continued in example 12. 7) 



Physical laws should have the same form in symmetry-equivalent coordinate frames, 
so they are often formulated in terms of functions (Hamiltonians, Lagrangians, 



9.5 Invariant polynomials 
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■ • ■) invariant under a given set of symmetries. The key result of the representation 
theory of invariant functions is: 

Hilbert-Weyl theorem. For a compact group G there exists a finite G-invariant 
homogenous polynomial basis {ii\,ii2, ■ . . , u m \, m > d, such that any G-invariant 
polynomial can be written as a multinomial 

h(x) = p(u\(x), u%(x), . . . , u m {x)) , xeM. (9.27) 

These polynomials are linearly independent, but can be functionally dependent 
through nonlinear relations called syzygies. 

Example 9.18 Polynomials invariant under discrete operations on R 3 . (continued 
from example 9.2) cr is a reflection through the [x,y] plane. Any \e,cr) -invariant 
function can be expressed in the polynomial basis \u\,u 2 , "3) = \x,y,z 2 }. 

C 1/2 is a [x,y]-plane rotation by n about the z-axis. Any {e,C l/2 }-invariant func- 
tion can be expressed in the polynomial basis \u\,u 2 ,u^,u^\ = {x 2 ,xy,y 2 ,z}, with one 
syzygy between the basis polynomials, 0 2 )(y 2 ) - (xy) 2 = 0. 

P is an inversion through the point (0, 0, 0). Any [e, P}-invariant function can be 
expressed in the polynomial basis {u\, ■ ■ ■ , u^i = {x 2 , y 2 , z 2 , xy, xz, yz), with three syzy- 
gies between the basis polynomials, (x 2 )(y 2 ) - (xy) 2 = 0, and its 2 permutations. 

For the D 2 dihedral group G = [e, cr, C 1/2 ,P} the G-invariant polynomial basis 
is {u 1 ,u 2 ,u3,u 4 } = {x 2 ,y 2 ,z 2 ,xy}, with one syzygy, (x 2 )(y 2 ) - (xy) 2 = 0. (continued in 
example 10.13) 

In practice, explicit construction of G-invariant basis can be a laborious un- 
dertaking, and we will not take this path except for a few simple low-dimensional 
cases, such as the 5-dimensional example of sect. 10.5. We prefer to apply the 
symmetry to the system as given, rather than undertake a series of nonlinear co- 
ordinate transformations that the theorem suggests. (What 'compact' in the above 
refers to will become clearer after we have discussed continuous symmetries. For 
now, it suffices to know that any finite discrete group is compact.) 

©x©rcis6 y.i 

Resume 



A group G is a symmetry of the dynamical system (At, f) if its 'law of motion' 
retains its form under all symmetry-group actions, f(x) = g~ l f(gx) . A mapping u 
is said to be invariant if gu = u, where g is any element of G. If the mapping and 
the group actions commute, gu = ug, u is said to be equivariant. The governing 
dynamical equations are equivariant with respect to G. 

We have shown here that if a dynamical system (At, f) has a symmetry G, 
the symmetry should be deployed to 'quotient' the state space toAl = At/G, i.e., 
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identify all symmetry-equivalent x e M on each group orbit, thus replacing the 
full state space dynamical system (M, f) by the symmetry-reduced (M, f). The 
main result of this chapter can be stated as follows: 

In presence of a discrete symmetry G, associated with each full state space 
solution p is the group of its symmetries G p c G of order 1 < \G P \ < \G\, whose 
elements leave the orbit M p invariant. The elements of G p act on p as shifts, tiling 
it with \G P \ copies of its shortest invariant segment, the relative periodic orbit p. 
The elements of the coset b e G/G p generate m p - |G|/|G P | equivalent copies of 
P- 

Once you grasp the relation between the full state space M and the desym- 
metrized, G-quotiented reduced state space M/G, you will find the life as a funda- 
mentalist so much simpler that you will never return to your full state space ways 
of yesteryear. The reduction to the fundamental domain M = M/G simplifies 
symbolic dynamics and eliminates symmetry-induced degeneracies. For the short 
orbits the labor saving is dramatic. For example, for the 3-disk game of pinball 
there are 256 periodic points of length 8, but reduction to the fundamental domain 
non-degenerate prime cycles reduces this number to 30. In the next chapter con- 
tinuous symmetries will induce relative periodic orbits that never close a periodic 
orbit, and in the chapter 25 they will tile the infinite periodic state space, and re- 
duce calculation of diffusion constant in an infinite domain to a calculation on a 
compact torus. 
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Commentary 

Remark 9.1 Literature. We found Tinkham [ 1] the most enjoyable as a no-nonsense, 
the user friendliest introduction to the basic concepts. Byron and Fuller [ 2], the last chap- 
ter of volume two, offers an introduction even more compact than Tinkham's. For a 
summary of the theory of discrete groups see, for example, ref. [ 3]. Chapter 3 of Rebecca 
Hoyle [4] is a very student-friendly overview of the group theory a nonlinear dynamicist 
might need, with exception of the quotienting, reduction of dynamics to a fundamental do- 
main, which is not discussed at all. We found sites such as en.wildpedia.org/wiki/Quotient_group 
helpful. Curiously, we have not read any of the group theory books that Hoyle recom- 
mends as background reading, which just confirms that there are way too many group 
theory books out there. For example, one that you will not find useful at all is ref. [ 5]. The 
reason is presumably that in the 20th century physics (which motivated much of the work 
on the modern group theory) the focus is on the linear representations used in quantum 
mechanics, crystallography and quantum field theory. We shall need these techniques in 
Chapter 21, where we reduce the linear action of evolution operators to irreducible sub- 
spaces. However, here we are looking at nonlinear dynamics, and the emphasis is on the 
symmetries of orbits, their reduced state space sisters, and the isotypic decomposition of 
their linear stability matrices. 

In ChaosBook we focus on chaotic dynamics, and skirt the theory of bifurcations, 
the landscape between the boredom of regular motions and the thrills of chaos. Chapter 
4 of Rebecca Hoyle [4] is a student-friendly introduction to the treatment of bifurca- 
tions in presence of symmetries, worked out in full detail and generality in monographs 
by Golubitsky, Stewart and Schaeffer [6], Golubitsky and Stewart [7] and Chossat and 
Lauterbach [8]. Term 'stabilizer' is used, for example, by Broer et al. [9] to refer to a 
periodic orbit with Z2 symmetry; they say that the relative or pre-periodic segment is in 
this case called a 'short periodic orbit.' In Efstathiou [ 10] a subgroup of 'short periodic 
orbit' symmetries is referred to as a 'nontrivial isotropy group or stabilizer.' Chap. 8 of 
Govaerts [11] offers a review of numerical methods that employ equivariance with respect 
to compact, and mostly discrete groups, (continued in remark 10.1) 



Remark 9.2 Symmetries of the Lorenz equation: (continued from remark 2.3) Af- 
ter having studied example 9.14 you will appreciate why ChaosBook . org starts out with 
the symmetry-less Rtissler flow (2.18), instead of the better known Lorenz flow (2.13). 
Indeed, getting rid of symmetry was one of Rossler's motivations. He threw the baby 
out with the water; for Lorenz flow dimensionalities of stable/unstable manifolds make 
possible a robust heteroclinic connection absent from Rossler flow, with unstable mani- 
fold of an equilibrium flowing into the stable manifold of another equilibrium. How such 
connections are forced upon us is best grasped by perusing the chapter 13 'Heteroclinic 
tangles' of the inimitable Abraham and Shaw illustrated classic [12]. Their beautiful 
hand-drawn sketches elucidate the origin of heteroclinic connections in the Lorenz flow 
(and its high-dimensional Navier-Stokes relatives) better than any computer simulation. 
Miranda and Stone [13] were first to quotient the C2 symmetry and explicitly construct 
the desymmetrized, 'proto-Lorenz system,' by a nonlinear coordinate transformation into 
the Hilbert-Weyl polynomial basis invariant under the action of the symmetry group [ 14]. 
For in-depth discussion of symmetry-reduced ('images') and symmetry-extended ('cov- 
ers') topology, symbolic dynamics, periodic orbits, invariant polynomial bases etc., of 
Lorenz, Rossler and many other low-dimensional systems there is no better reference 
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than the Gilmore and Letellier monograph [15]. They interpret [16] the proto-Lorenz 
and its 'double cover' Lorenz as 'intensities' being the squares of 'amplitudes,' and call 
quotiented flows such as (Lorenz)/C2 'images.' Our 'doubled-polar angle' visualization 
figure 11.8 is a proto-Lorenz in disguise; we, however, integrate the flow and construct 
Poincare sections and return maps in the original Lorenz [x, y, z] coordinates, without any 
nonlinear coordinate transformations. The Poincare return map figure 1 1.9 is reminiscent 
in shape both of the one given by Lorenz in his original paper, and the one plotted in a 
radial coordinate by Gilmore and Letellier. Nevertheless, it is profoundly different: our 
return maps are from unstable manifold — > itself, and thus intrinsic and coordinate inde- 
pendent. In this we follow ref. [17]. This construction is necessary for high-dimensional 
flows in order to avoid problems such as double-valuedness of return map projections on 
arbitrary 1 -dimensional coordinates encountered already in the Rossler example of fig- 
ure 3.3. More importantly, as we know the embedding of the unstable manifold into the 
full state space, a periodic point of our return map is - regardless of the length of the cycle 
- the periodic point in the full state space, so no additional Newton searches are needed. 
In homage to Lorenz, we note that his return map was already symmetry-reduced: as z 
belongs to the symmetry invariant Fix (G) subspace, one can replace dynamics in the full 
space by z, Z, • • ■■ That is G -invariant by construction [15]. 

Remark 9.3 Examples of systems with discrete symmetries. Almost any flow 
of interest is symmetric in some way or other: the list of examples is endless, we list 
here a handful that we found interesting. One has a C2 symmetry in the Lorenz system 
(remark 2.3), the Ising model, and in the 3-dimensional anisotropic Kepler potential [ 18, 
19, 20], a D4 = C4,, symmetry in quartic oscillators [21, 22], in the pure x 2 y 2 potential [23, 
24] and in hydrogen in a magnetic field [25], and a Dn — C21. = V4 = C2 X C2 symmetry 
in the stadium billiard [26]. A very nice nontrivial desymmetrization is carried out in 
ref. [27]. An example of a system with D3 = C3,, symmetry is provided by the motion of 
a particle in the Henon-Heiles potential [28, 29, 30, 31] 

V(r,8) = + sin(30) . 

Our 3-disk coding is insufficient for this system because of the existence of elliptic islands 
and because the three orbits that run along the symmetry axis cannot be labeled in our 
code. As these orbits run along the boundary of the fundamental domain, they require 
the special treatment. A partial classification of the 67 possible symmetries of solutions 
of the plane Couette flow of example 9.7, and their reduction 5 conjugate classes is given 
in ref. [32]. 
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Exercises 

9.1. Polynomials invariant under discrete operations on 

R 3 . Prove that the {e,cr}, {e,C 1/2 }, {e,P} and 

{e, o", C 1 ^ 2 , P}-invariant polynomial basis and syzygies 
are those listed in example 9.18. 

9.2. G x c G. Prove that the set G x as defined in (9.17) is a 
subgroup of G. 

9.3. Transitivity of conjugation. Assume that g\, gi, gi e 
G and both gi and g2 are conjugate to #3. Prove that g\ 
is conjugate to g2- 

9.4. Isotropy subgroup of gx. Prove that for g e G, x and 

gx have conjugate isotropy subgroups: 

G g x = gG x g~ x 

9.5. D3: symmetries of an equilateral triangle. Consider 
group D3 = C3 V , the symmetry group of an equilateral 
triangle: 



1 




2 3 

(a) List the group elements and the corresponding ge- 
ometric operations 

(b) Find the subgroups of the group D 3 . 

(c) Find the classes of D3 and the number of elements 
in them, guided by the geometric interpretation of 
group elements. Verify your answer using the def- 
inition of a class. 

(d) List the conjugacy classes of subgroups of D3. 
(continued as exer:FractRot) 

9.6. Reduction of 3-disk symbolic dynamics to binary. 

(continued from exercise 1.1) 

(a) Verify that the 3-disk cycles 

{ 12, 1~3, 23}, {1~23, l3~2}, { 12L3 + 2 perms.}, 
{121232 313 + 5 perms.}, {121 323+ 2 perms.}, 

correspond to the fundamental domain cycles 0, 1 , 
01, 001, 01 1, ■ ■ ■ respectively. 

(b) Check the reduction for short cycles in table 12.2 
by drawing them both in the full 3-disk system and 
in the fundamental domain, as in figure 9.9. 



(c) Optional: Can you see how the group elements 
listed in table 12.2 relate irreducible segments to 
the fundamental domain periodic orbits? 

(continued in exercise 12.6) 

9.7. C2-equivariance of Lorenz system. Verify that the 
vector field in Lorenz equations (2.13) 





■ 




cr(y-x) 




x = v(x) 


y 




px-y - xz 


(9.28) 




z 




xy - bz 





is equivariant under the action of cyclic group C2 = 
{e, C 1 / 2 } acting on K 3 by a n rotation about the z axis, 

C 1/2 (x,y,z) = (-x,-y,z), 

as claimed in example 9.5. (continued in exercise 9.8) 

9.8. Lorenz system in polar coordinates: group the- 
ory. Use (B.3), (B.4) to rewrite the Lorenz equa- 
tion (9.28) in polar coordinates (r,9,z), where (x,y) = 
(r cos 0, r sinO). 

1. Show that in the polar coordinates Lorenz flow 
takes form 

r 

r = — ( — o~ - 1 + (cr + p - z) sin 20 

+(1 -cr) cos 20) 
0 = i(-cr+p- z + (cr- l)sin26» 

+ (cr + p - z) COS 20) 

r 2 

z = -bz +—sin20. (9.29) 

2. Argue that the transformation to polar coordinates 
is invertible almost everywhere. Where does the 
inverse not exist? What is group-theoretically spe- 
cial about the subspace on which the inverse not 
exist? 

3. Show that this is the (Lorenz) / C2 quotient map for 
the Lorenz flow, i.e., that it identifies points related 
by the n rotation in the [x, y] plane. 

4. Rewrite (9.28) in the invariant polynomial basis of 
example 9.18 and exercise 9.29. 

5. Show that a periodic orbit of the Lorenz flow in 
polar representation (9.29) is either a periodic or- 
bit or a relative periodic orbit (9.25) of the Lorenz 
flow in the (x, y, z) representation. 



exerDiscrete - 12feb2012 ChaosBook.org versionl4.4.1, Apr 21 2013 



REFERENCES 



175 



By going to polar coordinates we have quotiented out the 
^--rotation (x, y, z) — * (—x, —y, z) symmetry of the Lorenz 
equations, and constructed an explicit representation of 
the desymmetrized Lorenz flow. 



Proto-Lorenz system. Here we quotient out the C 2 
symmetry by constructing an explicit "intensity" repre- 
sentation of the desymmetrized Lorenz flow, following 
Miranda and Stone [13]. 



1. Rewrite the Lorenz equation (2.13) in terms of 
variables 




(w, v, z) = (x 2 - y 2 , 2xy, z) . 



(9.30) 



show that it takes form 



-(cr + \)u + (cr - r)v + (1 - o~)N + 
(r - cr)u - (cr + l)v + (r + cr)N - uz - 
v/2 - bz 



N 



^lu 1 + v 2 . 



2. Show that this is the (Lorenz)/C2 quotient map for 
the Lorenz flow, i.e., that it identifies points related 
by the n rotation (9.20). 

3. Show that (9.30) is invertible. Where does the in- 
verse not exist? 

4. Compute the equilibria of proto-Lorenz and their 
stabilities. Compare with the equilibria of the 
Lorenz flow. 

5. Plot the strange attractor both in the original form 
(2.13) and in the proto-Lorenz form (9.31) 



for the Lorenz parameter values cr - 10, = 8/3, 
p — 28. Topologically, does it resemble more the 
Lorenz, or the Rossler attractor, or neither? (plot 
by J. Halcrow) 

vz 7. ]Show that a periodic orbit of the proto-Lorenz is 
- uN either a periodic orbit or a relative periodic orbit 
of the Lorenz flow. 

(98? 1 Show that if a periodic orbit of the proto-Lorenz 
is also periodic orbit of the Lorenz flow, their Flo- 
quet multipliers are the same. How do the Floquet 
multipliers of relative periodic orbits of the Lorenz 
flow relate to the Floquet multipliers of the proto- 
Lorenz? 

9 What does the volume contraction formula (4.41) 
look like now? Interpret. 

Show that the coordinate change (9.30) is the same 
as rewriting (9.29) in variables 

(w,v) = (r 2 cos 26», r 2 sin 26), 



10. 



i.e., squaring a complex number z 
u + iv. 



x + ly, z 



11. How is (9.31) related to the invariant polynomial 
basis of example 9.18 and exercise 9.29? 
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Chapter 10 

Relativity for cyclists 



Physicists like symmetry more than Nature 
— Rich Kerswell 



What if the laws of motion retain their form for a family of coordinate fra- 
mes related by continuous symmetries? The notion of 'fundamental do- 
main' is of no use here. If the symmetry is continuous, the dynamical 
system should be reduced to a lower-dimensional, desymmetrized system, with 
'ignorable' coordinates eliminated (but not forgotten). 

We shall describe here two ways of reducing a continuous symmetry. In the 
'method of slices' or 'moving frames' of sect. 10.4 we slice the state space in 
such a way that an entire class of symmetry-equivalent points is represented by a 
single point. In the Hilbert polynomial basis approach of sect. 10.5 we replace the 
equivariant dynamics by the dynamics rewritten in terms of invariant coordinates. 
In either approach we retain the option of computing in the original coordinates, 
and then, when done, projecting the solution onto the symmetry reduced state 
space. 

Instead of writing yet another tome on group theory, in what follows we con- 
tinue to serve group theoretic nuggets on need-to-know basis, through a series of 
pedestrian examples (but take a slightly higher, cyclist road in the text proper). 



10.1 Continuous symmetries 



First of all, why worry about continuous symmetries? Here is an example of 
the effect a continuous symmetry has on dynamics (for physics background, see 
remark 10.2). 



Example 10.1 Complex Lorenz flow: Consider a complex generalization of Lorenz 
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Figure 10.1: A typical [x s ,x 2 ,z] trajectory of the com- 
plex Lorenz flow, with a short trajectory of figure 10.4 
whose initial point is close to the relative equilibrium 
TWi superimposed. See also figure 10.7. (R. Wilczak) 




equations (2.13), 

x — -crx + cry , y = (p — z)x - ay 

z = (xy* +x*y)/2-bz, (10.1) 

where x,y are complex variables, z is real, while the parameters cr, b are real and 
p-Px+ip2,a- l-ie are complex. Recast in real variables, this is a set of five coupled 
ODEs 

x\ = — <tx\ + oyi 

X2 = —CTX2 + OJ2 

yi = (pi-z)x l -p 2 x 2 -yi-ey 2 
y>2 = P2X\+{p\-z)x 2 + eyi-y 2 

z = -bz + xiyi + x 2 y2 ■ (10.2) 

In all numerical examples that follow, the parameters will be set to p\ = 28, p2 = 0, b = 
8/3, cr = 10, e = 1/10, unless explicitly stated otherwise. As we shall show in exam- 
ple 1 0. 7, this is a dynamical system with a continuous SO(2) (but no discrete) symmetry. 

Figure 10.1 offers a visualization of its typical long-time dynamics. What is 
wrong with this picture? It is a mess. As we shall show here, the attractor is built up by 
a nice 'stretch & fold' action, but that is totally hidden from the view by the continuous 
symmetry induced drifts. In the rest of this chapter we shall investigate various ways 
of 'quotienting' this SO(2) symmetry, and reducing the dynamics to a 4-dimensional 
reduced state space. We shall not rest until we attain the simplicity of figure 10. 12, and 
the bliss of the 1 -dimensional return map of figure 10.14. 



We shall refer to the component of the dynamics along the continuous sym- 
metry directions as a 'drift.' In a presence of a continuous symmetry an orbit 
explores the manifold swept by combined action of the dynamics and the sym- 
metry induced drifts. Further problems arise when we try to determine whether 
an orbit shadows another orbit (see the figure 13.1 for a sketch of a close pass 
to a periodic orbit), or develop symbolic dynamics (partition the state space, as 
in chapter 11): here a 1-dimensional trajectory is replaced by a (N+ ^-dimens- 
ional 'sausage,' a dimension for each continuous symmetry (N being the total 
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number of parameters specifying the continuous transformation, and ' 1 ' for the 
time parameter t). How are we to measure distances between such objects? In 
this chapter we shall learn here how to develop more illuminating visualizations 
of such flow than figure 10.1, 'quotient' symmetries, and offer computationally 
straightforward methods of reducing the dynamics to lower-dimensional, reduced 
state spaces. The methods should also be applicable to high-dimensional flows, 
such as translationally invariant fluid flows bounded by pipes or planes (see ex- 
ample 10.4). 

But first, a lightning review of the theory of Lie groups. The group-theoretical 
concepts of sect. 9.1 apply to compact continuous groups as well, and will not be 
repeated here. All the group theory that we shall need is in principle contained in 
the Peter- Weyl theorem, and its corollaries: A compact Lie group G is completely 
reducible, its representations are fully reducible, every compact Lie group is a 
closed subgroup of a unitary group U(n) for some n, and every continuous, unitary, 
irreducible representation of a compact Lie group is finite dimensional. 

Example 10.2 Special orthogonal group SO(2) (or S l ) is a group of length- 
preserving rotations in a plane. 'Special' refers to requirement that detg = 1, in con- 
tradistinction to the orthogonal group 0(n) which allows for length-preserving inversions 
through the origin, with detg = -1. A group element can be parameterized by angle <f>, 
with the group multiplication law g(<p')g(<f>) = g(<f>' + <p), and its action on smooth periodic 
functions u(4> + 2n) = u(<f>) generated by 

g(f) = </ T , T = ^. (10.3) 
dip 

Expand the exponential, apply it to a differentiable function u((p), and you will recognize 
a Taylor series. So g{<p') shifts the coordinate by <p', g(<f>') u(<f>) = u(<f>' + 4>) . 



Example 10.3 Translation group: Differential operator T in ( 10.3) is reminiscent 
of the generator of spatial translations. The 'constant velocity field' v(x) = v = c ■ 
T acts on xj by replacing it by the velocity vector cj. It is easy to verify by Taylor 
expanding a function u(x) that the time evolution is nothing but a coordinate translation 
by (time x velocity): 

e~ TcT u(x) = e~ TC ^u(x) = u(x-tc). (10.4) 

As x is a point in the Euclidean R d space, the group is not compact. In general, a 
sequence of time steps in time evolution always forms an abelian Lie group, albeit 
never as trivial as this free ballistic motion. 

If the group actions consist ofN rotations which commute, for example act on 
an N -dimensional cell with periodic boundary conditions, the group is an abelian group 
that acts on a torus T N . 



Example 10.4 Continuous symmetries of the plane Couette flow. (continued 
from example 9. 7) The plane Couette flow is a Navier-Stokes flow bounded by two 
countermoving planes, in a cell periodic in streamwise and spanwise directions. Every 
solution of Navier-Stokes equations belongs, by the SO(2) x SO(2) symmetry, to a 2- 
torus T 2 of equivalent solutions. Furthermore these tori are interrelated by a discrete D 2 
group of spanwise and streamwise flips of the flow cell, (continued in example 10.10) 
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Figure 10.2: (a) The group orbit Atv(O) of state 
space point x(Q), and the group orbit M X (t) reached 
by the trajectory x(t) time t later. As any point on 
the manifold M x u) is physically equivalent to any 
other, the state space is foliated into the union of 
group orbits, (b) Symmetry reduction M — > M 
replaces each full state space group orbit M x by a 
single point x e M. 




Let G be a group, and gM — > At a group action on the state space At. The 
[dxd] matrices g acting on vectors in the <i-dimensional state space At form a 
linear representation of the group G. If the action of every element g of a group G 
commutes with the flow 

gv(x) = v{gx) , gf(x) = f(gx) , (10.5) 

G is a symmetry of the dynamics, and, as in (9.7), the dynamics is said to be 
invariant under G, or G-equivariant. 

In order to explore the implications of equivariance for the solutions of dyn- 
amical equations, we start by examining the way a compact Lie group acts on state 
space At. For any x € At, the group orbit AL of x is the set of all group actions 
(see page 160 and figure 10.2) 

M x = {gx\geG\. (10.6) 

As we saw in example 10.3, the time evolution itself is a noncompact 1- 
parameter Lie group. Thus the time evolution and the continuous symmetries 
can be considered on the same Lie group footing. For a given state space point 
x a symmetry group of N continuous transformations together with the evolution 
in time sweeps out, in general, a smooth (Af+l)-dimensional manifold of equiv- 
alent solutions (if the solution has a nontrivial symmetry, the manifold may have 
a dimension less than N + 1). For solutions p for which the group orbit of Xp is 
periodic in time T p , the group orbit sweeps out a compact invariant manifold AL. 
The simplest example is the N = 0, no symmetry case, where the invariant mani- 
fold At p is the 1 -torus traced out by a periodic trajectory p. If At is a smooth C° 
manifold, and G is compact and acts smoothly on At, the reduced state space can 
be realized as a 'stratified manifold,' meaning that each group orbit (a 'stratum') 
is represented by a point in the reduced state space, see sect. 10.4. Generalizing 
the description of a non-wandering set of sect. 2.1.1, we say that for flows with 
continuous symmetries the non-wandering set Q. of dynamics (2.3) is the closure 
of the set of compact invariant manifolds AL. Without symmetries, we visualize 
the non-wandering set as a set of points; in presence of a continuous symmetry, 
each such 'point' is a group orbit. 



continuous - 15june2012 



ChaosBook.org version 14.4. 1 , Apr 212013 



CHAPTER 10. RELATIVITY FOR CYCLISTS 



182 



10.1.1 Lie groups for pedestrians 



[...] which is an expression of consecration of 'angular 
momentum.' 

— Mason A. Porter's student 



Definition: A Lie group is a topological group G such that (i) G has the struc- 
ture of a smooth differential manifold, and (ii) the composition map GxG — > G : 
(g,h) — > gh is smooth, i.e., C°° differentiable. 

Do not be mystified by this definition. Mathematicians also have to make 
a living. Historically, the theory of compact Lie groups that we will deploy here 
emerged as a generalization of the theory of SO(2) rotations, i.e., Fourier analysis. 
By a 'smooth differential manifold' one means objects like the circle of angles that 
parameterize continuous rotations in a plane, example 10.2, or the manifold swept 
by the three Euler angles that parameterize SO(3) rotations. 

An element of a compact Lie group continuously connected to identity can be 
written as 

g{cj)) = e^ , 0-T = £^ T «' «= L2,---,iV, (10.7) 

where <p • T is a Lie algebra element, and <p a are the parameters of the transforma- 
tion. Repeated indices are summed throughout this chapter, and the dot product 
refers to a sum over Lie algebra generators. The Euclidian product of two vectors 
x,y will be indicated by ^-transpose times y, i.e., xfy - J^f x{yi. Unitary trans- 
formations exp(0 • T) are generated by sequences of infinitesimal steps of form 



g(64>) * 1 + 8<p ■ T , 5<p e R N , \8<f>\ « 1 , (10.8) 

where T a , the generators of infinitesimal transformations, are a set of linearly 
independent [dxd] anti-hermitian matrices, (T a )^ = -T a , acting linearly on the 
cf-dimensional state space At. In order to streamline the exposition, we postpone 
discussion of combining continuous coordinate transformations with the discrete 
ones to sect. 10.2.1. . 

For continuous groups the Lie algebra, i.e., the set of N generators T a of 
infinitesimal transformations, takes the role that the \G\ group elements play in the 
theory of discrete groups. The flow field at the state space point x induced by the 
action of the group is given by the set of N tangent fields 

t a (x) i = (T a ) ij Xj, (10.9) 



exercise 10.2 
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which span the tangent space. Any representation of a compact Lie group G is 
fully reducible, and invariant tensors constructed by contractions of % are useful 
for identifying irreducible representations. The simplest such invariant is 

T T ■ T = ^ Cf ] l (a) , (10.10) 



where CT is the quadratic Casimir for irreducible representation labeled a, and 
1^ is the identity on the a-irreducible subspace, 0 elsewhere. The dot product of 
two tangent fields is thus a sum weighted by Casimirs, 

t(x) T ■t{x') = Y,C { fx i 5f j ) x' j . (10.11) 



Example 10.5 SO(2) irreducible representations: (continued from example 10.2) Ex- 
pand a smooth periodic function u(<p + 2n) = u((f>) as a Fourier series 

CO 

u(cp) — qq + 2^ ( a m cos nop + b m sin nop) . (10. 12) 



The matrix representation of the SO(2) action ( 10.3) on the mth Fourier coefficient pair 

(a m , b m ) is 



g (m \(b') = t (10.13) 

6 KY ' 1 - sin m(f> cos m<p f ' v ' 

with the Lie group generator 

T- = ( I J ) . (10.14) 

The S(X2) group tangent ( 10.9) to state space point u(<f>) on the mth invariant subspace 
is 

f(m) W = m ( -a m ) ' (10 - 15) 
The Lr norm of t(u) is weighted by the SO(2) quadratic Casimir (10.1 0), Cf ] = m 2 , 

<£ ^ (Tu(4») T Tu(2t: - 0) = J m 2 (a 2 m + b 2 ,) , (10. 16) 

m=l 

and converges only for sufficiently smooth u(<p). What does that mean? We saw in 
(10.4) that T generates translations, and by (10.14) the velocity of the mth Fourier 
mode is m times higher than for the m = 1 component. If \u (m) \ does not fall off faster 
the 1 /m, the action of SO(2) is overwhelmed by the high Fourier modes. 
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Example 10.6 SO(2) rotations for complex Lorenz equations: 

Lie algebra generator 



0 


1 


0 


0 


0 


-1 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


-1 


0 


0 


0 


0 


0 


0 


0 



Substituting the 



(10.17) 



acting on a 5-dimensional space ( 10.2) into (10.7) yields a finite angle SO(2) rotation: 



8(<P) 



' COS0 


sin0 


0 


0 


0 


- sirup 


COS (f> 


0 


0 


0 


0 


0 


cos <p 


sin0 


0 


0 


0 


— sin0 


cos <p 


0 


I 0 


0 


0 


0 


1 



(10.18) 



From (10.13) we see that the action of SO(2) on the complex Lorenz equations state 
space decomposes into m = 0 G-invariant subspace (z-axis) and m = 1 subspace with 
multiplicity 2. 

The generator T is indeed anti-hermitian, T 1 " = -T, and the group is compact, 
its elements parametrized by <p mod 2n. Locally, atx e M, the infinitesimal action of the 
group is given by the group tangent field t(x) = Tx = (x 2 ,-x l ,y 2 ,-y\,0). In other words, 
the flow induced by the group action is normal to the radial direction in the (x\, x 2 ) and 
(y\,yi) planes, while the z-axis is left invariant. 



fast track: 

sect. 10.2, p. 187 



10.1.2 Lie groups for cyclists 



Henriette Roux: "Why do you devote to Lie groups only 
a page, while only a book-length monograph can do it 
justice?" A: "ChaosBook tries its utmost to minimize 
the Gruppenpest jargon damage, which is a total turnoff 
to our intended audience of working plumbers and elec- 
tricians. The sufferings of our master plumber Fabian 
Waleffe while reading chapter 9 - World in a mirror are 
chicken feed in comparison to the continuous symmetry 
reduction nightmare that we embark upon here." 



All the group theory that we shall need is in principle contained in the Peter- Weyl 
theorem, and its corollaries: A compact Lie group G is completely reducible, its 
representations are fully reducible, every compact Lie group is a closed subgroup 
of a unitary group U(n) for some n, and every continuous, unitary, irreducible 
representation of a compact Lie group is finite dimensional. 

Here comes all of the theory of Lie groups in one quick serving. You will live 
even if you do not digest this section, or, to spell it out; skip this section unless 
you already know the theory of Lie algebras. 



appendix 
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The [d x d] matrices g acting on vectors in the state space M form a linear 
representation of the group G. Tensors transform as 

h' ij k = gfg/g k k'hi>/. (10.19) 

A multilinear function h(q,r, . . . , s) is an invariant function if (and only if) for any 
transformation g e G and for any set of vectors q,r,s,... it is unchanged by the 
coordinate transformation 

h(gq,gr, ...gs) = h(q,?, ...,s) = Kb- " c q a r b • • • s c . (10.20) 

Examples of such invariant functions are the length r(xf = 6 J .x'xj and the volume 
V(x,y,z) = e'^xiyjZk- Substitute the infinitesimal form of group action (10.8) into 
(10.19), keep the linear terms. In the index-notation longhand, the Lie algebra 
generator acts on each index separately, 

(T (( )f //,.*• + (TJ-f/i. - (TJ* ft,. j*~ + . . . = 0 . (10.21) 

Hence the tensor ft. . is invariant if T a h - 0, i.e., the N generators T a 'annihi- 
late' it. 

As one does not want the symmetry rules to change at every step, the genera- 
tors T a , a - 1, 2, ... ,N, are themselves invariant tensors: 

(ya) i j = g i i>g/gaa'(Ja>yj,, (10.22) 

where g a b - [e~"^' c j is the adjoint [A^xAH matrix representation of g e G. The 
[dxd] matrices T a are. in general non-commuting, and from (10.21) it follows that 
they close A7-element Lie algebra 

[T a , T b ] = T a T h - T h T a = -C a bcT c , a,b,c = 1,2, ...,N , 

where the fully antisymmetric adjoint representation hermitian generators 

[C c ] a f> — C a bc ~ ~Cbac ~ ~C a cb 

are the structure constants of the Lie algebra. 

As we will not use non-abelian Lie groups in this chapter, we omit the deriva- 
tion of the Jacobi relation between C a & c 's, and you can safely ignore all this talk of 
tensors and Lie algebra commutators as far as the pedestrian applications at hand 
are concerned. 
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10.1.3 Equivariance under infinitesimal transformations 

A flow x = v(x) is G-equi variant (10.5), if symmetry transformations commute 
with time evolutions 

v(x) = g' Y v(g x) , for all g e G . (10.23) 

For an infinitesimal transformation (10.8) the G-equivariance condition becomes 

v(x) = (1 - 0 • T) v(x + d> ■ Tx) + • • • = v(x) - </> ■ Tv(x) + ^ <f> ■ Tx + ■ ■ ■ . 

ax 

The v(x) cancel, and cp a are arbitrary. Denote the group flow tangent field at x by 
t a (x)i - (T a ),yXj. Thus the infinitesimal, Lie algebra G-equivariance condition is 



exercise 10.4 
exercise 10.5 



t a (v)-A(x)t a (x) = 0, 



(10.24) 



where A = dv/dx is the stability matrix (4.3). If case you find such learned 
remarks helpful: the left-hand side of (10.24) is the Lie derivative of the dynamical 
flow field v along the direction of the infinitesimal group-rotation induced flow 



T fl - ^(T a x) 



(10.25) 



y=x 



exercise 10.6 

The equivariance condition (10.24) states that the two flows, one induced by the exercise 10.7 
dynamical vector field v, and the other by the group tangent field t, commute if exercise 10.12 
their Lie derivatives (or the 'Lie brackets ' or 'Poisson brackets') vanish. 

Example 10.7 Equivariance of complex Lorenz flow: That complex Lorenz flow 
(10.2) is equivahant under SO(2) rotations (10.18) can be checked by substituting the 
Lie algebra generator (10.17) and the stability matrix (4.3) for complex Lorenz flow 
(10.2), 



(10.26) 



into the equivariance condition ( 10.24). Considering that f(v) depends on the full set of 
equations ( 1 0.2), andA(x) is only its linearization, this is not an entirely trivial statement. 
For the parameter values (10.2) the flow is strongly volume contracting (4.29), 



—cr 


0 


o~ 


0 


0 i 


0 


—cr 


0 


cr 


0 


Pi -z 


"P2 


-1 


—e 


-Xl 


P2 


Pi -z 


e 


-1 


-X2 


y\ 


yi 


X] 


X2 


-b ) 



d iVi = Ai(x, t) = -b-2(cr+l) = -24 - 2/3 , 



(10.27) 



at a coordinate-, p- and e-independent constant rate. 



Checking equivariance as a Lie algebra condition (10.24) is easier than checking 
it for global, finite angle rotations (10.23). 
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10.2 Symmetries of solutions 



Let v(x) be the dynamical flow, and f the trajectory or 'time-r forward map' of 
an initial point xq, 

dx r T 

-j- = v(x), x(T) = f T (x 0 ) = x 0 + dr'v{x{T')). (10.28) 
dt Jo 

As discussed in sect. 9.2, solutions x(t) of an equivariant system can satisfy all 
of the system's symmetries, a subgroup of them, or have no symmetry at all. For 
a given solution x(t), the subgroup that contains all symmetries that fix x (that 
satisfy gx = x) is called the isotropy (or stabilizer) subgroup of x. A generic 
ergodic trajectory x(t) has no symmetry beyond the identity, so its isotropy group 
is {e}, but recurrent solutions often do. At the other extreme is equilibrium, whose 
isotropy group is the full symmetry group G. 

The simplest solutions are the equilibria or steady solutions (2.9). 

Definition: Equilibrium xeq - Meq is a fixed, time-invariant solution, 

v(x E q) = 0, 

x(x E q,t) = x EQ + J dx v(x(t')) = x E q . (10.29) 
Jo 



An equilibrium with full symmetry, 
g x E q = x E q for all g e G , 



exercise 10.13 



lies, by definition, in Fix(G) subspace subspace, for example the x^ axis in fig- 
ure 10.3 (a). The multiplicity of such solution is one. An equilibrium xeq with 
symmetry Geq smaller than the full group G belongs to a group orbit G/Geq- 
If G is finite there are |G|/|G£g| equilibria in the group orbit, and if G is contin- g X g r cjgg 10 14 
uous then the group orbit of x is a continuous family of equilibria of dimension 
dim G - dim Geq- For example, if the angular velocity c in figure 10.3 (b) equals 
zero, the group orbit consists of a circle of (dynamically static) equivalent equi- 
libria. 



Definition: Relative equilibrium solution xtw(t) e Mtw' me dynamical flow 
field points along the group tangent field, with constant 'angular' velocity c, and 
the trajectory stays on the group orbit, see figure 10.3 (a): 

v(x) = c • t{x) , x e Mtw 

x{t) = g(-Tc)x{0) = e- rc - T x{0). (10.30) exercise 10.22 

exercise 10.26 

continuous - 15jime2012 ChaosBook.org version!4.4.1, Apr 21 2013 exercise 10.27 
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Figure 10.3: (a) A relative equilibrium orbit starts out 
at some point x(0), with the dynamical flow field v(x) = 
c ■ t(x) pointing along the group tangent space. For the 
SO(2) symmetry depicted here, the flow traces out the 
group orbit of x(0) in time T = 2n/c. (b) An equilib- 
rium lives either in the fixed Fix (G) subspace (jc 3 axis 
in this sketch), or on a group orbit as the one depicted 
here, but with zero angular velocity c. In that case the 
circle (in general, A'-torus) depicts a continuous family 
of fixed equilibria, related only by the group action. 



Figure 10.4: ( z] plot of the complex Lorenz 
flow with initial point close to TWi. In figure 10.1 this 
relative equilibrium is superimposed over the strange 
attractor. (R. Wilczak) 



x, t 



V = c g(x)t 



x(x)= g(x)x(0) 




A traveling wave 

x(t) = g(-cr) xtw = x tw - c t , ceR d (10.31) 



remark 10.3 



is a special type of a relative equilibrium of equivariant evolution equations, where 
the action is given by translation (10.4), g(y) x(0) = x(0) + y . A rotating wave is 
another special case of relative equilibrium, with the action is given by angular 
rotation. By equivariance, all points on the group orbit are equivalent, the mag- 
nitude of the velocity c is same everywhere along the orbit, so a 'traveling wave' 
moves at a constant speed. For an N > 1 trajectory traces out a line within the 
group orbit. As the c a components are generically not in rational ratios, the tra- 
jectory explores the TV-dimensional group orbit (10.6) quasi-periodically. In other 
words, the group orbit g(r) x(0) coincides with the dynamical orbit x(t) e AVw 
and is thus flow invariant. 



Example 10.8 A relative equilibrium: For complex Lorenz equations and our 
canonical parameter values of (10.2) a computation yields the relative equilibrium TW\ 
with a representative group orbit point 

(jCi,x 2 ,yi,0,zWi = (8.48492,0.0771356,8.48562,0,26.9999), (10.32) 

and angular velocity c TW i = 1/11. This corresponds to period T TWl = 2njc « 69, so 
a simulation has to be run up to time of order of at least 70 for the strange attractor in 
figure 10.1 to start filling in. 

Figure 10.4 shows the complex Lorenz flow with the initial point (10.32) on the 
relative equilibrium TW\ . It starts out by drifting in a circle around the z-axis, but as the 
numerical errors accumulate, the trajectory spirals out. 
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Figure 10.5: A periodic orbit starts out at x(0) with the 
dynamical v and group tangent t flows pointing in dif- 
ferent directions, and returns after time T p to the initial 
point x(0) = x(T p ). The group orbit of the temporal 
orbit of a(0) sweeps out a (1 + AO-dimensional torus, a 
continuous family of equivalent periodic orbits, two of 
which are sketched here. For SO(2) this is topologi- 




cally a 2-torus. 



Calculation of the relative equilibrium stability reveals that it is spiral-out un- 
stable, with the very short period T spiral = 0.6163. This is the typical time scale for 
fast oscillations visible in figure 10.1, with some 100 turns for one circumambulation 
of the TWi orbit. In that time an initial deviation from x T wi 'S multiplied by the factor 
Kadiai ~ 500. It would be sweet if we could eliminate the drift time scale ~ 70 and focus 
just on the fast time scale of « 0.6. That we will attain by reformulating the dynamics in 
a reduced state space. 



Definition: Periodic orbit. Let x be a periodic point on the periodic orbit p of 
period T, 

f T (x) = X, X £ M p . 

By equivariance, g x is another periodic point, with the orbits of x and gx either 
identical or disjoint. 

If gx lands on the same orbit, g is an element of periodic orbit's symmetry 
group G p . If the symmetry group is the full group G, we are back to (10.30), 
i.e., the periodic orbit is the group orbit traced out by a relative equilibrium. The 
other option is that the isotropy group is discrete, the orbit segment {jc,^^} is pre- 
periodic (or eventually periodic), x(0) = g p x(T p ), where T p is a fraction of the 
full period, T p = T/m, and thus 



If the periodic solutions are disjoint, as in figure 10.5, their multiplicity (if G 
is finite, see sect. 9.1), or the dimension of the manifold swept under the group 
action (if G is continuous) can be determined by applications of g € G. They form 
a family of conjugate solutions (9.19), 



x(0) = gpx(T p ), xeMp, 
x(0) = g n p l x{mT p ) = x{T) = x{0). 



8 P e G p 



(10.33) 



Mgp = gM p g 



-i 



(10.34) 
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Figure 10.6: A relative periodic orbit starts out at x(0) 
with the dynamical v and group tangent t flows point- 
ing in different directions, and returns to the group or- 
bit of x(0) after time T p at x(T p ) = g p x(0), a rotation of 
the initial point by g p , For flows with continuous sym- 
metry a generic relative periodic orbit (not pre-periodic 
to a periodic orbit) fills out ergodically what is topo- 
logically a torus, as in figure 10.5; if you are able to 
draw such a thing, kindly send us the figure. As il- 
lustrated by figure 10.8(a) this might be a project for 
Lucas Films. 




x(T p ) 



Definition: Relative periodic orbit p is an orbit M p in state space M which 
exactly recurs 



at a fixed relative period T p , but shifted by a fixed group action g p which brings 
the endpoint x p (T p ) back into the initial point x p (0), see figure 10.6. The group 
action g p parameters (p = ((f>\, (fa, • • • <Pn) are referred to as "phases," or "shifts." 
In contrast to the pre-periodic (10.33), the phase here are irrational, and the tra- 
jectory sweeps out ergodically the group orbit without ever closing into a periodic 
orbit. For dynamical systems with only continuous (no discrete) symmetries, the 
parameters {t, 0i, • • • , 4>n) are real numbers, ratios n/cf>j are almost never rational, 
likelihood of finding a periodic orbit for such system is zero, and such relative 
periodic orbits are almost never eventually periodic. 

Relative periodic orbits are to periodic solutions what relative equilibria (trav- 
eling waves) are to equilibria (steady solutions). Equilibria satisfy f(x) - x — 0 
and relative equilibria satisfy fix) - gif) x = 0 for any t. In a co-moving frame, 
i.e., frame moving along the group orbit with velocity v(x) = c ■ t{x), the relative 
equilibrium appears as an equilibrium. Similarly, a relative periodic orbit is peri- 
odic in its mean velocity c p = <p p IT p co-moving frame (see figure 10.8), but in the 
stationary frame its trajectory is quasiperiodic. A co-moving frame is helpful in 
visualizing a single 'relative' orbit, but useless for viewing collections of orbits, 
as each one drifts with its own angular velocity. Visualization of all relative peri- 
odic orbits as periodic orbits we attain only by global symmetry reductions, to be 
undertaken in sect. 10.4. 

Example 10.9 Complex Lorenz flow with relative periodic orbit: Figure 1 0. 7 is 
a group portrait of the complex Lorenz equations state space dynamics, with several 
important players posing against a generic orbit in the background. 

The unstable manifold of relative equilibrium TWi is characterized by a 2- 
dimensional complex eigenvector pair, so its group orbit is a 3-dimensional. Only one 
representative trajectory on it is plotted in the figure. The unstable manifold of equi- 
librium EQo has one expanding eigenvalue, but its group orbit is a cone originating at 
EQ 0 . Only one representative trajectory on this cone is shown in the figure. It lands 
close to TWi, and then_spirals out along its unstable manifold. 3 repetitions of a short 
relative periodic orbit 01 are drawn. The trajectory fills out ergodically a 2-dimensional 



x p (0) = gpX p (T p ) , 



x p (t) e M 



■p > 



(10.35) 
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Figure 10.7: (Figure 10.1 continued) A group portrait 
of the complex Lorenz equations state space dynamics. 
Plotted are relative equilibrium TWi (red), its unsta- 
ble manifold (brown), equilibrium EQo, one trajectory 
from the group orbit of its unstable manifold (green), 3 
repetitions of relative periodic orbit 01 (magenta) and 



a generic orbit (blue). 



(E. Siminos) 




Figure 10.8: A relative periodic orbit of 
Kuramoto-Sivashinsky flow projected on (a) the 
stationary state space coordinate frame jvi, V2, V3), 
traced for four periods T p ; (b) the co-moving 
{vi,V2,V3l coordinate frame, moving with the 
mean angular velocity c p = <f> p /T p . (from ref. [1]) 



v 3 



(a) 




orbit Moi ■ The assignment of its symbolic dynamics label will be possible only after the 
symmetry reduction, see figure 10.14 and figure 11.9. 



10.2.1 Discrete and continuous symmetries together 

We expect to see relative periodic orbits because a trajectory that starts on and 
returns to a given torus of a symmetry equivalent solutions is unlikely to intersect 
it at the initial point, unless forced to do so by a discrete symmetry. This we 
will make explicit in sect. 10.4, where relative periodic orbits will be viewed as 
periodic orbits of the reduced dynamics. 

If, in addition to a continuous symmetry, one has a discrete symmetry which is 
not its subgroup, one does expect equilibria and periodic orbits. However, a relati- 
ve periodic orbit can be pre-periodic if it is equivariant under a discrete symmetry, 
as in (10.33): If g m = 1 is of finite order m, then the corresponding orbit is periodic 
with period mT p . If g is not of a finite order, a relative periodic orbit is periodic 
only after a shift by g p , as in (10.35). Morally, as it will be shown in chapter 21, 
such orbit is the true 'prime' orbit, i.e., the shortest segment that under action of 
G tiles the entire invariant submanifold h\. 



Definition: Relative orbit Mgx in state space M is the time evolved group 
orbit M x of a state space point x, the set of all points that can be reached from x 
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by all symmetry group actions and evolution of each in time. 

M m = {gxt:teR,geG) . (10.36) 

In presence of symmetry, an equilibrium is the set of all equilibria related by 
symmetries, an relative periodic orbit is the hyper-surface traced by a trajectory in 
time T and all group actions, etc.. 



Example 10.10 Relative orbits in the plane Couette flow. (continued from 

example 10.4) Translational symmetry allows for relative equilibria (traveling waves), 
characterized by a fixed profile Eulerian velocity u TW (x) moving with constant velocity 
c, i.e. 

u(x, t) = utw( x - ct) . (10.37) 

As the plane Couette flow is bounded by two counter-moving planes, it is easy to see 
where the relative equilibrium (traveling wave) solutions come from. A relative equi- 
librium solution hugs close to one of the walls and drifts with it with constant velocity, 
slower than the wall, while maintaining its shape. A relative periodic solution is a solu- 
tion that recurs at time T p with exactly the same disposition of the Eulerian velocity fields 
over the entire cell, but shifted by a 2-dimensional (streamwise,spanwise) translation 
g p . By discrete symmetries these solutions come in counter-traveling pairs u q (x - ct), 
-u q (-x + ct): for example, for each one drifting along with the upper wall, there is a 
counter-moving one drifting along with the lower wall. Discrete symmetries also imply 
existence of strictly stationary solutions, or 'standing waves.' For example, a solution 
with velocity fields antisymmetric under reflection through the midplane has equal flow 
velocities in opposite directions, and is thus an equilibrium stationary in time. 

chapter 21 



10.3 Stability 



A spatial derivative of the equivariance condition (10.5) yields the matrix equiv- 
ariance condition satisfied by the stability matrix (stated here both for the finite 
group actions, and for the infinitesimal, Lie algebra generators): 



OA 

gA(x)g~ 1 = A(gx) , [T fl , A] = —t a (x) . (10.38) 

ox 



For a flow within the fixed Fix(G) subspace, t(x) vanishes, and the symmetry 
imposes strong conditions on the perturbations out of the Fix(G) subspace. As 
in this subspace stability matrix A commutes with the Lie algebra generators T, 
the spectrum of its eigenvalues and eigenvectors is decomposed into irreducible 
representations of the symmetry group. This we have already observed for the 
EQo of the Lorenz flow in example 9.14. 

A infinitesimal symmetry group transformation maps the initial and the end 
point of a finite trajectory into a nearby, slightly rotated equivalent points, so we 



fa 



exercise 10.28 
exercise 10.29 
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expect the perturbations along to group orbit to be marginal, with unit eigenvalues. 
The argument is akin to (4.8), the proof of marginality of perturbations along a pe- 
riodic orbit. Consider two nearby initial points separated by an A^-dimensional in- 
finitesimal group transformation (10.8): 5xq = g{6cf))xQ- xq = dcp-Txo = S(p-t(xo)- 
By the commutativity of the group with the flow, g{5<p)f{xo) - f T {g{8<p)xQ). Ex- 
panding both sides, keeping the leading term in 8<f>, and using the definition of the 
Jacobian matrix (4.5), we observe that ^(xq) transports the A^-dimensional group 
tangent space at x(0) to the rotated tangent space at x(t) at time r: 

t a (r) = J T (x 0 ) t a (0) , t a (r) = T a x(t) . ( 10.39) 

For a relative periodic orbit, g p x(T p ) = x(0), at any point along cycle p the group 
tangent vector t a (r) is an eigenvector of the Jacobian matrix with an eigenvalue of 
unit magnitude, 

J p t a (x) = t a (x) , J p (x) = g p J T "(x) , xeMp. (10.40) 

For a relative equilibrium flow and group tangent vectors coincide, v = c • t(x) . 
Dotting by the velocity c (i.e., summing over c a t a ) the equivariance condition 
(10.24), t a (y) - A(x) t a {x) = 0, we get 

(c-T-A)v = 0. (10.41) 

In other words, in the co-rotating frame the eigenvalues corresponding to group 
tangent are marginal, and the velocity v is the corresponding right eigenvector. 

Two successive points along the cycle separated by 6xo = Sep ■ t(r) have the 
same separation after a completed period 6x(T p ) = g p 8xo, hence eigenvalue of 
magnitude 1. In presence of an A^-dimensional Lie symmetry group, ,/V eigenval- 
ues equal unity. 



10.4 Reduced state space 

Maybe when I'm done with grad school I'll be able to fig- 
ure it all out . . . 

— Rebecca Wilczak, undergraduate 

Given Lie group G acting smoothly on a C°° manifold At, we can think of 
each group orbit as an equivalence class. Symmetry reduction is the identification 
of a unique point on a group orbit as the representative of its equivalence class. 
We call the set of all such group orbit representatives the reduced state space 
M/G. In the literature this space is often rediscovered, and thus has many names 
- it is alternatively called 'desymmetrized state space,' 'symmetry -reduced space,' 
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Figure 10.9: A point x on the full state space trajectory 
x(t) is equivalent up to a group rotation g(t) to the point 
x on the curve x(t) if the two points belong to the same 
group orbit M x , see (10.6). 




'orbit space' (because every group orbit in the original space is mapped to a sin- 
gle point in the orbit space), or 'quotient space' (because the symmetry has been 
'divided out'), obtained by mapping equivariant dynamics to invariant dynamics 
('image') by methods such as 'moving frames,' 'cross sections,' 'slices,' 'freez- ^ 
ing,' 'Hilbert bases,' 'quotienting,' 'lowering of the degree,' 'lowering the order,' 
or 'desymmetrization.' 

Symmetry reduction replaces a dynamical system (At,/) with a symmetry by 
a 'desymmetrized' system (At, /), a system where each group orbit is replaced 
by a point, and the action of the group is trivial, gx. = x for all x. e At, g e G. 
The reduced state space At is sometimes called the 'quotient space' M/G because 
the symmetry has been 'divided out.' For a discrete symmetry, the reduced state 
space M/G is given by the fundamental domain of sect. 9.4. In presence of a 
continuous symmetry, the reduction to M/G amounts to a change of coordinates 
where the 'ignorable angles' {<p\, • ■ ■ , (p^) that parameterize N group translations 
can be separated out. 

We start our discussion of symmetry reduction by considering the finite-rotations 
method of moving frames, and its differential formulation, the method of slices. 

10.4.1 Go with the flow: method of moving frames 

The idea: We can, at least locally, map each point along any solution x(t) to the 
unique representative x(t) of the associated group orbit equivalence class, by a 
suitable rotation 



Equivariance implies the two points are equivalent. In the 'method of slices' the 
reduced state space representative x. of a group orbit equivalence class is picked 
by slicing across the group orbits by a fixed hypersurface. We start by describing 
how the method works for a finite segment of the full state space trajectory. 



X(j) = g(T) X(T) . 



(10.42) 
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Figure 10.10: Slice M is a hyperplane (10.43) pass- 
ing through the slice-fixing point X ', and normal to the 
group tangent f at x'. It intersects all group orbits (in- 
dicated by dotted lines here) in an open neighborhood 
of Jc\ The full state space trajectory x(t) and the re- 
duced state space trajectory x(t) belong to the same 
group orbit M X ^ T ) and are equivalent up to a group ro- 
tation g(r), defined in (10.42). 



/At 



\g(t) 
V 



x(t) 



x(0) 



Definition: Slice. Let G act regularly on a J-dimensional manifold At, i.e., with 
all group orbits A-dimensional. A slice through point j? is a (d- AO-dimensional 
submanifold At such that all group orbits in an open neighborhood of the slice- 
defining point % intersect At transversally and only once (see figure 10.10). 

The simplest slice condition defines a linear slice as a (d - AO-dimensional 
hyperplane At normal to the N group rotation tangents f a at point x': 



(x-x) t a 



0, 



(10.43) 



In other words, 'slice' is a Poincare section (3.6) for group orbits. Each 'big 
circle' -group orbit tangent to i a - intersects the hyperplane exactly twice, with 
the two solutions separated by n. As for a Poincare section (3.4), we add an 
orientation condition, and select the intersection with the clockwise rotation angle 
into the slice. 



Definition: Moving frame. Assume that for a given x e At and a given slice 
At there exists a unique group element g = g(x) that rotates x into the slice, 
gx = x. € At. The map that associates to a state space point x a Lie group action 
g(x) is called a moving frame . 

As (x') T t' a = 0 by the antisymmetry of T a , the slice condition (10.43) fixes (p 
for a given x by 



exercise B.1 
exercise 10.30 



0 = x T t' a = x T g{<p) T t a , 



(10.44) 



where g T denotes the transpose of g. The method of moving frames can be 
interpreted as a change of variables 



x(t) = g (r) x(t) , 



(10.45) 



that is passing to a frame of reference in which condition (10.44) is identically 
satisfied, see example 10.11. Therefore the name 'moving frame.' Method of 
moving frames should not be confused with the co-moving frames, such as the 
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one illustrated in figure 10.8. Each relative periodic orbit has its own co-moving 
frame. In the method of moving frames (or the method of slices) one fixes a 
stationary slice, and rotates all solutions back into the slice. 

The method of moving frames is a post-processing method; trajectories are 
computed in the full state space, then rotated into the slice whenever desired, with 
the slice condition easily implemented. The slice group tangent t is a given vec- 
tor, and g(<p) x is another vector, linear in x and a function of group parameters <p. 
Rotation parameters <f> are determined numerically, by a Newton method, through 
the slice condition (10.44). 

Figure 10.11 illustrates the method of moving frames for an SO(2) slice nor- 
mal to the X2 axis. Looks innocent, except there is nothing to prevent a trajectory 
from going through (x\ , X2) = (0, 0), and what <p is one to use then? We can always 
chose a finite time step that hops over this singularity, but in the continuous time 
formulation we will not be so lucky. 

How does one pick a slice point xf? A generic point % not in an invariant 
subspace (on the complex Lorenz equations z axis, for example) should suffice 
to fix a slice. The rules of thumb are much like the ones for picking Poincare 
sections, sect. 3.1.2. The intuitive idea is perhaps best visualized in the context 
of fluid flows. Suppose the flow exhibits an unstable coherent structure that - 
approximately- recurs often at different spatial dispositions. One can fit a 'tem- 
plate' to one recurrence of such structure, and describe other recurrences as its 
translations. A well chosen slice point belongs to such dynamically important 
equivalence class (i.e., group orbit). A slice is locally isomorphic to M/G, in an 
open neighborhood of x / . As is the case for the dynamical Poincare sections, in 
general a single slice does not suffice to reduce M — > M/G globally. 

The Euclidian product of two vectors x, y is indicated in (10.43) by ^-transpose 
times y, i.e., x T y - xiy,. More general bilinear norms (x,y) can be used, as long 
as they are G-invariant, i.e., constant on each irreducible subspace. An example is 
the quadratic Casimir (10.11). 



Examplel0.ll An SO(2) moving frame: (continued from example 10.2) The 
SO(2) action 

(x\,X2) = {x\ cos 8 + X2 sin 6, —x\ sinfl + X2 cos 6) (10.46) 

is regular on R 2 \{0). Thus we can define a slice as a 'hyperplane' (here a mere line), 
through x' = (0, 1), with group tangent t' = (1, 0), and ensure uniqueness by clockwise 
rotation into positive x 2 axis. Hence the reduced state space is the half-line x\ = 0, x 2 = 
x 2 > 0. The slice condition then simplifies to xi = 0 and yields the explicit formula for 
the moving frame parameter 

e(xux 2 ) = tw~\xi/x2), (10.47) 

i.e., the angle which rotates the point (x u x 2 ) back to the slice, taking care that tan -1 
distinguishes (x\,x 2 ) plane quadrants correctly. Substituting (10.47) back to (10.46) 
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Figure 10.11: Method of moving frames for a flow 
SO(2)-equivariant under (10.18) with slice through 
x' = (0, 1,0,0,0), group tangent f = (1,0,0,0,0). 
The clockwise orientation condition restricts the slice 
to half-hyperplane xi =0, X2 > 0. A trajectory started 
on the slice at x(0), evolves to a state space point with 
a non-zero Xi(t\). Compute the polar angle (f>i of x(t{) 
in the (xi,x?) plane. Rotate x(fj) clockwise by <f>\ to 
x (h) - g(—<f>i) x(ti), so that the equivalent point on the 
circle lies on the slice, Xi(ti) = 0. Thus after every 
finite time step followed by a rotation the trajectory 
restarts from the xjfe) = 0 reduced state space. 











X(t;) \ 






y(h) y(0) x 2 

X, 



and using cos(tan 1 a) = (1 + a 2 ) 1/2 , sin(tan 1 a) = a(l + a 2 ) 1/2 confirms x\ = 0. It also 
yields an explicit expression for the transformation to variables on the slice, 

x 2 = ^jx\ + x\ . (10.48) 

This was to be expected as SO(2) preserves lengths, x 2 + x 2 = x 2 + x 2 . If dynamics is 
in plane and SO(2) equivariant, the solutions can only be circles of radius (x 2 + x 2 ) 112 , 
so this is the "rectification" of the harmonic oscillator by a change to polar coordinates, 
example B. 1. Still, it illustrates the sense in which the method of moving frames yields 
group invariants. (E. Siminos) 



The slice condition (10.43) fixes Af directions; the remaining vectors (xn+\ . . . x c {) 
span the slice hyperplane. They are d - N fundamental invariants, in the sense 
that any other invariant can be expressed in terms of them, and they are function- 
ally independent. Thus they serve to distinguish orbits in the neighborhood of the 
slice-fixing point x! , i.e., two points lie on the same group orbit if and only if all 
the fundamental invariants agree. 



10.4.2 Dynamics within a slice 



I made a wrong mistake. 
— Yogi Berra 

As an alternative to the post-processing approach of the preceding sections, we 
can proceed as follows: Split up the integration into a sequence of finite time 
steps, each followed by a rotation of the final point (and the whole coordinate 
frame with it; the 'moving frame') such that the next segment's initial point is in 
the slice fixed by a point % ', see figure 10. 1 1. It is tempting to see what happens 
if the steps are taken infinitesimal. As we shall see, we do get a flow restricted to 
the slice, but at a price. 

Using decomposition (10.42) one can always write the full state space tra- 
jectory as x(t) = g(T)x(r), where the (d - AO-dimensional reduced state space 
trajectory £(t) is to be fixed by some condition, and g(r) is then the corresponding 
curve on the A-dimensional group manifold of the group action that rotates x. into 
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x at time r. The time derivative is then x - v{gx) - gx + gv, with the reduced state 
space velocity field given by v = dx/dt. Rewriting this as v = g~ l v(g x) - g~ l g x 
and using the equivariance condition (10.23) leads to 



v = v-g l gx. 



The Lie group element (10.7) and its time derivative describe the group tangent 
flow 

*- i *=*- i !«*- t =*-t. 

This is the group tangent velocity g~ l gx = </> • t(x) evaluated at the point x, i.e., 
with g - 1 . The flow v = dx/dt in the (d-N) directions transverse to the group 
flow is now obtained by subtracting the flow along the group tangent direction, 

v(Jc) = v(x) - 0(jc) • t(x) , ( 10.49) 

for any factorization (10.42) of the flow of form x(r) = g(r)x(r). To integrate 
these equations we first have to fix a particular flow factorization by imposing 
conditions on x(r), and then integrate phases 0(t) on a given reduced state space 
trajectory x(t). 

Here we demand that the reduced state space is confined to a hyperplane slice. 
Substituting (10.49) into the time derivative of the fixed slice condition (10.44), 



v(x) T t' a =v(x) T t' a -^ a -t(x) T t' a = 0, 



yields the equation for the group phases flow0 for the slice fixed by xf, together 
with the reduced state space M flow v(x): 



v(x) = v(Jc) - j>(x) ■ t(x) , xeM (10.50) 

v(x) T t' a 
t(x) T ■ f 



Ux) = 4^7. (10.51) 



exercise 10.31 
exercise 10.32 



Each group orbit M x = {gx\g € G} is an equivalence class; method of slices 
represents the class by its single slice intersection point x. By construction #V = 
0, and the motion stays in the (J-A^)-dimensional slice. We have thus replaced the 
original dynamical system {M,f} by a reduced system {M,f}. 

In the pattern recognition and 'template fitting' settings (10.51) is called the 
'reconstruction equation.' Integrated together, the reduced state space trajectory exercjse 1Q 33 
(10.50) and g(j) = exp{^(r) • T}, the integrated phase (10.51), reconstruct the full exerc j se 10 35 
state space trajectory x(t) = g(r) Jc(t) from the reduced state space trajectory x(t), 
so no information about the flow is lost in the process of symmetry reduction. 
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Figure 10.12: A slice fixed by taking as a tem- 
plate a point on the complex Lorenz equations 
relative equilibrium group orbit, x' = x TW i- (a) 
The strange attractor of figure 10.1 in the reduced 
state space of (10.50), {x i ,x 2 ,z} projection, (b) 
{*2,y2, z] projection. The reduced state space com- 
plex Lorenz flow strange attractor of figure 10.1 
now exhibits a discontinuity due to the vanishing 
denominator in (10.52). 




Example 10.12 A slice for complex Lorenz flow, (continuation of example 10.6) Here 
we can use the fact that 

t(x) T ■ f = x T T T ■ T x — x\x\ + x 2 x' 2 + y\y\ + y 2 y' 2 

is the dot-product restricted to the m = 1 4-dimensional representation of SO(2). A 
generic x' can be brought to form x' = (0, \,y\,y' 2 ,z) by a rotation and rescaling. Then 
IT = (1,0,^,-^,0), and 

v(x) ■ f _ vi + y 3 / 2 - v A y\ 
t(x) -f x 2 + yi/j + y%y' 2 ' 

A long time trajectory of (10.50) with x! on the relative equilibrium TWi group orbit 
is shown in figure 10.12. As initial condition we chose the initial point ( 10.32) on the 
unstable manifold of T W\ , rotated back to the slice by angle $ as prescribed by ( 1 0.44). 
We show the part of the trajectory for t e [70, 100]. The relative equilibriumTW \, now an 
equilibrium of the reduced state space dynamics, organizes the flow into a Rossler type 
attractor (see figure 2.6). The denominator in (10.51) vanishes and the phase velocity 
ip(x) diverges whenever the direction of group action on the reduced state space point 
is perpendicular to the direction of group action on the slice point x'. While the reduced 
state space flow appears continuous in the {x u x 2 ,z} projection, figure 10.12(a), this 
singularity manifests itself as a discontinuity in the {x 2 ,y 2 , z} projection, figure 1 0. 12(b). 
The reduced state space complex Lorenz flow strange attractor of figure 10.1 now 
exhibits a discontinuity whenever the trajectory crosses this 3-dimensional subspace. 



Slice flow equations (10.50) and (10.51) are pretty, but there is a trouble in 
the paradise. The slice flow encounters singularities in subsets of state space, with 
phase velocity $ divergent whenever the denominator in (10.52) changes sign, see 
{x2,yi,z} projection of figure 10.12(b). Hence a single slice does not in general 
suffice to cover M/G globally. 



10.5 Method of images: Hilbert bases 



(E. Siminos and P. Cvitanovic) 

Erudite reader might wonder: why all this slicing and dicing, when the problem 
of symmetry reduction had been solved by Hilbert and Weyl nearly a century 
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Figure 10.13: Invariant 'image' of complex Lorenz 
flow, figure 10.1, projected onto the invariant polyno- 
mials basis (10.53). Note the unstable manifold con- 
nection from the equilibrium EQ Q at the origin to the 
strange attractor controlled by the rotation around rela- 
tive equilibrium EQ\ (the reduced state space image of 
TWi); as in the Lorenz flow figure 3.4, natural measure 
close to EQ 0 is vanishingly small but non-zero. 




ago? Indeed, the most common approach to symmetry reduction is by means 
of a Hilbert invariant polynomial bases (9.27), motivated intuitively by existence 
of such nonlinear invariants as the rotationally-invariant length r = x\ + x\ + 
• • • + xh, or, in Hamiltonian dynamics, the energy function. One trades in the 
equivariant state space coordinates {x\,x%, ■■■ , x c i\ for a non-unique set of m > d 
polynomials {u\, ui, • • • ,u m } invariant under the action of the symmetry group. 
These polynomials are linearly independent, but functionally dependent through 
m - d + N relations called syzygies. 



Example 10.13 An SO(2) Hilbert basis. 

Hilbert basis 



(continued from example 9.18) The 



"5 



2,2 

xiyi - x 2 yi 
z . 



ui = y\ + y\ , 
u 4 = xiy\ + x 2 y 2 , 



(10.53) 



is invariant under the SO(2) action on a 5-dimensional state space (10. 18). That im- 
plies, in particular, that the image of the full state space relative equilibrium TW\ group 
orbit of figure 10.4 is the stationary equilibrium point EQ U see figure 10.13. The poly- 
nomials are linearly independent, but related through one syzygy 



U\U 2 



0. 



(10.54) 



yielding a 4-dimensional MISO(2) reduced state space, a symmetry-invariant repre- 
sentation of the 5-dimensional SO{2) equivariant dynamics. (continued in exam- 
ple 10.14) 



The dynamical equations follow from the chain rule 



dui 

Ui — — Xj 
OX; 



(10.55) 



upon substitution {x\ , X2, ■ ■ ■ , xj} — > {u\ , 112, • • • , u m \. One can either rewrite the 
dynamics in this basis or plot the 'image' of solutions computed in the original, 
equivariant basis in terms of these invariant polynomials. 



exercise 10.15 
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Example 10.14 Complex Lorenz equations in a Hilbert basis. (continuation of 
example 10.13) Substitution of (10.2) and (10.53) into (10.55) yields complex Lorenz 
equations in terms of invariant polynomials: 

ii\ — 2o~(u4 — u\), 

u 2 = -2(u 2 -p2«3 - (pi - u 5 )u 4 ) , 

u i — — (f + 1) «3 + Pi U\ + e U4 , (10.56) 
U4 — — (o~ + 1) U4 + (pi - K5) U\ + (TUi — e W3 , 
M5 = U4 — bu$. 

As far as visualization goes, we need neither construct nor integrate the invariant dy- 
namics (10.56). It suffices to integrate the original, unreduced flow of Figure 10.1, but 
plot the solution in the image space, i.e., u -, invariant, Hilbert polynomial coordinates, 
as in figure 10. 13. (continued in example 10. 15) 

Reducing dimensionality of a dynamical system by elimination of variables 
through inclusion of syzygies such as (10.54) introduces singularities. Such elimi- 
nation of variables, however, is not needed for visualization purposes; syzygies 
merely guarantee that the dynamics takes place on a submanifold in the projec- 
tion on the {u\,U2, ■■■ , u m \ coordinates. However, when one reconstructs the dy- 
namics in the original space M from its image M/G, the transformations have 
singularities at the fixed-point subspaces of the isotropy subgroups in At 



Example 10.15 Hilbert basis singularities, (continuation of example 10.14) When 
one takes syzygies into account in rewriting a dynamical system, singularities are intro- 
duced. For instance, if we solve (10.54) for u 2 and substitute into (10.56), the reduced 
set of equations, 

ii\ — 2o~(ii4 — u\) 

«3 — —(cr+l)uj,+p2Ui+eu4 

U4 — — (cr + l)z<4 + (p\ — us) u\ + cr(u\ + ify/ui — eu^ 

its — U4-bu$, (10.57) 
is singular as u\ -> 0. (E. Siminos) 

Nevertheless we can now easily identify a suitable Poincare section, guided 
by the Lorenz flow examples of chapter 9, as one that contains the z-axis and 
the image of the relative equilibrium TW\, here defined by the condition u\ = 
u\. As in example 11.4, we construct the first return map using as coordinate 
the Euclidean arclength along the intersection of the unstable manifold of TWy 
with the Poincare surface of section, see figure 10.14. Thus the goals set into 
the introduction to this chapter are attained: we have reduced the messy strange 
attractor of figure 10.1 to a 1-dimensional return map. As will be explained in 
example 11.4 for the Lorenz attractor, we now have the symbolic dynamics and 
can compute as many relative periodic orbits of the complex Lorenz flow as we 
wish, missing none. 
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Figure 10.14: Return map to the Poincare section 
u i — U4 for complex Lorenz equations projected on 
invariant polynomials (10.53). The return map coor- 
dinate is the Euclidean arclength distance from TW lt 
measured along the Poincare section of its spiral-out 
unstable manifold, as for the Lorenz flow return map 
of example 1 1.4. 



What limits the utility of Hilbert basis methods are not such singularities, but 
rather the fact that the algebra needed to determine a Hilbert basis becomes com- 
putationally prohibitive as the dimension of the system or of the group increases. 
Moreover, even if such basis were available, rewriting the equations in an invari- 
ant polynomial basis seems impractical, so in practice Hilbert basis computations 
appear not feasible beyond state space dimension of order ten. When our goal is 
to quotient continuous symmetries of high-dimensional flows, such as the Navier- 
Stokes flows, we need a workable framework. The method of moving frames of 
sect. 10.4 is one such minimalist alternative. 



Resume 

The message: If a dynamical systems has a symmetry, use it! Here we have 
described how, and offered two approaches to continuous symmetry reduction. 
In the method of slices one fixes a 'slice' (x - l) T t' = 0, a hyperplane normal 
to the group tangent f that cuts across group orbits in the neighborhood of the 
slice-fixing point x'. Each class of symmetry-equivalent points is represented by 
a single point, with the symmetry-reduced dynamics in the reduced state space 
M/G given by (10.50): 

v = v-j>-t, 4> a = {v T t' a )l{t-t'). 

In practice one runs the dynamics in the full state space, and post-processes the 
trajectory by the method of moving frames. In the Hilbert polynomial basis ap- 
proach one transforms the equivariant state space coordinates into invariant ones, 
by a nonlinear coordinate transformation 

{xi,x%,---,x d } -> {u\,u 2 , •■■,u m ] + {syzygies}, 

and studies the invariant 'image' of dynamics (10.55) rewritten in terms of invari- 
ant coordinates. 

Continuous symmetry reduction is considerably more involved than the dis- 
crete symmetry reduction to a fundamental domain of chapter 9. Slices are only 
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local sections of group orbits, and Hilbert polynomials are non-unique and diffi- 
cult to compute for high-dimensional flows. However, there is no need to actually 
recast the dynamics in the new coordinates: either approach can be used as a vi- 
sualization tool, with all computations carried out in the original coordinates, and 
then, when done, projecting the solutions onto the symmetry reduced state space 
by post-processing the data. The trick is to construct a good set of symmetry 
invariant Poincare sections (see sect. 3.1), and that is always a dark art, with or 
without a symmetry. 

We conclude with a few general observations: Higher dimensional dynamics 
requires study of compact invariant sets of higher dimension than 0-dimensional 
equilibria and 1 -dimensional periodic orbits studied so far. In sect. 2. 1.1 we made 
an attempt to classify 'all possible motions:' (1) equilibria, (2) periodic orbits, (3) 
everything else. Now one can discern in the fog of dynamics an outline of a more 
serious classification - long time dynamics takes place on the closure of a set of 
all invariant compact sets preserved by the dynamics, and those are: (1) 0-dimens- 
ional equilibria Meq, (2) 1-dimensional periodic orbits M p , (3) global symmetry 
induced A^-dimensional relative equilibria Airw, (4) (A7+l)-dimensional relative 
periodic orbits M p , (5) terra incognita. We have some inklings of the 'terra incog- 
nita:' for example, in symplectic symmetry settings one finds KAM-tori, and in 
general dynamical settings we encounter partially hyperbolic invariant M-tori, 
isolated tori that are consequences of dynamics, not of a global symmetry. They 
are harder to compute than anything we have attempted so far, as they cannot be 
represented by a single relative periodic orbit, but require a numerical computa- 
tion of full M-dimensional compact invariant sets and their infinite-dimensional 
linearized Jacobian matrices, marginal in M dimensions, and hyperbolic in the 
rest. We expect partially hyperbolic invariant tori to play important role in high- 
dimensional dynamics. In this chapter we have focused on the simplest example 
of such compact invariant sets, where invariant tori are a robust consequence of 
a global continuous symmetry of the dynamics. The direct product structure of a 
global symmetry that commutes with the flow enables us to reduce the dynamics 
to a desymmetrized (J-l-A^-dimensional reduced state space M/G. 

Relative equilibria and relative periodic orbits are the hallmark of systems 
with continuous symmetry. Amusingly, in this extension of 'periodic orbit' theory 
from unstable 1-dimensional closed periodic orbits to unstable (A^+ ^-dimension- 
al compact manifolds M p invariant under continuous symmetries, there are either 
no or proportionally few periodic orbits. In presence of a continuous symmetry, 
likelihood of finding a periodic orbit is zero. Relative periodic orbits are almost 
never eventually periodic, i.e., they almost never lie on periodic trajectories in 
the full state space, so looking for periodic orbits in systems with continuous 
symmetries is a fool's errand. 

However, dynamical systems are often equivariant under a combination of 
continuous symmetries and discrete coordinate transformations of chapter"?, for 
example the orthogonal group 0(n). In presence of discrete symmetries relative 
periodic orbits within discrete symmetry-invariant subspaces are eventually peri- 
odic. Atypical as they are (no generic chaotic orbit can ever enter these discrete 
invariant subspaces) they will be important for periodic orbit theory, as there the 
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shortest orbits dominate, and they tend to be the most symmetric solutions. 

J J chapter 21 



Commentary 

Remark 10.1 A brief history of relativity, or, 'Desymmetrization and its discontents' 
(after Civilization and its discontents; continued from remark 9.1): The literature on 
symmetries in dynamical systems is immense, most of it deliriously unintelligible. Would 
it kill them to say 'symmetry of orbit p' instead of carrying on about 'isottopies, quo- 
tients, factors, normalizers, centralizers and stabilizers?' [9, 10, 8, 15] Group action being 
'free, faithful, proper, regular?' Symmetry-reduced state space being 'orbitfold?' For the 
dynamical systems applications at hand we need only basic the Lie group facts, on the 
level of any standard group theory textbook [ 2], We found Roger Penrose [ 3] introduction 
to the subject both enjoyable and understandable. Chapter 2. of ref. [ 4] offers a peda- 
gogical introduction to Lie groups of transformations, and Nakahara [ 5] to Lie deriva- 
tives and brackets. The presentation given here is in part based on Siminos thesis [ 6] 
and ref. [7]. The reader is referred to the monographs of Golubitsky and Stewart [ 8], 
Hoyle [9], Olver [11], Bredon [12], and Krupa [ 1 3] for more depth and rigor than would 
be wise to wade into here. 

Relative equilibria and relative periodic solutions are related by symmetry reduction 
to equilibria and periodic solutions of the reduced dynamics. They appear in many phys- 
ical applications, such as celestial mechanics, molecular dynamics, motion of rigid bod- 
ies, nonlinear waves, spiralling patterns, and fluid mechanics. A relative equilibrium is a 
solution which travels along an orbit of the symmetry group at constant speed; an intro- 
duction to them is given, for example, in Marsden [?]. According to Cushman, Bates [ 14] 
and Yoder [15], C. Huygens [ 1 6] understood the relative equilibria of a spherical pendu- 
lum many years before publishing them in 1673. A reduction of the translation symmetry 
was obtained by Jacobi (for a modern, symplectic implementation, see Laskar et al. [ 17]). 
In 1892 German sociologist Vierkandt [18] showed that on a symmetry-reduced space 
(the constrained velocity phase space modulo the action of the group of Euclidean mo- 
tions of the plane) all orbits of the rolling disk system are periodic [ 19]. According to 
Chenciner [20], the first attempt to find (relative) periodic solutions of the TY-body prob- 
lem was the 1896 short note by Poincare [21], in the context of the 3-body problem. 
Poincare named such solutions 'relative.' Relative equilibria of the /Y-body problem 
(known in this context as the Lagrange points, stationary in the co-rotating frame) are 
circular motions in the inertial frame, and relative periodic orbits correspond to quasiperi- 
odic motions in the inertial frame. For relative periodic orbits in celestial mechanics see 
also ref. [22]. A striking application of relative periodic orbits has been the discovery of 
"choreographies" in the TV -body problems [ 23, 24, 25]. 

The modern story on equivariance and dynamical systems starts perhaps with S. 
Smale [26] and M. Field [27], and on bifurcations in presence of symmetries with Ru- 
elle [28]. Ruelle proves that the stability matrix/Jacobian matrix evaluated at an equilib- 
rium/fixed point x e Aic decomposes into linear irreducible representations of G, and that 
stable/unstable manifold continuations of its eigenvectors inherit their symmetry proper- 
ties, and shows that an equilibrium can bifurcate to a rotationally invariant periodic orbit 
(i.e., relative equilibrium). 

Gilmore and Lettelier monograph [ 29] offers a very clear, detailed and user friendly 
discussion of symmetry reduction by means of Hilbert polynomial bases (do not look for 
'Hilbert' in the index, though). Vladimirov, Toronov and Derbov [ 30] use an invariant 
polynomial basis different from (10.53) to study bounding manifolds of the symmetry re- 
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duced complex Lorenz flow and its homoclinic bifurcations. There is no general strategy 
how to construct a Hilbert basis; clever low-dimensional examples have been constructed 
case-by-case. The example 10.13, with one obvious syzygy, is also misleading - syzygies 
proliferate rapidly with increase in dimensionality. The determination of a Hilbert basis 
appears computationally prohibitive for state space dimensions larger than ten [31, 32], 
and rewriting the equations of motions in invariant polynomial bases appears impractical 
for high-dimensional flows. Thus, by 1920's the problem of rewriting equivariant flows 
as invariant ones was solved by Hilbert and Weyl, but at the cost of introducing largely 
arbitrary extra dimensions, with the reduced flows on manifolds of lowered dimensions, 
constrained by sets of syzygies. Cartan found this unsatisfactory, and in 1935 he intro- 
duced [33] the notion of a moving frame, a map from a manifold to a Lie group, which 
seeks no invariant polynomial basis, but instead rewrites the reduced Ai/G flow in terms 
of d — N fundamental invariants defined by a generalization of the Poincare section, a 
slice that cuts across all group orbits in some open neighborhood. Fels and Olver view the 
method as an alternative to the Grobner bases methods for computing Hilbert polynomi- 
als, to compute functionally independent fundamental invariant bases for general group 
actions (with no explicit connection to dynamics, differential equations or symmetry re- 
duction). 'Fundamental' here means that they can be used to generate all other invari- 
ants. Olver's monograph [ I ! ] is pedagogical, but does not describe the original Cartan's 
method. Fels and Olver papers [34, 35] are lengthy and technical. They refer to Cartan's 
method as method of 'moving frames' and view it as a special and less rigorous case of 
their 'moving coframe' method. The name 'moving coframes' arises through the use of 
Maurer-Cartan form which is a coframe on the Lie group G, i.e., they form a pointwise 
basis for the cotangent space. In refs. [6, 7] the invariant bases generated by the moving 
frame method are used as a basis to project a full state space trajectory to the slice (i.e., 
the Ai/G reduced state space). 

The basic idea of the 'method of slices' is intuitive and frequently reinvented, often 
under a different name; for example, it is stated without attribution as the problem 1. 
of Sect. 6.2 of Arnol'd Ordinary Differential Equations [36]. The factorization (10.42) 
is stated on p. 31 of Anosov and Arnol'd [37], who note, without further elaboration, 
that in the vicinity of a point which is not fixed by the group one can reduce the order 
of a system of differential equations by the dimension of the group. Ref. [ 38] refers to 
symmetry reduction as 'lowering the order.' For the definition of 'slice' see, for example, 
Chossat and Lauterbach [ ]. Briefly, a submanifold ALt< containing x' is called a slice 
through x' if it is invariant under isotropy G %(M<t>) - Mf>. If x' is a fixed point of G, than 
slice is invariant under the whole group. The slice theorem is explained, for example, 
in Encyclopaedia of Mathematics. Slices tend to be discussed in contexts much more 
difficult than our application - symplectic groups, sections in absence of global charts, 
non-compact Lie groups. We follow refs. [ 39] in referring to a local group-orbit section as 
a 'slice.' Refs. [ 1 2, 40] and others refer to global group-orbit sections as 'cross-sections,' a 
term that we would rather avoid, as it already has a different and well established meaning 
in physics. Duistermaat and Kolk [41] refer to 'slices,' but the usage goes back at least 
to Guillemin and Sternberg [40] in 1984, Palais [42] in 1961 and Mastow [43] in 1957. 
Bredon [12] discusses both cross-sections and slices. Guillemin and Sternberg [40] define 
the 'cross-section,' but emphasize that finding it is very rare: "existence of a global section 
is a very stringent condition on a group action. The notion of 'slice' is weaker but has a 
much broader range of existence." 

Several important fluid dynamics flows exhibit continuous symmetries which are ei- 
ther SO(2) or products of SO(2) groups, each of which act on different coordinates of the 
state space. The Kuramoto-Sivashinsky equations [?, ?], plane Couette flow [ 3 1, ?, 55, 1], 
and pipe flow [56, 57] all have continuous symmetries of this form. In the 1982 paper 
Rand [58] explains how presence of continuous symmetries gives rise to rotating and 
modulated rotating (quasiperiodic) waves in fluid dynamics. Haller and Mezic [ 59] re- 
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duce symmetries of three-dimensional volume-preserving flows and reinvent method of 
moving frames, under the name 'orbit projection map.' There is extensive literature on re- 
duction of symplectic manifolds with symmetry; Marsden and Weinstein 1974 article [ 60] 
is an important early reference. Then there are studies of the reduced phase spaces for vor- 
tices moving on a sphere such as ref. [61], and many, many others. 

Reaction-diffusion systems are often equivariant with respect to the action of a fi- 
nite dimensional (not necessarily compact) Lie group. Spiral wave formation in such 
nonlinear excitable media was first observed in 1970 by Zaikin and Zhabotinsky [44]. 
Winfree [45, 46] noted that spiral tips execute meandering motions. Barkley and collabo- 
rators [47, 48] showed that the noncompact Euclidean symmetry of this class of systems 
precludes nonlinear entrainment of translational and rotational drifts and that the inter- 
action of the Hopf and the Euclidean eigenmodes leads to observed quasiperiodic and 
meandering behaviors. Fiedler, in the influential 1995 talk at the Newton Institute, and 
Fiedler, Sandstede, Wulff, Turaev and Scheel [49, 50, 51, 52] treat Euclidean symme- 
try bifurcations in the context of spiral wave formation. The central idea is to utilize 
the semidirect product structure of the Euclidean group E(2) to transform the flow into a 
'skew product' form, with a part orthogonal to the group orbit, and the other part within 
it, as in (10.50). They refer to a linear slice M. near relative equilibrium as a Palais slice, 
with Palais coordinates. As the choice of the slice is arbitrary, these coordinates are 
not unique. According to these authors, the skew product flow was first written down by 
Mielke [53], in the context of buckling in the elasticity theory. However, this decompo- 
sition is no doubt much older. For example, it was used by Krupa [ 13, 32] in his local 
slice study of bifurcations of relative equilibria. Biktashev, Holden, and Nikolaev [ 54] 
cite Anosov and Arnol'd [37] for the 'well-known' factorization (10.42) and write down 
the slice flow equations (10.50). 

Neither Fiedler et al. [49] nor Biktashev et al. [54] implemented their methods nu- 
merically. That was done by Rowley and Marsden for the Kuramoto-Sivashinsky [ 39] and 
the Burgers [62] equations, and Beyn and Thummler [63, 64] for a number of reaction- 
diffusion systems, described by parabolic partial differential equations on unbounded do- 
mains. We recommend the Barkley paper [-18] for a clear explanation of how the Eu- 
clidean symmetry leads to spirals, and the Beyn and Thummler paper [ 63] for inspira- 
tional concrete examples of how 'freezing '/'slicing' simplifies the dynamics of rotational, 
traveling and spiraling relative equilibria. Beyn and Thummler write the solution as a 
composition of the action of a time dependent group element g{t) with a 'frozen,' in-slice 
solution u{t) (10.42). In their nomenclature, making a relative equilibrium stationary by 
going to a co-moving frame is 'freezing' the traveling wave, and the imposition of the 
phase condition (i.e., slice condition (10.43)) is the 'freezing ansatz.' They find it more 
convenient to make use of the equivariance by extending the state space rather than reduc- 
ing it, by adding an additional parameter and a phase condition. The 'freezing ansatz' [ 63] 
is identical to the Rowley and Marsden [62] and our slicing, except that 'freezing' is for- 
mulated as an additional constraint, just as when we compute periodic orbits of ODEs we 
add Poincare section as an additional constraint, i.e., increase the dimensionality of the 
problem by 1 for every continuous symmetry (see sect. 13.4). 

F J J J J section 13.4 

Derivation of sect. 10.4.2 follows most closely Rowley and Marsden [62] who, in 
the pattern recognition setting refer to the slice point as a 'template,' and call ( 10.51) the 
'reconstruction equation' [?, 65]. They also describe the 'method of connections' (called 
'orthogonality of time and group orbit at successive times' in ref. [ 63]), for which the 
reconstruction equation (10.5 1) denominator is t(x) T ■ t(x) and thus nonvanishing as long 
as the action of the group is regular. This avoids the spurious slice singularities, but it 
is not clear what the 'method of connections' buys us otherwise. It does not reduce the 
dimensionality of the state space, and it accrues 'geometric phases' which prevent relative 
periodic orbits from closing into periodic orbits. Geometric phase in laser equations, in- 
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eluding complex Lorenz equations, has been studied in ref. [ 66, 67, 69, 70, 71]. Another 
theorist's temptation is to hope that a continuous symmetry would lead us to a conserved 
quantity. However, Noether theorem requires that equations of motion be cast in La- 
grangian form and that the Lagrangian exhibits variational symmetries [72, 73]. Such 
variational symmetries are hard to find for dissipative systems. 

Sect. 10.1.2 title 'Lie groups for cyclists' is bit of a joke in more ways than one. 
First, 'cyclist,' 'pedestrian' throughout ChaosBook.org refer jokingly both to the title of 
Lipkin's Lie groups for pedestrians [74] and to our preoccupations with actual cycling. 
Lipkin's 'pedestrian' is fluent in Quantum Field Theory, but wobbly on Dynkin diagrams. 
More to the point, it is impossible to dispose of Lie groups in a page of text. As a counter- 
dote to the 1-page summary of sect. 10.1.2, consider reading Gilmore's monograph [75] 
which offers a quirky, personal and enjoyable distillation of a lifetime of pondering Lie 
groups. As seems to be the case with any textbook on Lie groups, it will not help you with 
the problem at hand, but it is the only place you can learn both what Galois actually did 
when he invented the theory of finite groups in 1830, and what, inspired by Galois, Lie 
actually did in his 1874 study of symmetries of ODEs. Gilmore also explains many things 
that we pass over in silence here, such as matrix groups, group manifolds, and compact 
groups. 

One would think that with all this literature the case is shut and closed, but not so. 
Applied mathematicians are inordinately fond of bifurcations, and almost all of the pre- 
vious work focuses on equilibria, relative equilibria, and their bifurcations, and for these 
problems a single slice works well. Only when one tries to describe the totality of chaotic 
orbits does the non-global nature of slices become a serious nuisance. 

(E. Siminos and P. Cvitanovic) 

Remark 10.2 Complex Lorenz equations (10.1) were introduced by Gibbon and 
McGuinness [76, 77] as a low-dimensional model of baroclinic instability in the atmo- 
sphere. They are a generalization of Lorenz equations (2.13). Ning and Haken [78] have 
shown that equations isomorphic to complex Lorenz equations also appear as a trunca- 
tion of Maxwell-Bloch equations describing a single mode, detuned, ring laser. They set 
e+po = 0 so that SO(2)-orbits of detuned equilibria exist [77]. Zeghlache and Mandel [?] 
also use equations isomorphic to complex Lorenz equations with e +p 2 = 0 in their studies 
of detuned ring lasers. This choice is 'degenerate' in the sense that it leads to non-generic 
bifurcations. As existence of relative equilibria in systems with SO(2) symmetry is the 
generic situation, we follow Bakasov and Abraham [79] who set pi — 0 and e + 0 in 
order to describe detuned lasers. Here, however, we are not interested in the physical ap- 
plications of these equations; rather, we study them as a simple example of a dynamical 
system with continuous (but no discrete) symmetries, with a view of testing methods of 
reducing the dynamics to a lower-dimensional reduced state space. Complex Lorenz flow 
examples and exercises in this chapter are based on E. Siminos thesis [6] and R. Wilczak 
project report [80]. 

Remark 10.3 Velocity vs. Speed Velocity is a vector, the rate at which the object 
changes its position. Speed, or the magnitude of the velocity, is a scalar quantity which 
describes how fast an object moves. We denote the rate of change of group phases, or 
the phase velocity by the vector c = ((f>\, • • • , 4>n) = (ci, • ■ • , Cjv), a component for each 
of the N continuous symmetry parameters. These are converted to state space velocity 
components along the group tangents by 

v(x) = c(t) ■ t(x) . (10.58) 
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For rotational waves these are called "angular velocities." 



Remark 10.4 Killing fields. The symmetry tangent vector fields discussed here are a 
special case of Killing vector fields of Riemannian geometry and special relativity. If this 
poetry warms the cockles of your heart, hang on. From wikipedia ( this wikipedia might 
also be useful): A Killing vector field is a set of infinitesimal generators of isometries on 
a Riemannian manifold that preserve the metric. Flows generated by Killing fields are 
continuous isometries of the manifold. The flow generates a symmetry, in the sense that 
moving each point on an object the same distance in the direction of the Killing vector 
field will not distort distances on the object. A vector field X is a Killing field if the Lie 
derivative with respect to X of the metric g vanishes: 



-Cxg = 0. 



(10.59) 



Killing vector fields can also be defined on any (possibly nonmetric) manifold M if we 
take any Lie group G acting on it instead of the group of isometries. In this broader sense, 
a Killing vector field is the pushforward of a left invariant vector field on G by the group 
action. The space of the Killing vector fields is isomorphic to the Lie algebra g of G. 

If the equations of motion can be cast in Lagrangian form, with the Lagrangian ex- 
hibiting variational symmetries [72, 73], Noether theorem associates a conserved quantity 
with each Killing vector. 



(E. Siminos and P, Cvitanovic) 



Exercises 



10.1. 



10.2. 



Visualizations of the 5-dimensional complex Lorenz 
flow: Plot complex Lorenz flow projected on any 
three of the five [x\, Xz,y\,yi, z] axes. Experiment with 
different visualizations. 

SO(2) rotations in a plane: Show by exponentiation 
(10.7) that the SO(2) Lie algebra element T generates 
rotation g in a plane, 



g {0) = e Te = cos 



1 0 
0 1 



+ sin 9 



0 1 
-1 0 



cos 9 
- sin 9 



sin 9 
cos 9 



crete subgroup C m of SO(2) rotations about an axis (let's 
say the 'z-axis'), 



C 1/m v(x) = v(C 1/m x) = v(x) , 



l/myn 



(C 1/m ) 



the only non-zero components of Fourier-transformed 
equations of motion are a jm for j = 1, 2, • • •. Argue that 
the Fourier representation is then the quotient map of 
the dynamics, Ai/C m . (Hint: this sounds much fancier 
than what is - think first of how it applies to the Lorenz 
system and the 3-diskpinball.) 



10.3. 



Invariance under fractional rotations. Argue that 
if the isotropy group of the velocity field v(x) is the dis- 



(10.60) 10.4. U(l) equivariance of complex Lorenz equations for 
finite angles: Show that the vector field in complex 
Lorenz equations (10.1) is equivariant under (10.7), the 
unitary group U(l) acting on R 5 = C 2 x R by 



g(ff)(x,y,z) = (e l9 x,e ,e y,z), 9e [Q,2n) .(10.61) 
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(E. Siminos) 

10.5. SO(2) equivariance of complex Lorenz equations for 
finite angles: Show that complex Lorenz equations 
(10.2) are equivariant under rotation for finite angles. 

10.6. Stability matrix of complex Lorenz flow: Compute 
the stability matrix (10.26) for complex Lorenz equa- 
tions (10.2). 

10.7. SO(2) equivariance of complex Lorenz equations for 
infinitesimal angles. Show that complex Lorenz 
equations are equivariant under infinitesimal SO(2) ro- 
tations. 

10.8. A 2-mode SO(2)-equivariant flow: Complex Lorenz 
equations (10.1) of Gibbon and McGuinness [76] have 
a degenerate 4-dimensional subspace, with SO(2) acting 
only in its lowest non-trivial representation. Here is a 
possible model, still 5-dimensional, but with SO(2) act- 
ing in the two lowest representations. Such models arise 
as truncations of Fourier-basis representations of PDEs 
on periodic domains. In the complex form, the simplest 
such modification of complex Lorenz equations may be 
the "2-mode" system 



and substituting the Lie algebra generator 



x 

y 



—crx + crx y 

(P ~ z)x 2 - ay 
1 



z = —bz + - (x 2 y* + x* 2 yj , 



(10.62) 



where x, y, p, a are complex and z, b, cr are real. Rewrit- 
ten in terms of real variables x — x\ + ix 2 , y - yi + iy 2 
this is a 5-dimensional first order ODE system 



x\ 

h 

z 



-crxi + cr(xiyi - x 2 y 2 ) 

-crx 2 + cr(xiy 2 - x 2 yi) 

-y\ + ey 2 + (pi - z)(x\ - x\) - 2p 2 xix 2 

-y 2 -ey x + p 2 (x\ - x\) + (pi - z)(2x\x 2 ) 



-bz 



■ {x\ - x\)yi + 2xix 2 y 2 . 



(10.63) 



Verify ( 10.63) by substituting x = x\+ix 2 , y - yi+iy 2 , 
p — pi + ip 2 , a = 1 + i e into the complex 2-mode equa- 
tions (10.62). 

10.9. U(l) equivariance of 2-mode system for finite angles: 

Show that 2-mode system (10.62) is equivariant under 
rotation for finite angles. 

10.10. SO(2) equivariance of the 2-mode system for in- 
finitesimal angles. Verify that the 2-mode system 
(10.63) is equivariant under infinitesimal SO(2) rota- 
tions (10.18) by showing that the stability matrix (4.3) 
for the system is given by A — 

cr(yi - 1) cry 2 

cry 2 -cr(yi + 1) 

2p\X\ - 2p 2 x 2 - 2x\z 2x 2 z - 2p 2 x\ - 2p\x 2 

2p\x 2 + 2p 2 x\ — 2x 2 z 2p\X\ — 2p 2 x 2 — 2x\z 

2x\yi + 2x 2 y 2 2x\y 2 - 2x 2 y x 



10.15. 



crx i 
—crx 



T = 



0 
0 
0 



1 

0 
0 
0 
0 



0 0 ^ 

0 0 

2 0 

0 0 

0 0 ) 



(10.65) 



and the stability matrix (10.64) into the equivariance 
condition (10.24). 

10.11. Visualizations of the 5-dimensional 2-mode system: 

Plot 2-mode system projected on any three of the five 
{x\,x 2 ,y\,y 2 ,z} axes. For complex Lorenz flow nu- 
merical examples we have set the parameters to p\ = 
28, p 2 = 0, b = 8/3, cr = 10, e = 1/10, but here you 
will have to play with them until you find something 
that looks interestingly chaotic. Experiment with differ- 
ent visualizations. It's a big mess - have no clue what 
parameters to take, what the trajectory will do. 

10.12. Discover the equivariance of a given flow: 



Suppose you were given complex Lorenz equa- 
tions, but nobody told you they are SO(2) equivariant. 
More generally, you might encounter a flow without re- 
alizing that it has a continuous symmetry - how would 
you discover it? 

10.13. Equilibria of complex Lorenz equations: Find all 
equilibria of complex Lorenz equations. Hint: Equilib- 
ria come either in the fixed Fix (G) subspace, or on a 
group orbit. 

10.14. More equilibria of complex Lorenz equations: 



In exercise 1 0. 1 3 we found only one equilibrium 
of complex Lorenz equations. The Ning and Haken [78] 
version of complex Lorenz equations (a truncation of 
Maxwell-Bloch equations describing a single mode ring 
laser) sets e + p 2 — 0 so that a detuned equilibrium ex- 
ists. Test your routines on 2 cases: (a) e — 0, p 2 — 0. As 
discussed by Siminos [6], reality of parameters a, p in 
(10.1) implies existence of a discrete C2 symmetry, (b) 
e + p 2 — 0, e + 0. You might want to compare results 
with those of Ning and Haken. 

Complex Lorenz equations in a Hilbert basis, (con- 
tinuation of example 10.13) Derive complex Lorenz 
equations (10.56) in terms of invariant polynomials 
(10.56), plot the strange attractor in projections you find 
illuminating ^pne example is figure 10.13). 

— crxr crx\ 0 
10J^). Hilbert basis_sjngi ilar^g^ When one takes syzygies 

into_4ccoun£if|jpjwiting a dynamical system, singular! 



tigSjL-ar^ introduced 



For instance, eliminate u 2 using the 
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syzygy, and show that you get the reduced set of equa- 10.19. Visualizations of the complex Lorenz flow in polar 



tions, 



coordinates: 



"3 
"4 
"5 



2 0~(U4 — U\) 

— (cr + 1) M3 + P2 Ml + e U4 

-(cr + 1)m4 + (pi — Us)u\ + cr{u\ + u 2 A )lu\ — eus 
114 -bus, (10.66) 

singular as u\ — » 0. (E. Siminos) 

10.17. Complex Lorenz equations in polar coordinates. 

Rewrite complex Lorenz equations from Cartesian to 
polar coordinates, using (x\, X2,y\,y2,z) = 

(r\ cos 0i, r\ sin^i, r2 cos 62, r 2 sin 82, z) , (10.67) 

where r\ > 0 , r2 > 0. Show that in polar coordinates the 
equations take form 



Plot a long-time solution of (10.68) and show that the 
polar representation introduces singularities into what 
initially was a smooth flow: 



( h ) 


1 






h 




62 




, z , 


\ 



-cr (r\ — r2 cos 9) 
-o-H sin6 
-r 2 + r\ ((pi - z) cos 9 - p2 sin 9) 
z) sin 9 + p2 cos 9) 
-bz + r\ r2 cos 9 



where angles always appear in the combination 9 = 
9\ - 92 We know from classical mechanics that for trans- 
lationally or rotationally invariant flows the relative dis- 
tance is invariant (that is why one speaks of 'relative' 
equilibria), hence we introduce a variable 9 = 9\ -62. 
Show that this new variable allows us to rewrite the com- 
plex Lorenz equations as 4 coupled polar coordinates 
equations: 



' h 1 


/ 


h 
9 




, z ) 


V 



—cr {r\ — Y2 cos 9) 
-r% + (pi - z)r\ cos# 
-(cr^+( Pl -zy£jsm9 

—bz + r\ r2 cos 9 



10.20. 



(10.68) 



where we have set p2 = 0. (hints: review (2.23), exam- 
ple B.l, exercise B.l, and (10.55)) 

10.18. 2-mode system in polar coordinates. Show that the 
2-mode system (10.63) rewritten in polar coordinates 
(10.67) is given by 



r\ = 

h = 



-crr\ + crr\r2 cos(#) 

-n + r\((p\ - z) cos(6>) - p 2 sin(6»)) 




We shall encounter the same problem in implementing 
the x\ — 0 slice, . 9 is very small until the trajectory ap- 
proaches either r\ — > 0 or ¥2 — > 0, where Mathematica 
continues through the singularity by a rapid change of 9 
by n. The fixed Fix (G) subspace (n ,r2,9,z) = (0, 0, 9, z) 
separates the two folds of the attractor. 

Plot complex Lorenz flow projected on any three of the 
{r\, r2, 9, z) coordinates. Experiment with different visu- 
alizations. The flow is singular as r ; — > 0, with angle 9j 
going through a rapid change there: is that a problem? 
Does it make sense to insist on r\ > 0 , r2 > 0, or should 
one let them have either sign in order that the 9 trajectory 
be continuous? 

Computing the relative equilibrium TW\\ The two 
rotation angles 9\ and 02 change in time, but at the rela- 
tive equilibria the difference between them is constant, 
9 = 0. Find the relative equilibria of the complex Lorenz 
equations by finding the equilibria of the system in polar 
coordinates (10.68). Show that 

(a) The relative equilibrium (hereafter referred to [ 6] 
as TWi) is given by 



9i = -crr 2 sin(0), # 2 = -e + -((pi-z)sin(60+p2Cos(<9)) 

, n where d = I + e 2 /(cr + l) 2 , 

z = -bz + — cos(#) , 



(ri,r 2 ,9,z) = (y/b(pi- d), yjbd(p\ - d), 

cos" 1 (l/ Vj),pi -d) ,(10.71) 



where 9 = 29\ - 02- Rewriting the angular part as 
9 = 29\ — 9\, we have 



(10$) 

The angular velocity of relative equilibrium TW\ 

(10.72) 



is 



9/ = cre/(cr+ 1), 
with the period Tjwt = 1n(cr + l)/<xe. 



9 = e — -((pi-z) sin(6>)+p 2 cos(0))-2r 2 o-sin(0) .(10.70) 

ri 10.21. Relative equilibrium TW\ in polar coordinates: Plot 

D. Borrero the equilibrium TWi in polar coordinates. 
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10.22. Relative equilibrium TW\ in Cartesian coordinates: 

Show that for (10.2) parameter values, 

xtw, = (xi,x 2 ,yi,y2,z) (10.73) 
= (8.4849, 0.077135, 8.4856, 0, 26.999) , 

10.31. 

is a point on the TW\ orbit. Plot the relative equilibrium 
TW\ in Cartesian coordinates. State the velocity of rela- 
tive equilibrium, compare with the imaginary part of the 
complex stability eigenvalue, and explain the two time 
scales visible in the 'horn', as well as the expansion rate 
per turn of the spiral. 

10.23. The relative equilibria of the 2-mode system: Find 
the relative equilibria of the 2-mode system by finding 

the equilibria of the system in polar coordinates ( 10.68). 10.32. 

10.24. Plotting the relative equilibria of the 2-mode system 
in polar coordinates: Plot the relative equilibria of 
the 2-mode system in polar coordinates. 

10.25. Plotting the relative equilibria of the 2-mode system 10.33. 
in Cartesian coordinates: Plot the relative equilibria 

of the 2-mode system in Cartesian coordinates. 

10.26. Eigenvalues and eigenvectors of TW\ stability ma- 10.34. 
trix: Compute the eigenvalues and eigenvectors of 

the stability matrix (10.26) evaluated at TW\ and using 

the (10.2) parameter values, in (a) Cartesian coordinates, 

(b) polar coordinates. 

F 10.35. 

10.27. The eigen-system of TW\ stability matrix in polar co- 
ordinates: Plot the eigenvectors of A at TW\ in polar 
coordinates, as well as the complex Lorenz flow at val- 
ues very near TW\. 

10.28. Eigenvalues and eigenvectors of EQ 0 stability ma- 
trix: Find the eigenvalues and the eigenvectors of the 
stability matrix A (10.26) at EQ 0 = (0, 0, 0, 0, 0) deter- 
mined in exercise 10.13. ChaosBook convention is to 
order eigenvalues from most positive (unstable) to the 
most negative. Follow that. Replace complex eigenvec- 
tors by the real, imaginary parts, so you can plot them in 10.36. 
(real) state space. 

10.29. The eigen-system of the stability matrix at E go : Plot 
the eigenvectors of A at EQo and the complex Lorenz 

flow at values very close to EQq. 10.37. 

10.30. SO(2) or harmonic oscillator slice: Construct a 
moving frame slice for action of SO(2) on R 2 

(x, y) i-» (x cos 9 - y sin 9, x sin 9 + y cos 9) 



by, for instance, the positive y axis: x — 0, y > 0. Write 
out explicitly the group transformations that bring any 
point back to the slice. What invariant is preserved by 
this construction? (E. Siminos) 

State space reduction by a slice, ODE formulation: 

Replace integration of the complex Lorenz equations by 
a sequence of finite time steps, each followed by a ro- 
tation such that the next segment initial point is in the 
slice %2 = 0, %\ > 0. Reconsider this as a sequence of 
infinitesimal time steps, each followed by an infinitesi- 
mal rotation such that the next segment initial point is in 
the slice X2 = 0, x\ > 0. Derive the corresponding 4d 
reduced state space ODE for the complex Lorenz flow. 

Accumulated phase shift: Derive the Id equation 
(10.5 1 ) for the accumulated phase shift 9 associated with 
the 4-dimensional reduced state space ODE of exer- 
cise 10.31. 

The moving frame flow stays in the reduced state 
space: Show that the flow (10.50) stays in a (d- 1)- 
dimensional slice. 

Relative equilibrium TW\ by the method of slices: 

Determine numerically the complex Lorenz equations 
equilibrium TW\ by the method of slices, template x' 
of your choice. 

State space reduction by a relative equilibrium TW\ 
template: Replace integration of the complex Lorenz 
equations by a sequence of short time steps, each fol- 
lowed by a rotation such that the next segment initial 
point is in the relative equilibrium TW\ slice 

(x - xtw, ) ' trw, = 0 , tjw, = Txtw, , (10.74) 

where for any x, x — g{9) ■ x is the rotation that lies in 
the slice. Check figure 10.12 by long-time integration of 
(10.50). 

Stability of a relative equilibrium in the reduced state 

space: Find an expression for the stability matrix of 
the system at a relative equilibrium when a linear slice 
is used to reduce the symmetry of the flow. 

Stability of a relative periodic orbit in the reduced 
state space: Find an expression for the Jacobian 
matrix (monodromy matrix) of a relative periodic orbit 
when a linear slice is used to reduce the dynamics of the 
flow. 
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Chapter 11 



Charting the state space 



The classification of the constituents of a chaos, nothing 
less is here essayed. 

— Herman Melville, Moby Dick, chapter 32 




n this chapter and the next we learn to partition state space in a topologically 
invariant way, and identify topologically distinct orbits. 



We start in sect. 11.1 with a simple and intuitive example, a 3-disk game 
of pinball. The qualitative dynamics of stretching/shrinking strips of surviving 
state space regions enables us to partition the state space and assign symbolic 
dynamics itineraries to trajectories. For the 3-disk game of pinball all possible 
symbol sequences enumerate all possible orbits. 

In sect. 1 1 .2 we use Rossler and Lorenz flows to motivate modeling of higher- 
dimensional flows by iteration of 1-dimensional maps. For these two flows the 
1 -dimensional maps capture essentially all of the higher-dimensional flow dynam- 
ics, both qualitatively and quantitatively. 1-dimensional maps suffice to explain 
the two key aspects of qualitative dynamics; temporal ordering, or itinerary with 
which a trajectory visits state space regions (sect. 11.3), and the spatial ordering 
between trajectory points (sect. 11.4), which is the key to determining the admis- 
sibility of an orbit with a prescribed itinerary. In a generic dynamical system not 
every symbol sequence is realized as a dynamical trajectory; as one looks further 
and further, one discovers more and more 'pruning' rules which prohibit fami- 
lies of itineraries. For 1-dimensional 'stretch & fold' maps the kneading theory 
(sect. 11.5) provides the definitive answer as to which temporal itineraries are ad- 
missible as trajectories of the dynamical system. Finally, sect. 11.6 is meant serve 
as a guide to the basic concepts of symbolic dynamics. 

Deceptively simple, this subject can get very difficult very quickly, so in this 
chapter we do the first, 1-dimensional pass at a pedestrian level, postponing the 
discussion of higher-dimensional, cyclist level issues to chapter 12. 

Even though by inclination you might only care about the serious stuff, like 
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Figure 11.1: A coarse partition of M into regions Aio, 
Mi, and M2, labeled by ternary alphabet Si = {1, 2, 3}. 




Rydberg atoms or mesoscopic devices, and resent wasting time on formal things, 
this chapter and chapters 14 and 15 are good for you. Study them. 



11.1 Qualitative dynamics 



What can a flow do to points in state space? This is a very difficult question to 
answer because we have assumed very little about the evolution function /; con- 
tinuity, and differentiability a sufficient number of times. Trying to make sense of 
this question is one of the basic concerns in the study of dynamical systems. The 
first answer was inspired by the motion of the planets: they appear to repeat their 
motion through the firmament, so the ancients' attempts to describe dynamical 
systems were to think of them as periodic. 

However, periodicity is almost never quite exact. What one tends to observe 
is recurrence. A recurrence of a point ^ of a dynamical system is a return of 
that point to a neighborhood of where it started. How close the point must 
return is up to us: we can choose a volume of any size and shape, and call it the 
neighborhood Aio, as long as it encloses xq. For chaotic dynamical systems, the 
evolution might bring the point back to the starting neighborhood infinitely often. 
That is, the set 



will in general have an infinity of recurrent episodes. 

To observe a recurrence we must look at neighborhoods of points. This sug- 
gests another way of describing how points move in state space, the important 
first step on the way to a theory of dynamical systems: qualitative, topological 
dynamics, or symbolic dynamics. As the subject can get quite technical, a sum- 
mary of the basic notions and definitions of symbolic dynamics is relegated to 
sect. 11.6; check that section and references cited in remark 11.1 whenever you 
run into baffling jargon. 



(R. Mainieri and R Cvitanovic) 



{y € Mo : y = f'(x 0 ), t > t 0 ] 



(11.1) 
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Figure 11.2: A trajectory with itinerary 021012. 



Figure 11.3: A 1-step memory refinement of the par- 
tition of figure 11.1, with each region Mj subdivided 
into Mjo, Ma, and Ma, labeled by nine 'words' 
{00,01,02, •••,21,22). 




We start by dividing the state space up into regions Ma, Mb,---, Mz, as in 
figure 11.1. This can be done in many ways, not all equally clever. Any such 
division of state space into distinct regions constitutes a partition, and we associate 
with each region (sometimes referred to as a state) a symbol s from an AMetter 
alphabet or state set 3K - {A, B, C, • • • , Z). As the state evolves, different regions 
will be visited. The visitation sequence - forthwith referred to as the itinerary - 
can be represented by the letters of the alphabet J{. If, as in the example sketched 
in figure 1 1.2, the state space is divided into three regions M>, Mi, and M2, the 
'letters' are the integers {0, 1,2}, and the itinerary for the trajectory sketched in 
the figure is 0 1 — > 2 1 — » 1 1 — » 0 1 — > 1 1 — » 2 1 — » - - 



Example 11.1 3-disk symbolic dynamics: Consider the motion of a free point 

sx&rci c 

particle in a plane with 3 elastically reflecting convex disks, figure 11.4. After a collision 
with a disk a particle either continues to another disk or escapes, so a trajectory can 
be labeled by the disk sequence. Sets of configuration space pinball trajectories of 
figure 11.4 become quickly hard to disentangle. As we shall see in what follows, their 
state space visualization in terms of Poincare sections V = [s, p] (figure 11.5, see also 
figure 3.9 (b)) is much more powerful, (continued in example 11.2) 



In general only a subset of points in Mb reaches Ma- This observation offers 
a systematic way to refine a partition by introducing m-step memory: the region 
M Sm ... SlSo consists of the subset of points of M So whose trajectory for the next m 
time steps will be sq t-> s\ i-» • • • s m , see figure 11.3. 



Example 11.2 3-disk state space partition: (continued from example 11.1) Em- 
bedded within Mn, Mu are four strips Mm, Mm, Mm, Mm of initial conditions 
that survive two bounces, and so forth. At each bounce a cone of initially nearby trajec- 
tories disperses (see figures 1.8 and 11.4). Also in order to attain a desired longer and 
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Figure 11.4: Two pinballs that start out very close 
to each other exhibit the same qualitative dynamics 
_2313_ for the first three bounces, but due to the expo- 
nentially growing separation of trajectories with time, 
follow different itineraries thereafter: one escapes after 
_2313_, the other one escapes after _23 132321_. (No- 
tation _2313_ is explained in sect. 11.6.) 



23132321 \ 




2313 



Figure 11.5: The 3-disk game of pinball Poincare 
section, trajectories emanating from the disk 1 
with x = (arclength, parallel momentum) = (s, p), 
where p = sin 9. (a) Strips of initial points Mn, 
Ma which reach disks 2, 3 in one bounce, re- 
spectively, (b) 1-step memory refinement of parti- 
tion (see figure 11.3): strips of initial points Mm, 
Mm, Mm and Mm which reach disks 1, 2, 3 
in two bounces, respectively. Disk radius : center 
separation ratio a:R = 1:2.5. (Y. Lan) 




-l 



-2.5 



0 2. 



2.5 



■2.5 



(a) 



(b) 



longer itinerary of bounces, the strip of initial points x 0 = (so,po) requires exponentially 
finer precision, nested within the initial state space strips drawn in figure 11.5. Provided 
that the disks are sufficiently separated, after n bounces the survivors are labeled by 2" 
distinct itineraries s\S2S3...s n . (continued in example 12.2) 

If there is no way to reach partition M from partition Mj, and conversely, par- 
tition Mj from partition Mj, the state space consists of at least two disconnected 
pieces, and we can analyze it piece by piece. An interesting partition should be 
dynamically connected, i.e., one should be able to go from any region M to any 
other region Mj in a finite number of steps. A dynamical system with such a 
partition is said to be metrically indecomposable. 

In general one also encounters transient regions - regions to which the dy- 
namics does not return once they are exited. Hence we have to distinguish be- 
tween (uninteresting to us) wandering trajectories that never return to the initial 
neighborhood, and the non-wandering set (2.3) of the recurrent trajectories. We 
are implicitly assuming that the transients are sufficiently short-lived not to be of 
experimental interest. 

However, knowing that a point from M reaches {Mj, ■ ■ ■ ,Mt) in one step 
is not quite good enough. We would be happier if we knew that the map of the 
entire initial region f{M) overlaps nicely with the entire Mj; otherwise we have 
to subpartition Mj into the subset f(Mj) and the reminder, and often we will 
find ourselves partitioning ad infinitum, a difficult topic that we shall return to 
sect. 12.4. 

Such considerations motivate the notion of a Markov partition, a partition for 
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Figure 11.6: For the 3-disk game of pinball no 
itineraries are pruned as long as the inter-disk spac- 
ing exceeds R : a > 2.04821419.... (from 
K.T. Hansen [22]) 




which no memory of preceding steps is required to fix the transitions allowed 
in the next step. Finite Markov partitions can be generated by expanding d- 
dimensional iterated mappings / : M — > M, if At can be divided into N regions 
{ Mq, All , ... , Atjv-l I sucn that in one step points from an initial region A\ either 
fully cover a region Aly, or miss it altogether, 



Whether such partitions can be found is not clear at all - the borders need to be 
lower-dimensional sets invariant under dynamics, and there is no guarantee that 
these are topologically simple objects. However, the game of pinball (and many 
other non- wandering repeller sets) is especially nice: the issue of determining the 
partition borders does not arise, as the survivors live on disconnected pieces of the 
state space, separated by a chasm of escaping trajectories. 

The itinerary of a billiard trajectory is finite for a scattering trajectory, coming 
in from infinity and escaping after a finite number of collisions, infinite for a 
trapped trajectory, and infinitely repeating for a periodic orbit. A finite length 
trajectory is not uniquely specified by its finite itinerary, but an isolated unstable 
cycle is: its itinerary is an infinitely repeating block of symbols. For hyperbolic 
flows the intersection of the future and past itineraries, the bi-infinite itinerary 
S'.S + = ■ ■ ■ S-2S-1 so-Si ^2^3 • • • specifies a unique orbit. Almost all infinite length 
trajectories (orbits) are aperiodic. Still, the longer the trajectory is, the closer to 
it is a periodic orbit whose itinerary shadows the trajectory for its whole length: 
think of the state space as the unit interval, aperiodic orbits as normal numbers, 
and periodic ones as fractions whose denominators correspond to cycle periods, 
as is literally the case for the Farey map (24.42), to be discussed in sect. 24.3.4. 

Determining whether the symbolic dynamics is complete (as is the case for 
sufficiently separated disks, see figure 11.6), pruned (for example, for touching or 
overlapping disks), or only a first coarse-graining of the topology (as, for example, 
for smooth potentials with islands of stability) requires a case -by-case investiga- 
tion, a discussion we postpone until sect. 11.5 and chapter 12. For now, we assume 
that the disks are sufficiently separated that there is no additional pruning beyond 
the prohibition of self-bounces. 

Inspecting figure 11.5 we see that the relative ordering of regions with dif- 
fering finite itineraries is a qualitative, topological property of the flow. This ob- 
servation motivates searches for simple, 'canonical' partitions which exhibit in 
a simple manner the spatial ordering common to entire classes of topologically 
similar nonlinear flows. 



either Mj n /(At,-) - 0 or Mj c /(At,-) . 



(11.2) 
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11.2 From (/-dimensional flows to 1 -dimensional maps 

Symbolic dynamics for the 3-disk game of pinball is so straightforward that one 
may altogether fail to see the connection between the topology of hyperbolic 
flows and their symbolic dynamics. This is brought out more clearly by the 1- 
dimensional visualization of 'stretch & fold' flows to which we turn now. 

We construct here the return maps (3.4) for two iconic flows, the Rossler and 
the Lorenz, in order to show how ODEs in higher dimensions can be modeled by 
low-dimensional maps. In the examples at hand the strong dissipation happens to 
render the dynamics essentially 1 -dimensional, both qualitatively and quantitati- 
vely. However, as we shall show in chapter 12, strong dissipation is not essential 
-the hyperbolicity is- so the method applies to Hamiltonian (symplectic areas pre- 
serving) flows as well. The key idea is to replace the original, arbitrarily concocted 
coordinates by intrinsic, dynamically invariant curvilinear coordinates erected on 
neighborhoods of unstable manifolds. 



Suppose concentrations of certain chemical reactants worry you, or the variati- 
ons in the Chicago temperature, humidity, pressure and winds affect your mood. 
Such quantities vary within some fixed range, and so do their rates of change. 
Even if we are studying an open system such as the 3-disk pinball game, we tend 
to be interested in a finite region around the disks and ignore the escapees. So a 
typical dynamical system that we care about is bounded. If the price to keep going 
is high - for example, we try to stir up some tar, and observe it come to a dead 
stop the moment we cease our labors - the dynamics tends to settle into a simple 
state. However, as the resistance to change decreases - the tar is heated up and we 
are more vigorous in our stirring - the dynamics becomes unstable. 

Example 11.3 Rossler attractor return map: Stretch & fold. (continued from 
example 4.5) In the Rossler flow (2. 18) of example 3.3 we sketched the attractor by 
running a long chaotic trajectory, and noted that the attractor of figure 3.2 is very thin. 
For Rossler flow an interval transverse to the attractor is stretched, folded and fiercely 
pressed back. The attractor is 'fractal,' but for all practical purposes the return map 
is 1 -dimensional; your printer will need a resolution better than 10 13 dots per inch to 
start resolving its structure. We had attempted to describe this 'stretch & fold' flow by a 
1 -dimensional return map, but the maps that we plotted in figure 3.3 were disquieting; 
they did not appear to be a 1-to-1 maps. This apparent non-invertibility is an artifact of 
projection of a 2-dimensional return map (R n ,z n ) -» (R»+\,z n +\) onto the 1 -dimensional 
subspace R„ -» R n+ \ . Now that we understand equilibria and their linear stability, let's 
do this right. 

The key idea is to measure arclength distances along the unstable manifold of 
the x~ equilibrium point, as in figure 1 1 .7 (a). Luck is with us; figure 1 1 .7 (b) return map 
= P(s„) looks much like a parabola of example 3.8, so we shall take the unimodal 
map symbolic dynamics, sect. 1 1 .3, as our guess for the covering symbolic dynamics, 
(continued in example 11.11) 



fast track: 



sect. 11.3, p. 225 
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Figure 11.7: (a) x = 0, y > 0 Poincare section of 
the x_ unstable manifold, Rossler flow figure 2.6. 
(PuPi) are measured with the origin placed at x_. 
(b) s — > P(s) return map, where .? is the arc-length 
distance measured along the unstable manifold of 
equilibrium point x— (A. Basu and J. Newman) 
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You get the idea - Rossler flow winds around the stable manifold of the 'cen- 
tral' equilibrium, stretches and folds, and the dynamics on the Poincare section 
of the flow can be reduced to a 1 -dimensional map. The next example is simi- 
lar, but the folding mechanism is very different: the unstable manifold of one of 
the equilibria collides with the stable manifold of the other one, forcing a robust 
heteroclinic connection between the two. 



fast track: 

sect. 1 1 .3, p. 225 



11.2.1 Heteroclinic connections 



In general, two manifolds can intersect in a stable way if the sum of their di- 
mensions is greater than or equal to the dimension of the state space, hence an 
unstable manifold of dimension k is likely to intersect a stable manifold whose 
codimension in state space is less than or equal to k (i.e., robustly with respect to 
small changes of system parameters). Trajectories that leave a fixed point along 
its unstable manifold and reach another fixed point along its stable manifold are 
called heteroclinic if the two fixed points are distinct or homoclinic if the initial 
and the final point are the same point. Whether the two manifolds actually in- 
tersect is a subtle question that is central to the issue of "structural stability" of 
ergodic dynamical systems. 



remark 1 1 .3 



Example 11.4 Lorenz flow: Stretch & crease. We now deploy the symmetry of 
Lorenz flow to streamline and complete analysis of the Lorenz strange attractor com- 
menced in example 9.14. There we showed that the dihedral D\ - {e,R} symmetry 
identifies the two equilibria EQ\ and EQ 2 , and the traditional 'two-eared' Lorenz flow 
figure 2.5 is replaced by the 'single-eared' flow of figure 9.6(a). Furthermore, symme- 
try identifies two sides of any plane through the z axis, replacing a full-space Poincare 
section plane by a half-plane, and the two directions of a full-space eigenvector ofEQo 
by a one-sided eigenvector, see figure 9.6(a). 

Example 4.7 explained the genesis of the x EQl equilibrium unstable manifold, 
its orientation and thickness, its collision with the z-axis, and its heteroclinic connec- 
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Figure 11.8: (a) A Poincare section of the Lorer 
flow in the doubled-polar angle representation, fij 
ure 9.6, given by the [y',z] plane that contains th 
z-axis and the equilibrium EQ\. xf axis points tc 
ward the viewer, (b) The Poincare section of th 
Lorenz flow by the section Crossings into the sec 
tion are marked red (solid) and crossings out < 
the section are marked blue (dashed). Outermo 
points of both in- and out-sections are given t 
the EQo unstable manifold W"(EQo) intersection 
(E. Siminos) 
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(b) 



Figure 11.9: The Poincare return map s„ +l = 
P(s„) parameterized by Euclidean arclength 5 mea- 
sured along the EQ\ unstable manifold, from x E q 1 to 
W"(EQ 0 ) section point, uppermost right point of the 
blue (dashed) segment in figure 11.8(b). The critical 
point (the 'crease') of the map is given by the section 
of the heteroclinic orbit W S (EQ 0 ) that descends all the 
way to EQo, in infinite time and with infinite slope. (E. 
Siminos) 




tion to the 

X EQ 0 — (0> 0> 0) equilibrium. All that remains is to describe how the EQo 
neighborhood connects back to theEQi unstable manifold. 

Figure 9.6 and figure 11.8(a) show clearly how the Lorenz dynamics is pieced 
together from the 2 equilibria and their unstable manifolds: Having completed the de- 
scent to EQo, the infinitesimal neighborhood of the heteroclinic EQ\ -> EQo trajectory 
is ejected along the unstable manifold of EQo and is re-injected into the unstable man- 
ifold of EQ\. Both sides of the narrow strip enclosing the EQo unstable manifold lie 
above it, and they get folded onto each other with a knife-edge crease (contracted 
exponentially for infinite time to the EQo heteroclinic point), with the heteroclinic out- 
trajectory defining the outer edge of the strange attractor. This leads to the folding of 
the outer branch of the Lorenz strange attractor, illustrated in figure 11.8(b), with the 
outermost edge following the unstable manifold of EQo. 

Now the stage is set for construction of Poincare sections and associated 
Poincare return maps. There are two natural choices; the section at EQo, lower part 
of figure 11.8(b), and the section (blue) above EQ\. The first section, together with 
the blowup of the EQo neighborhood, figure 4.6(b), illustrates clearly the scarcity of 
trajectories (vanishing natural measure) in the neighborhood of EQo- The flat section 
above EQ\ (which is, believe it or not, a smooth conjugacy by the flow of the knife-sharp 
section at EQo) is more convenient for our purposes. Its return map (3.4) is given by 
figure 11.9. 

The rest is straight sailing: to accuracy 1(T 4 the return map is unimodal, its crit- 
ical point's forward trajectory yields the kneading sequence (11.13), and the admissible 
binary sequences, so any number of periodic points can be accurately determined from 
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Figure 11.10: (a) The Rossler flow, figure 3.2, is 
an example of a recurrent flow that stretches and 
folds, (b) The Rossler 'stretch & fold' return map, 
figure 1 1 .7 (b). (R. Paskauskas and A. Basu) 




+ 6 



this 1 -dimensional return map, and the 3-dimensional cycles then verified by integrating 

the Lorenz differential equations (2. 13). As already observed by Lorenz, such a map is 

everywhere expanding on the strange attractor, so it is no wonder mathematicians can 

here make the ergodicity rigorous. 

a ' y section 20.6 

(E. Siminos and J. Halcrow) 



What have we learned from the above two exemplary 3-dimensional flows? 
If a flow is locally unstable but globally bounded, any open ball of initial points 
will be stretched out and then folded back. If the equilibria are hyperbolic, the 
trajectories will be attracted along some eigen-directions and ejected along others. 
The unstable manifold of one equilibrium can avoid stable manifolds of other 
equilibria, as is the case for Rossler, or slice them head on, as is the case for 
Lorenz. A typical trajectory wanders through state space, alternatively attracted 
into equilibria neighborhoods, and then ejected again. What is important is the 
motion along the unstable manifolds - that is where Id maps come from. 

At this juncture we proceed to show how this works on the simplest exam- 
ple: unimodal mappings of the interval. The erudite reader may skim through 
this chapter and then take a more demanding path, via the Smale horseshoes of 
chapter 12. Unimodal maps are easier, but less physically compelling. Smale 
horseshoes offer the high road, more complicated, but the right tool to generalize 
what we learned from the 3-disk dynamics, and begin analysis of general dynam- 
ical systems. It is up to you - unimodal maps suffice to get quickly to the heart of 
this treatise. 



11.3 Temporal ordering: Itineraries 

In this section we learn to name topologically distinct trajectories for the simple, 
but instructive case; 1 -dimensional maps of an interval. 

The simplest mapping of this type is unimodal; interval is stretched and folded 
only once, with at most two points mapping into a point in the refolded inter- 
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val, as in the Rtissler return map figure 11.10(b). A unimodal map f(x) is a 1- 
dimensional function R — > R denned on an interval At e R with a monotonically 
increasing (or decreasing) branch, a critical point (or interval) ^ for which f(x c ) 
attains the maximum (minimum) value, followed by a monotonically decreasing 
(increasing) branch. f/m'-modal means that the map is a 1 -humped map with one 
critical point within interval At. MwM-modal maps, with several critical points 
within interval At, can be described with a straight-forward generalization of the 
methods we describe next. 



Example 11.5 Unimodal maps: (continued from example 3.8) The simplest exam- 
ples of unimodal maps are the quadratic map 

f(x)=Ax{l-x), x e M = [0,1] (11.3) 

and numerically computed return maps such as figure 11.10(b). Such dynamical 
systems are irreversible (the inverse of f is double-valued), but, as we shall show 
in sect. 12.2, they may nevertheless serve as effective descriptions of invertible 2- 
dimensional hyperbolic flows. For the unimodal map such as figure 11.12 a Markov 
partition of the unit interval M is given by the two intervals {Mo, Mi}. (continued in 
example 11.6) 



Example 11.6 Full tent map, Ulam map: (continued from example 11.5) The 
simplest examples of unimodal maps with complete binary symbolic dynamics are the 
full tent map, figure 11.11, 

/(y) = l-2| r -l/2|, yeM = [0,l], (11.4) 

the Ulam map (quadratic map (1 1 .3) with A = 4) 

©xsrcis© d . 4 

f(x) = 4x(l-x), xeAt = [0,l], (11.5) 

and the repelling unimodal maps such as figure 11.12. For unimodal maps the Markov 
partition of the unit interval M is given by intervals {Mo, Mi}. We refer to (11.4) as the 
complete tent map because its symbolic dynamics is completely binary: as both f(Mo) 
and f{Mi ) fully cover M = {Mo, Mi}, all binary sequences are realized as admissible 
itineraries. 
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Figure 11.12: A unimodal repeller with the survivor 
intervals after 1 and 2 iterations. Intervals marked 
s i s 2 • • • s n consist of points that do not escape in n iter- 
ations, and follow the itinerary S + = Si$2--- s n . Note 
that the spatial ordering does not respect the binary or- 
dering; for example x 00 < x ol < x n < x 10 . Also indi- 
cated are the fixed points 0, 1, the 2-cycle 01, and the 
3-cycle OIL 
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For Id maps the critical value denotes either the maximum or the minimum 
value of f(x) on the defining interval; we assume here that it is a maximum, 
f(x c ) > f(x) for all x € At. The critical point x c that yields the critical value f(Xc) 
belongs to neither the left nor the right partition A\ and is instead denoted by its 
own symbol s = C. As we shall see, its images and preimages serve as partition 
boundary points. 

The trajectory xi,X2,X3, . . . of the initial point xq is given by the iteration 
x n +\ = f{x n ) . Iterating / and checking whether the point lands to the left or to the 
right of x c generates a temporally ordered topological itinerary (11.17) for a given 
trajectory, 



We refer to S + {xq) = .S1S2S3 • • • as the future itinerary. Our next task is to answer 
the reverse problem: given an itinerary, what is the spatial ordering of points that 
belong to the corresponding state space trajectory? 



Suppose you have succeeded in constructing a covering symbolic dynamics, such 
as the one we constructed for a well-separated 3-disk system. Now start moving 
the disks toward each other. At some critical separation (see figure 11.6) a disk 
will start blocking families of trajectories traversing the other two disks. The 
order in which trajectories disappear is determined by their relative ordering in 




(11.6) 



11.4 Spatial ordering 



A well-known theorem states that combinatorial factors 
are impossible to explain. [22] 

— G. 't Hooft and M. Veltman, DIAGRAMMAR 
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Figure 11.13: The n = 2, 4-intervals state space par- 
tition for the Bernoulli shift map (1 1.7), together with 
the fixed points 0, 1 and the 2-cycle 01. 




00 01 10 11 



space; the ones closest to the intervening disk will be pruned first. Determining 
inadmissible itineraries requires that we relate the spatial ordering of trajectories 
to their time ordered itineraries. 

The easiest point of departure is to start by working out this relation for 
the symbolic dynamics of 1 -dimensional mappings. As it appears impossible 
to present this material without getting bogged down in a sea of 0's, l's and 
subscripted subscripts, we announce the main result before embarking upon its 
derivation: 

The admissibility criterion (sect. 11.5) eliminates all itineraries that cannot 
occur for a given unimodal map. 



exercise 12.7 



section 11.5 



Example 11.7 Bernoulli shift map state space partition. First, an easy example: 
the Bernoulli shift map, figure 11.13, 

b(y) _f b 0 (y) = 2y, ye M 0 -[0,1/2) 

D{7> -\ b 1 (y) = 2y-l, y e Mi = (1/2,1] ' { ' 

models the 50-50% probability of a coin toss. It maps the unit interval onto itself, with 
fixed points yo = 0, y\ = 1. The closely related doubling map acts on the circle 

x i — > 2x (modi), xe [0,1] (11.8) 

and consequently has only one fixed point, x 0 = 0 = 1 (mod 1). The Bernoulli map 
is called a 'shift' map, as a multiplication by 2 acts on the binary representation of 
y = .*i*2*3 ■■■ by shifting its digits, b(y) = .s 2 S3 . . .. The nth preimages b~ n (y) of the 
critical point y c = l/2 partition the state space into 2" subintervals, each labeled by the 
first n binary digits of points y = . $1*2*3 • • ■ within the subinterval: figure 11.13 illustrates 
such 4-intervals state space partition {Mm, Mo\,Mn,M\o) for n = 2. 

Consider a map f(x) topologically conjugate (two monotonically increasing 
branches) to the Bernoulli shift, with the forward orbit of x generating the itinerary 
*i*2*3 • ■ •■ Convert this itinerary into Bernoulli map point y = .$1*2*3 • ■ •■ These values 
can now be used to spatially order points with different temporal itineraries: ify < y' , 
then x < x? . 

Suppose we have already computed all (« - l)-cyc/es of f(x), and would now 
like to compute the cycle p = *i*2*3 . . . *« of period n. Marky values on the unit interval 
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Figure 11.14: An alternating binary tree relates the 
itinerary labeling of the unimodal map intervals, fig- 
ure 11.12, to their spatial ordering. The dotted line 
stands for 0, the full line for 1; the binary sub-tree 
whose root is a full line with symbol 1 reverses the 
orientation, due to the orientation-reversing fold in fig- 
ures 11.10 and 11.12. See also figure 14.4. 



for all known periodic points of the Bernoulli shift map, and then insert in between them 
jo-kp, k = 0, 1, • • -,n p - 1 corresponding to periodic points of cycle p. In the dynamical 
state space they will be bracketed by corresponding cycle points xj from cycles al- 
ready computed, and thus the knowledge of the topological ordering of all cycle points 
provides us with robust initial guesses for periodic-orbit searches for any map with 2 
monotonically increasing branches, (continued in example 23.5) 



For the Bernoulli shift converting itineraries into a topological ordering is 
easy; the binary expansion of coordinate y is also its temporary itinerary. The tent 
map (11.4), figure 1 1 . 1 1 is a bit harder. It consists of two straight segments joined 
at x - 1/2. The symbol s n defined in (11.6) equals 0 if the function increases, 
and 1 if it decreases. Iteration forward in time generates the time itinerary. More 
importantly, the piecewise linearity of the map makes the converse possible: de- 
termine analytically an initial point given its itinerary, a property that we now use 
to define a topological coordinatization common to all unimodal maps. 

Here we have to face the fundamental problem of pedagogy: combinatorics 
cannot be taught. The best one can do is to state the answer and hope that you 
will figure it out by yourself. Then you can also complain that the way the rule is 
stated here is incomprehensible. 

The tent map point y(S + ) with future itinerary 5 + is given by converting the 
itinerary of s n 's into a binary number y by the following algorithm: 



W ^ = i l-w„ if ,„ +1 = l ■ Wl = *' 



w„ if s n = 0 
1 - w n if s n+1 = 

OO 

y(5 + ) = O.W1W2W3... = X w n /2 n . (11.9) 



n=\ 



This follows by inspection from the binary tree of figure 11.14. Once you figure 
this out, feel free to complain that the way the rule is stated here is incomprehen- 
sible, and show us how you did it better. 



Example 11.8 Converting y to S + : y whose itinerary is S + = 01 10000 ■■■is given 
by the binary number y = .010000- ■ -. Conversely, the itinerary of y - .01 is si = 0, 
f( y ) = .1 -> s 2 = 1, f 2 (y) = f(.l) = 1 -> s 3 = I, etc.. Orbit that starts out as a finite 
block followed by infinite repeats of another block p = S p = (s\S2S3 . . . s n )°° is said to 
be heteroclinic to the cycle p. An orbit that starts out as p°° followed by a finite block 
followed by infinite repeats of another block p' is said to be heteroclinic connection from 
cycle p to cycle p'. 
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We refer to y(S + ) as the (future) topological coordinate. The w/s are the 
digits in the binary expansion of the starting point y for the full tent map (11.4). 
In the left half-interval the map f(x) acts by multiplication by 2, while in the right 
half-interval the map acts as a flip as well as multiplication by 2, reversing the 
ordering, and generating in the process the sequence of &'s from the binary digits 
w„. 

The mapping xq — > S + (xq) — > yo = y(S + ) is a topological conjugacy that 
maps the trajectory of an initial point xq under the iteration of a given unimodal 
map to that initial point yo for which the trajectory of the 'canonical' unimodal 
map, the full tent map (1 1.4), has the same itinerary. The virtue of this conjugacy 
is that y(S + ) preserves the ordering for any unimodal map in the sense that if 
x > x, then y > y. 



Example 11.9 Periodic orbits of unimodal maps. Let 



m = 



Mx) if 
/iW if 



x > x r 



(11.10) 



and assume that all periodic orbits are unstable, i.e., the stability A p = f^' (see (4.42)) 
satisfies \A P \ > 1. Then the periodic point x SlS2S3 ,„ Sn is the only fixed point of the unique 
composition (3.15) ofn maps 

fs„ 0 ■ ■ ■ 0 fs 2 ° / Sl (* SlS j S3 ... S „) = -V v . (11-11) 

(note that successive maps, applied from the left, correspond to later times, i.e., later 
symbols in the itinerary). 

The nth iterate of a unimodal map has at most 2" monotone segments, and 
therefore there will be 2" or fewer periodic points of length n. For the full tent map 
(11.4) it has exactly 2" periodic points. A periodic orbit p of length n corresponds to an^ 
infinite repetition of a length n =n p symbol string block, customarily indicated by a fine 
over the string: p = S = (s1.s2.s3 . . . i„)°° = S1S2S3 • ■ • ■ As all itineraries are infinite, 
we shall adopt convention that a finite string itinerary p = $1*2*3 ■ • ■ s„ stands for infinite 
repetition of a finite block, and routinely omit the overline. A cycle p is called prime if its 
itinerary S cannot be written as a repetition of a shorter block S'. If the itinerary of xq is 
P = S1S2S3 ...s n , its cyclic permutation cr k p = suSk+\ ■ ■ ■ s n si . . . su-\ corresponds to the 
point Xk-\ in the same cycle. 



Example 11.10 Periodic points of the full tent map. Each cycle p is a set of n p 

rational-valued full tent map periodic points y. It follows from (11.9) that if the repeating 
string sis 2 . . . s„ contains an odd number of '1's, the string of well ordered symbols 
w\W2 ■ ■ ■ w 2 „ has to be of the double length before it repeats itself. The cycle-point y is 
a geometrical sum which we can rewrite as the odd-denominator fraction 

( V w, l ri w, 

r(^2---s n ) = l U T + 2F^^2J + "' 

t=\ t=l 

= ^_y^ (U12) 

Using this we can calculate the y p = y(S p ) for all short cycles. For orbits up to length 5 
this is done in table 11.1. 
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s 


y(S) 


S 


y(S) 


o 


.0 


0 


10111 


.11010 


26/31 


I 


.To 


2/3 


10110 


.1101100100 


= 28/33 


To 


.1100 


4/5 


10010 


.11100 


= 28/31 


ToT 


.TTo 


6/7 


10011 


.1110100010 


10/11 


Too 


.111000 


8/9 


10001 


.11110 


= 30/31 


1011 


.11010010 = 


14/17 


10000 


.1111100000 


= 32/33 


1001 


.1110 


14/15 








1000 


.11110000 = 


16/17 









Table 11.1: The maximal values of unimodal map cycles up to length 5. (K.T. Hansen) 

Critical points are special - they define the boundary between intervals, i.e., 
interval is split into 0 [left part], Xc [critical point] and 1 [right part]. For the dike 
map and the repeller figure 11.12 jc c is the whole interval of points along the flat 
top of the map, but usually it is a point. As illustrated by figures 11.11 and 11.13, 
for a unimodal map the preimages f~"(x c ) of the critical point x c serve as partition 
boundary points. But not all preimages-one has to ensure that they are within the 
set of all admissible orbits by checking them against the kneading sequence of the 
map. 



11.5 Kneading theory 

Tired of being harassed by your professors? Finish, get a 
job, do combinatorics your own way, while you still know 
everything. 

— Professor Gatto Nero 

(K.T. Hansen and P. Cvitanovic) 

The reason we need to be mindful of spatial ordering of temporal itineraries is 
that this spatial ordering provides us with criteria that separate inadmissible orbits 
from those realizable by the dynamics. For 1 -dimensional mappings the kneading 
theory provides a precise and definitive criterion of admissibility. 

If the parameter in the quadratic map (1 1.3) is A > 4, then the iterates of the 
critical point x c diverge for n —> oo, and any sequence S + composed of letters St = 
{0, 1} is admissible, and any value of 0 < y < 1 corresponds to an admissible orbit 
in the non-wandering set of the map. The corresponding repeller is a complete 
binary labeled Cantor set, the n —> oo limit of the nth level covering intervals 
sketched in figure 11.12. 

For A < 4 only a subset of the points in the interval y € [0, 1] corresponds 
to admissible orbits. The forbidden symbolic values are determined by observing 
that the largest x n value in an orbit x\ —> *2 — * *3 — * ■ • ■ has to be smaller than or 
equal to the image of the critical point, the critical value /(jfc). Let K = S + (x c ) 
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Figure 11.15: The 'dike' map obtained by slicing of 
the top portion of the tent map in figure 11.11. Any or- 
bit that visits the primary pruning interval (k, 1] is inad- 
missible. The admissible orbits form the Cantor set ob- 
tained by removing from the unit interval the primary 
pruning interval and all its iterates. Any admissible or- 
bit has the same topological coordinate and itinerary as 
the corresponding tent map figure 11.11 orbit. 




pruned 



be the itinerary of the critical point denoted the kneading sequence of the map. 
The corresponding topological coordinate is called the kneading value 

K = y{K) = y{S\x c )). (11.13) 

The 'canonical' map that has the same kneading sequence K (11.13) as f(x) 
is the dike map, figure 11.15, 

{ /o(r) = 2y yeMo = [(U/2) 

f(7) = My) = k yeM c = [k/2, 1 - k/2] , (11.14) 

{ My) = 2(1 -y) ye Mi = (l-*/2,l] 

obtained by slicing off all y(S + (xo)) > k. The dike map is the full tent map 
figure 11.11 with the top sliced off. It is convenient for coding the symbolic dy- 
namics, as those y values that survive the pruning are the same as for the full tent 
map figure 11.11, and are easily converted into admissible itineraries by (11.9). 

If y(S + ) > y(K), the point x whose itinerary is S + would exceed the critical 
value, x > f(x c ), and hence cannot be an admissible orbit. Let 

y(S + ) = supy(cr"'(S + )) (11.15) 

m 

be the maximal value, the highest topological coordinate reached by the orbit 
x\ — > X2 — > xj, — > . . ., where cr is the shift (1 1.20), o~(- • • 5 , _2^-i^o-^i^'2^3 • • •) = 
• • • s-2S-iSoS\.S2S3 ■ ■ ■ . We shall call the interval (k, 1] the primary pruned inter- 
val. The orbit 5 + is inadmissible if y of any shifted sequence of S + falls into this 
interval. 

Criterion of admissibility: Let k be the kneading value of the critical point, 
and y(S + ) be the maximal value of the orbit S + . Then the orbit S + is admissible 
if and only ify(S + ) < k. 

While a unimodal map may depend on many arbitrarily chosen parameters, its 
dynamics determines the unique kneading value k. We shall call k the topological 
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Figure 11.16: (a) Web diagram generated by 
the trajectory of the critical point the unimodal 
Rossler return map of figure 1 1.7 (b). (b) The web 
diagram for the corresponding 'canonical' dike 
map (11.14) with the same kneading sequence. 
(A. Basu and J. Newman) 



(a) 




parameter of the map. Unlike the parameters of the original dynamical system, 
the topological parameter has no reason to be either smooth or continuous. The 
jumps in k as a function of the map parameter such as A in (11.3) correspond to 
inadmissible values of the topological parameter. Each jump in k corresponds to 
a stability window associated with a stable cycle of a smooth unimodal map. For 
the quadratic map (1 1.3) k increases monotonically with the parameter A, but for 
a general unimodal map such monotonicity need not hold. 



Examplell.il Rossler return map web diagram: (continuation of example 11 .2) The 
arclength distance along the unstable manifold of the x- equilibrium point return map, 
figure 11.7(b), generates the kneading sequence (11.13) as the itinerary of the critical 
point plotted in figure 11.16(a). 



For further details of unimodal dynamics, the reader is referred to appendixE. 1 . 
As we shall see in sect. 12.4, for higher dimensional maps and flows there is no 
single parameter that orders dynamics monotonically; as a matter of fact, there 
is an infinity of parameters that need adjustment for a given symbolic dynamics. 
This difficult subject is beyond our current ambition horizon. 



fast track: 
chapter 12, p. 242 



11.6 Symbolic dynamics, basic notions 



(Mathematics) is considered a specialized dialect of the 
natural language and its functioning as a special case of 
speech. 

— Yuri I. Manin [1] 

In this section we collect the basic notions and definitions of symbolic dynamics. 
The reader might prefer to skim through this material on a first reading and return 
to it later, as the need arises. 
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Shifts. We associate with every initial point xq e M the future itinerary, a se- 
quence of symbols S + (xq) = s\S2Sj ■ ■ ■ which indicates the order in which the 
regions are visited. If the trajectory xi, X2, X3, . . . of the initial point xq is gener- 
ated by 

Xn+l = f(xn), (11.16) 

then the itinerary is given by the symbol sequence 

s„ = s if x„eM s . (11.17) 

Similarly, the past itinerary S'(xq) = ■ ■ ■ S-2S-1SQ describes the history of xq, the 
order in which the regions were visited before arriving to the point xj. To each 
point xq in the dynamical space we thus associate a bi-infmite itinerary 

S(xq) = (s k )keZ = S'.S + = ■■■ S-2S-1S0.S1S2S3 ■■■ . (11.18) 

The itinerary will be finite for a scattering trajectory, entering and then escaping 
At after a finite time, infinite for a trapped trajectory, and infinitely repeating for 
a periodic trajectory. 

The set of all bi-infmite itineraries that can be formed from the letters of the 
alphabet ft is called the, full shift (or topological Markov chain) 

ft z = {(s k ) ke z ■ s k e ft for all keZ}. (1U9) 

The jargon is not thrilling, but this is how professional dynamicists talk to each 
other. We will stick to plain English to the extent possible. 

We refer to this set of all conceivable itineraries as the covering symbolic dy- 
namics. The name shift is descriptive of the way the dynamics acts on these se- 
quences. As is clear from the definition (1 1. 17), a forward iteration x — > x 1 = f(x) 
shifts the entire itinerary to the left through the 'decimal point.' This operation, 
denoted by the shift operator o~, 

<rG • • S-2S-\Sq.S\S2S2, ■■■) = ■■■ S-2S-\SQS\.S2Sl ■■■ , (11.20) 

demoting the current partition label s\ from the future S + to the 'has been' itinerary 
5". The inverse shift cT l shifts the entire itinerary one step to the right. 

A finite sequence b - s^Sk+i • • • Sk+n b -\ °f symbols from ft is called a block 
of length «/>. If the symbols outside of the block remain unspecified, we denote 
the totality of orbits that share this block by _ ^t^+i - • - Sk+ nb -i- 
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A state space orbit is periodic if it returns to its initial point after a finite time; 
in shift space the orbit is periodic if its itinerary is an infinitely repeating block 
p°°. We shall refer to the set of periodic points M p that belong to a given periodic 
orbit as a cycle 

P — s\S2 - ■ ■ s„ p - {x SlS2 ... Snp , x S2 ... S/ipSl , ■ ■ ■ , x SnpSl ... Siip l } . (11.21) 

By its definition, a cycle is invariant under cyclic permutations of the symbols 
in the repeating block. A bar over a finite block of symbols denotes a periodic 
itinerary with infinitely repeating basic block; we shall omit the bar whenever it 
is clear from the context that the orbit is periodic. Each periodic point is labeled 
by the first n p steps of its future itinerary. For example, the 2nd periodic point is 
labeled by 

X S2-S np S\ ~ x S2-s„ p s v s 2 -s np s l ■ 

This - a bit strained - notation is meant to indicate that the symbol block repeats 
both in the past and in the future. It is helpful for determining spatial ordering of 
cycles of 2D-hyperbolic maps, to be undertaken in sect. 12.3.1. 

A prime cycle p of length n p is a single traversal of the orbit; its label is 
a block of rip symbols that cannot be written as a repeat of a shorter block (in 
the literature, such cycles are sometimes called primitive; we shall refer to it as 
'prime' throughout this text). 

Partitions. A partition is called generating if every infinite symbol sequence 
corresponds to a distinct point in state space. The finite Markov partition (1 1.2) 
is an example. Constructing a generating partition for a given system is a difficult 
problem. In the examples to follow, we shall concentrate on cases which that 
permit finite partitions, but in practice almost any generating partition of interest 
is infinite. 

A partition too coarse, coarser than, for example, a Markov partition, would 
assign the same symbol sequence to distinct dynamical trajectories. To avoid that, 
we often find it convenient to work with partitions finer than strictly necessary. 
Ideally the dynamics in the refined partition assigns a unique infinite itinerary 
• • • s-2S-\ sq.s\S2S-$ • • • to each distinct orbit, but there might exist full shift sym- 
bol sequences (1 1.19) which are not realized as orbits; such sequences are called 
inadmissible, and we say that the symbolic dynamics is pruned. The word is 
suggested by the 'pruning' of branches corresponding to forbidden sequences for 
symbolic dynamics organized hierarchically into a tree structure, as explained in 
chapter 14. 

A mapping / : M — » Ai together with a partition 3\ induces topological 
dynamics (E, o~), where the subshift 

£ = {ta)*ez}, (11-22) 
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is the set of all admissible (i.e., 'pruned') infinite itineraries, and cr : S — > X 
is the shift operator (11.20). The designation 'subshift' comes form the fact that 
S c Jl z is the subset of the full shift (11.19). One of our principal tasks in 
developing the symbolic dynamics of dynamical systems that occur in nature will 
be to determine S, the set of all bi-infinite itineraries S that are actually realized 
by the given dynamical system. 

Pruning. If the dynamics is pruned, the alphabet must be supplemented by 
a grammar, a set of pruning rules. After the inadmissible sequences have been 
pruned, it is often convenient to parse the symbolic strings into words of variable 
length - this is called coding. Suppose that the grammar can be stated as a finite 
number of pruning rules, each forbidding a block of finite length, 

§ = {bub 2 ,---b k } , (11.23) 

where a pruning block b is a sequence of symbols b - si S2 • • • s nb , s e Jl, of finite 
length i%. In this case we can always construct a finite Markov partition (1 1.2) by 
replacing finite length words of the original partition by letters of a new alphabet. 
In particular, if the longest forbidden block is of length M + 1 , we say that the 
symbolic dynamics is a shift of finite type with M-step memory. In that case we 
can recode the symbolic dynamics in terms of a new alphabet, with each new 
letter given by an admissible block of at most length M. 

A topological dynamical system (£, cr) for which all admissible itineraries are 
generated by a finite transition matrix (14.1) 

2 - {(Skhez : T SkSt+l = 1 for all k) (11.24) 

is called a subshift of finite type. 

in depth: 

chapter 12, p. 242 

Resume 

From our initial chapters 2 to 4 fixation on things local: a representative point, a 
short-time trajectory, a neighborhood, in this chapter we have made a courageous 
leap and gone global. 

The main lesson is that - if one intends to go thoughtfully about globalization - 
one should trust the dynamics itself, and let it partition the state space, by means of 
its (topologically invariant) unstable manifolds. This works if every equilibrium 
and periodic orbit is unstable, so one exits it local neighborhood via its unstable 
manifold. We delineate the segment of the unstable manifold between the fixed 
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point and the point where the nonlinearity of the dynamics folds back on itself 
as the primary segment, and measure location of nearby state space points by 
arclengths measured along this (curvilinear) segment. For 1 -dimensional maps 
the folding point is the critical point, and easy to determine. In higher dimensions, 
the situation is not so clear - we shall discuss that in chapter 12. 

Trajectories exit a neighborhood of an equilibrium or periodic point along un- 
stable directions, and fall along stable manifolds towards other fixed points, until 
they again are repelled along their unstable manifolds. Such sequences of visi- 
tations can be described by symbolic dynamics. As we shall show in chapter 14, 
they are encoded by transition matrices / transition graphs, and approximated dy- 
namically by sequences of unstable manifold — > unstable manifold maps, or, in 
case of a return to the initial neighborhood, by return maps s — > f(s). 

As 'kneading theory' of sect. 1 1.5 illustrates, not all conceivable symbol seq- 
uences are actually realized (admissible). The identification of all inadmissible or 
pruned sequences is in general not possible. However, the theory to be developed 
here relies on exhaustive enumeration of all admissible itineraries up to a given 
topological length; chapters 12 and 15 describe several strategies for accomplish- 
ing this for physically realistic goals. 



Commentary 

Remark 11.1 Symbolic dynamics. For a brief history of symbolic dynamics, from 
Hadamard in 1898, Morse and Hedlund in 1938 and onward, see notes to chapter 1 of 
Kitchens monograph [2], a very clear and enjoyable mathematical introduction to topics 
discussed here. Diacu and Holmes [ ] provide an excellent survey of symbolic dynamics 
applied to celestial mechanics. For a compact survey of symbolic dynamics techniques, 
consult sects. 3.2 and 8.3 of Robinson [4]. The binary labeling of the once-folding map 
periodic points was introduced by Myrberg [5] for 1 -dimensional maps, and its utility 
to 2-dimensional maps has been emphasized in refs. [6, 7]. For 1 -dimensional maps it 
is now customary to use the R-L notation of Metropolis, Stein and Stein [8, 9], indi- 
cating that the point x„ lies either to the left or to the right of the critical point in fig- 
ure 11.12. The symbolic dynamics of such mappings has been extensively studied by 
means of the Smale horseshoes, see for example ref. [10]. Using letters rather than nu- 
merals in symbol dynamics alphabets probably reflects good taste. We prefer numerals 
for their computational convenience, as they speed up conversions of itineraries into the 
topological coordinates {5, y) introduced in sect. 12.3.1. The alternating binary ordering 
of figure 11.14 is related to the Gray codes of computer science [11]. Kitchens [2] con- 
vention is • • • s^2S-i-saS\S2S-i ■ ■ •, with '.' placed differently from our convention ( 11.18). 



Remark 11.2 Kneading theory. The admissible itineraries are studied, for example, 
in refs. [12, 8, 10, 13]. We follow here the Milnor- Thurston exposition [14]. They study 
the topological zeta function for piecewise monotone maps of the interval, and show that 
for the finite subshift case it can be expressed in terms of a finite dimensional kneading 
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determinant. As the kneading determinant is essentially the topological zeta function 
of sect. 15.4, we do not discuss it here. Baladi and Ruelle have reworked this theory 
in a series of papers [15, 16, 17, 18]. See also P. Dahlqvist's appendix E.l. Knight 
and Klages [ 1 9] in their study of deterministic diffusion (for deterministic diffusion, see 
chapter 25) refer to the set of iterates of the critical point as 'generating orbit.' They 
say: "The structure of the Markov partitions varies wildly under parameter variation. 
The method we employ to understand the Markov partitions involves iterating the critical 
point. The set of iterates of this point form a set of Markov partition points for the map. 
Hence we call the orbit of the critical point a 'generating orbit.' If the generating orbit 
is finite for a particular value of parameters, we obtain a finite Markov partition. We can 
then use the finite Markov partition to tell us about the diffusive properties of the map and 
hence the structure of the diffusion coefficient." 

Remark 11.3 Heteroclinic connections. For sketches of heteroclinic connections in 
the nonlinear setting, see Abraham and Shaw illustrated classic [2D]. Section 5 of ref. [21] 
makes elegant use of stable manifold co-dimension counts and of invariant subspaces im- 
plied by discrete symmetries of the underlying PDE to deduce the existence of a hetero- 
clinic connection. 
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Exercises 

11.1. Binary symbolic dynamics. Verify that the short- 
est prime binary cycles of the unimodal repeller of fig- 
ure 11.12 are 0, T, 57, 00l, Oil, ■ ■ ■. Compare with ta- 
ble 15.1. Sketch them in the graph of the unimodal func- 
tion f(x); compare the ordering of the periodic points 
with that in figure 11.14. The point is that while over- 
layed on each other the longer cycles look like a hope- 
less jumble, the periodic points are clearly and logically 
ordered by the alternating binary tree. 

11.2. Generating prime cycles. Write a program that gen- 
erates all binary prime cycles up to a given finite length. 

11.3. A contracting baker's map. Consider the contracting 
(or "dissipative") baker's map defined in exercise 4.6. 
The symbolic dynamics encoding of trajectories is real- 
ized via symbols 0 (y < 1 /2) and 1 (y > 1 /2). Consider 
the observable a(x, y) - x. Verify that for any periodic 
orbit/? (ei . . .e„ p ), e, e {0, 1} 

3 "" 

a p = iZ^ 1 - 

1 1 .4. Unimodal map symbolic dynamics. Show that the 
tent map point y{S + ) with future itinerary S + is given 
by converting the sequence of s„'s into a binary number 
by the algorithm (11.9). This follows by inspection from 
the binary tree of figure 1 1.14. 

11.5. Unimodal map kneading value. Consider the 1- 
dimensional quadratic map 

f(x) = Ax{\ -x), A = 3.8. (11.25) 

(a) (easy) Plot (11.25), and the first 4-8 (whatever 
looks better) iterates of the critical point x c — \ /2. 

(b) (hard) Draw corresponding intervals of the parti- 
tion of the unit interval as levels of a Cantor set, as 
in the symbolic dynamics partition of figure 11.12. 
Note, however, that some of the intervals of fig- 
ure 11.12 do not appear in this case - they are 
pruned. 

(c) (easy) Check numerically that K — S + (x c ), knead- 
ing sequence (the itinerary of the critical point 
(11.13)) is 

K= 1011011110110111101011110111110. 

As the orbits of a chaotic map are exponentially 
unstable, so many digits seem too good to be true 
- recheck this sequence using arbitrary precision 
arithmetics. 



(d) (medium) The tent map point y(S + ) with future 
itinerary S + is given by converting the sequence of 
s„'s into a binary number by the algorithm (11.9). 
List the corresponding kneading value ( 1 1.13) se- 
quence k = y{K) to the same number of digits as 
K. 

(e) (hard) Plot the dike map, figure 11.15, with the 
same kneading sequence K as f(x). The dike map 
is obtained by slicing off all y (S + (xq)) > k, from 
the full tent map figure 11.11, see (11.14). 

How this kneading sequence is converted into a series of 
pruning rules is a dark art, relegated to sect. 15.5. 

11.6. "Golden mean" pruned map. Consider a symmetric 
tent map on the unit interval such that its highest point 
belongs to a 3-cycle: 



1 




0 0.2 0.4 0.6 0.8 1 



(a) Find the value |A| for the slope (the two different 
slopes +A just differ by a sign) where the maxi- 
mum at 1/2 is part of a 3-cycle, as depicted in the 
figure. 

(b) Show that no orbit of this map can visit the region 
x > (1 + V5)/4 more than once. Verify also that 
once an orbit exceeds x > ( V5 - l)/4, it does not 
reenter the region x < ( V5 - l)/4. 

(c) If an orbit is in the interval ( VI - l)/4 < x < 1/2, 
where will it be on the next iteration? 

(d) If the symbolic dynamics is such that for x < 1/2 
we use the symbol 0 and for x > 1/2 we use the 
symbol 1 , show that no periodic orbit will have the 
substring _00_ in it. 

(e) On a second thought, is there a periodic orbit that 
violates the above _00_ pruning rule? 
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To continue with this line of thinking, see exercise 15.7 
and exercise 19.2. See also exercise 15.6 and exer- 
cise 15.8. 



just making sure you know how to go back and forth be- 
tween spatial and temporal ordering of trajectory points. 



11.7. Binary 3-step transition matrix. Construct an [8x8] 
binary 3-step transition matrix analogous to the 2-step 
transition matrix (14.10). Convince yourself that the 
number of terms of contributing to tr T " is independent 
of the memory length, and that this [2 m x2 m ] trace is well 
defined in the infinite memory limit m — » oo. 



(a) derive (11.12) 



(b) compute the five periodic points of cycle 1001 1 



(c) compute the five periodic points of cycle 10000 

(d) (optional) plot the above two cycles on the graph 



of the full tent map. 



1 1.8. Full tent map periodic points. This exercise is easy: 



(continued in exercise 13.15) 
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Chapter 12 



Stretch, fold, prune 



1.1. Introduction to conjugacy problems for diffeomor- 
phisms. This is a survey article on the area of global anal- 
ysis defined by differentiable dynamical systems or equiv- 
alently the action (differentiable) of a Lie group G on a 
manifold M. Here Diff(M) is the group of all diffeomor- 
phisms of M and a diffeomorphism is a differentiable map 
with a differentiable inverse. (. . .) Our problem is to study 
the global structure, i.e., all of the orbits of M. 

— Stephen Smale, Differentiable Dynamical Systems 



We have learned that the Rossler attractor is very thin, but otherwise the re- 
turn maps that we found were disquieting - figure 3.3 did not appear to 
be a one-to-one map. This apparent loss of invertibility is an artifact of 
projection of higher-dimensional return maps onto their lower-dimensional sub- 
spaces. As the choice of a lower-dimensional subspace is arbitrary, the resulting 
snapshots of return maps look rather arbitrary, too. Such observations beg a ques- 
tion: Does there exist a natural, intrinsic coordinate system in which we should 
plot a return map? 

We shall argue in sect. 12.1 that the answer is yes: The intrinsic coordinates 
are given by the stable/unstable manifolds, and a return map should be plotted as 
a map from the unstable manifold back onto the immediate neighborhood of the 
unstable manifold. In chapter 5 we established that Floquet multipliers of periodic 
orbits are (local) dynamical invariants. Here we shall show that every equilibrium 
point and every periodic orbit carries with it stable and unstable manifolds which 
provide topologically invariant global foliation of the state space. They will en- 
able us to partition the state space in a dynamically invariant way, and assign 
symbolic dynamics itineraries to trajectories. 

The topology of stretching and folding fixes the relative spatial ordering of tra- 
jectories, and separates the admissible and inadmissible itineraries. We illustrate 
how this works on Henon map example 12.3. Determining which symbol se- 
quences are absent, or 'pruned' is a formidable problem when viewed in the state 
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space, [x\,X2, ...,x c i\ coordinates. It is equivalent to the problem of determining 
the location of all homoclinic tangencies, or all turning points of the Henon attrac- 
tor. They are dense on the attractor, and show no self-similar structure in the state 
space coordinates. However, in the 'danish pastry' representation of sect. 12.3 
(and the 'pruned danish,' in American vernacular, of sect. 12.4), the pruning prob- 
lem is visualized as crisply as the New York subway map; any itinerary which 
strays into the 'pruned region' is banned. 

The level is distinctly cyclist, in distinction to the pedestrian tempo of the 
preceding chapter. Skip most of this chapter unless you really need to get into 
nitty-gritty details of symbolic dynamics. 

fast track: 
chapter 13, p. 266 



12.1 Goin' global: stable/unstable manifolds 

The complexity of this figure will be striking, and I shall 
not even try to draw it. 

— H. Poincare, on his discovery of homoclinic tan- 
gles, Les methodes nouvelles de la mechanique celeste 

The Jacobian matrix / transports an infinitesimal neighborhood, its eigenvalues 
and eigen-directions describing deformation of an initial infinitesimal frame of 



x„ + OA- 
neighboring trajectories into a distorted frame time t later, as in figure4. 1 . 
Nearby trajectories separate exponentially along the unstable directions, approach 
each other along the stable directions, and creep along the marginal directions. 

The fixed point q Jacobian matrix J(x) eigenvectors (5.11) form a rectilinear 
coordinate frame in which the flow into, out of, or encircling the fixed point is 



linear in the sense of sect. 4.3. 




The continuations of the span of the local stable, unstable eigen-directions into 
global curvilinear invariant manifolds are called the stable, respectively unstable 
manifolds. They consist of all points which march into the fixed point forward, 
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respectively backward in time 

W s = [x € M : f\x) - x q -> 0 as f -» oo) 

= jxeAt:/" f (x)-^^Oasf^oo) . (12.1) 

Eigenvectors e w of the monodromy matrix J(x) play a special role - on them the 
action of the dynamics is the linear multiplication by A, (for a real eigenvector) 
along 1 -dimensional invariant curve Wf^ or spiral in/out action in a 2-D surface 
(for a complex pair). For t — > ±oo a finite segment on W? c y respectively W" e) 
converges to the linearized map eigenvector 4 C \ respectively e (e) , where (e) 
stand respectively for 'contracting,' 'expanding.' In this sense each eigenvector 
defines a (curvilinear) axis of the stable, respectively unstable manifold. 

Actual construction of these manifolds is the converse of their definition (12. 1): 
one starts with an arbitrarily small segment of a fixed point eigenvector and lets 
evolution stretch it into a finite segment of the associated manifold. As a periodic 
point x on cycle p is a fixed point of f Tp {x), the fixed point discussion that follows 
applies equally well to equilibria and periodic orbits. 

Expanding real and positive Floquet multiplier. Consider ith expanding eigen- 
value, eigenvector pair (A,-, e®) computed from J = J p (x) evaluated at a fixed 
point x, 

J(x)e (i \x) = A,e (i) (x) , x e M p , A, > 1 . (12.2) 

Take an infinitesimal eigenvector e w (x), ||e w (x)|| = s <§: 1, and its return A;e®(x) 
after one period T p . Sprinkle the straight interval between [e, A/s] c W?* with a 
large number of points x- ', for example equidistantly spaced on logarithmic scale 
between lne and In A, + lne. The successive returns of these points f p {x^), 
f 2Tp {x (k) ), ■ ■ ■, f" lT p{x {k) ) trace out the Id curve W]L within the unstable manifold. 
As separations between points tend to grow exponentially, every so often one 
needs to interpolate new stalling points between the rarified ones. Repeat for 
-e w (x). 

Contracting real and positive Floquet multiplier. Reverse the action of the 
map backwards in time. This turns a contracting direction into an expanding one, 
tracing out the curvilinear stable manifold as a continuation of e^. 

Expanding/contracting real negative Floquet multiplier. As above, but every 
even iterate f 2Tp (x^), f 4Tp (x^), / 6rp (x ( ^) continues in the direction e (,) , every 
odd one in the direction -e (l) . 

Complex Floquet multiplier pair, expanding/contracting. The complex Flo- 
quet multiplier pair {Ay, Ay+i = A*} has Floquet exponents (4.7) of form = 
ffi' ± iuft), with the sign of // k ^ + 0 determining whether the linear neighbor- 
hood is out / in spiralling. The orthogonal pair of real eigenvectors {Re ^ , Im e (7) } 
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Figure 12.1: A 2d unstable manifold obtained by 
continuation from the linearized neighborhood of a 
complex eigenvalue pair of an unstable equilibrium 
of plane Couette flow, a projection from a 61,506- 
dimensional state space ODE truncation of the (co- 
dimensional) Navier-Stokes PDE. (J.E Gibson, 8 
Nov. 2005 blog entry [61]) 



spans a plane. T = In/aft^ is the time of one turn of the spiral, / r Ree ( ^(X) = 
|A ; |Re t^\x) . As in the real cases above, sprinkle the straight interval between 
[e, \ Aj\e] along Ree^(jE) with a large number of points x^ k \ The flow will now 
trace out the 2d invariant manifold as an out / in spiralling strip. Two low- 
dimensional examples are the unstable manifolds of the Lorenz flow, figure 1 1.8 (a), 
and the Rossler flow, figure 11.10(a). For a highly non-trivial example, see fig- 
ure 12.1. 

The unstable manifolds of a flow are 4,-dimensional. Taken together with the 
marginally stable direction along the flow, they are rather hard to visualize. A 
more insightful visualization is offered by (d- l)-dimensional Poincare sections 
(3.2) with the marginal flow direction eliminated (see also sect. 3.1.2). Stable, 
unstable manifolds for maps are defined by 

W s = [xeP: P n {x) - x q -» 0 as n -» oo) 

W u = {xef>: P-"( X ) - x q ^ 0 as n -> oo) , (12.3) 

where P(x) is the (<i-l)-dimensional return map (3.1). In what follows, all invari- 
ant manifolds W", W s will be restricted to their Poincare sections W", W s . 

Example 12.1 A section at a fixed point with a complex Floquet multiplier pair: 

(continued from example 3. 1 ) The simplest choice of a Poincare section for a fixed (or 
periodic) point x q with a complex Floquet multiplier pair is the plane V specified by the 
fixed point (located at the tip of the vector x g ) and the eigenvector Im e (A) perpendicular 
to the plane. A point x is in the section V if it satisfies the condition 

(i-i,)-Ime (i) =0. (12.4) 

In the neighborhood of x q the spiral out/in motion is in the {Ree (t) ,Ime w } plane, and 
thus guaranteed to be cut by the Poincare section V normal to e (A) . 

In general the full state space eigenvectors do not lie in a Poincare section; the 
eigenvectors e (7) tangent to the section are given by (5.22). Furthermore, while in 
the linear neighborhood of fixed point x the trajectories return with approximate 
periodicity T p , this is not the case for the globally continued manifolds; t(x), or 
the first return times (3.1) differ, and the W"., restricted to the Poincare section is 
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obtained by continuing trajectories of the points from the full state space curve 
Wfa to the section V. 

For long times the unstable manifolds wander throughout the connected er- 
godic component, and are no more informative than an ergodic trajectory. For 
example, the line with equitemporal knots in figure 12.1 starts out on a smoothly 
curved neighborhood of the equilibrium, but after a 'turbulent' episode decays 
into an attractive equilibrium point. The trick is to stop continuing an invariant 
manifold while the going is still good. 



fast track: 
sect. 12.2, p. 247 

Learning where to stop is a bit of a technical exercise, the reader might prefer 
to skip next section on the first reading. 




12.1.1 Parametrization of invariant manifolds 



As the flow is nonlinear, there is no 'natural' linear basis to represent it. 
Wistful hopes like 'POD modes,' 'Karhunen-Loeve,' and other linear changes of 
bases do not cut it. The invariant manifolds are curved, and their coordinatizations 
are of necessity curvilinear, just as the maps of our globe are, but infinitely foliated 
and thus much harder to chart. 

Let us illustrate this by parameterizing a Id slice of an unstable manifold by its 
arclength. Sprinkle evenly points {x^ l \ x (2 \ ■■■ , x {N ~ 1 ^} between the equilibrium 
point x q = x^ and point x = x (N \ along the Id unstable manifold continuation 
x (k) e wu G f the unstable e (;) eigendirection (we shall omit the eigendirection 

label rj) in what follows). Then the arclength from equilibrium point x C{ = x^ to 
x = x (N ^ is given by 



s 2 = lim f gtJ djPtbP , dJP = *< fc) - x (k - l) . (12.5) 



k=l 



For the lack of a better idea (perhaps the dynamically determined g = f J would 
be a more natural metric?) let us measure arclength in the Euclidian metric, gj = 

djj, so 



s = lim 

N— >oo 



EM 

k=i 



\l/2 

2 



(12.6) 



By definition f (x) {x) € so f'(x) induces a Id map s(s 0 ,t) - s(f T( - x °\x 0 )). 
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Turning points are points on the unstable manifold for which the local un- 
stable manifold curvature diverges for forward iterates of the map, i.e., points at 
which the manifold folds back onto itself arbitrarily sharply. For our purposes, 
approximate turning points suffice. The Id curve W"^ starts out linear at x q , then 
gently curves until -under the influence of other unstable equilibria and/or peri- 
odic orbits- it folds back sharply at 'turning points' and then nearly retraces itself. 
This is likely to happen if there is only one unstable direction, as we saw in the 
Rtissler attractor example 11.3, but if there are several, the 'turning point' might 
get stretched out in the non-leading expanding directions. 

The trick is to figure out a good base segment to the nearest turning point 
L = [0,Sb], and after the foldback assign to s(x, t) > % the nearest point s on 
the base segment. If the stable manifold contraction is strong, the 2nd coordinate 
connecting s(x, t) —> s can be neglected. We saw in example 1 1.3 how this works. 
You might, by nature and temperament, take the dark view: Rossler has helpful 
properties, namely insanely strong contraction along a 1-dimensional stable direc- 
tion, that are not present in real problems, such as turbulence in a plane Couette 
flow, and thus the lessons of chapter 11 of no use when it comes to real plumb- 
ing. For this reason, both of the training examples to come, the billiards and the 
Henon map are of Hamiltonian, phase-space preserving type, and thus as far from 
being insanely contracting as possible. Yet, to a thoughtful reader, they unfold 
themselves as pages of a book. 

Assign to each cf-dimensional point x e Lq a. coordinate s = s(x) whose value 
is the Euclidean arclength (12.5) to x q measured along the 1-dimensional Pq sec- 
tion of the x q unstable manifold. Next, for a nearby point xq £ L q determine 
the point x.\ e L q which minimizes the Euclidean distance (xo - x,\) 2 , and as- 
sign arc length coordinate value sq = s(x\) to xq. In this way, an approximate 
1-dimensional intrinsic coordinate system is built along the unstable manifold. 
This parametrization is useful if the non-wandering set is sufficiently thin that its 
perpendicular extent can be neglected, with every point on the non-wandering set 
assigned the nearest point on the base segment Lq. 

Armed with this intrinsic curvilinear coordinate parametrization, we are now 
in a position to construct a 1-dimensional model of the dynamics on the non- 
wandering set. If x„ is the nth Poincare section of a trajectory in neighborhood of 
x q , and s n is the corresponding curvilinear coordinate, then = / T "(5 n ) models 
the full state space dynamics x n —> x n +\. We approximate f(s n ) by a smooth, 
continuous 1-dimensional map / : Lq — > L q by taking x. n e L q , and assigning to 
x n+ \ the nearest base segment point s n +\ - s(x n +\). 



12.2 Horseshoes 

If you find yourself mystified by Smale's article abstract quoted on page 247, 
about 'the action (differentiable) of a Lie group G on a manifold M,' time has 
come to bring Smale to everyman. If you still remain mystified by the end of 
this chapter, reading chapter 16 might help; for example, the Liouville operators 
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Figure 12.2: Binary labeling of trajectories of the 
symmetric 3-disk pinball; a bounce in which the tra- 
jectory returns to the preceding disk is labeled 0, and a 
bounce which results in continuation to the third disk 
is labeled 1 . 




form a Lie group of symplectic, or canonical transformations acting on the (p, q) 
manifold. 



If a flow is locally unstable but globally bounded, any open ball of initial 
points will be stretched out and then folded. An example is a 3-dimensional in- 
vertible flow sketched in figure 11.10 which returns a Poincare section of the flow 
folded into a 'horseshoe' (we shall belabor this in figure 12.4). We now offer two 
examples of locally unstable but globally bounded flows which return an initial 
area stretched and folded into a 'horseshoe,' such that the initial area is inter- 
sected at most twice. We shall refer to such mappings with at most 2' transverse 
self-intersections at the nth iteration as the once-folding maps. 

The first example is the 3-disk game of pinball figure 11.5, which, for suf- 
ficiently separated disks (see figure 11.6), is an example of a complete Smale 
horseshoe. We start by exploiting its symmetry to simplify it, and then partition 
its state space by its stable / unstable manifolds. 



Example 12.2 Recoding 3-disk dynamics in binary. (continued from exam- 
ple 11.2) The J{ = {1,2,3} symbolic dynamics for 3-disk system is neither unique, 
nor necessarily the smartest one - before proceeding it pays to quotient the symme- 
tries of the dynamics in order to obtain a more efficient description. We do this in a 
quick way here, and redo it in more detail in sect. 12.5. 

As the three disks are equidistantly spaced, the disk labels are arbitrary; what 
is important is how a trajectory evolves as it hits subsequent disks, not what label the 
starting disk had. We exploit this symmetry by recoding, in this case replacing the 
absolute disk labels by relative symbols, indicating the type of the collision. For the 3- 
disk game of pinball there are two topological^ distinct kinds of collisions, figure 12.2: 



0 : pinball returns to the disk it came from 

1 : pinball continues to the third disk . 



exercise 11.1 
exercise 12.6 
(12.7) 



In the binary recoding of the 3-disk symbolic dynamics the prohibition of self-bounces 
is automatic. If the disks are sufficiently far apart there are no further restrictions on 
symbols, the symbolic dynamics is complete, and all binary sequences (see table 15.1) 
are admissible. 

exercise 1 1 .2 

It is intuitively clear that as we go backward in time (reverse the velocity vec- 
tor), we also need increasingly precise specification of x 0 = {so, po) in order to follow a 
given past itinerary. Another way to look at the survivors after two bounces is to plot 
M Sl . S2 , the intersection of M. Sl with the strips M Sl . obtained by time reversal (the ve- 
locity changes sign sin 4> -> -sin^). M Sl . S2 , figure 12.3(a), is a 'rectangle' of nearby 
trajectories which have arrived from disk si and are heading for disk s 2 . (continued 
in example 12.6) 
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Figure 12.3: The 3-disk game of pinball of fig- 
ure 11.5, generated by starting from disk 1, pre- 
ceded by disk 2, coded in binary, as in figure 12.2. 

(a) Strips M Sj which have survived a bounce in 
the past and will survive a bounce in the future. 

(b) Iteration corresponds to the decimal point shift; 
for example, all points in the rectangle [1.01] map 
into the rectangles [0.10], [0.11] in one iteration. 




The 3-disk repeller does not really look like a 'horseshoe;' the 'fold' is cut 
out of the picture by allowing the pinballs that fly between the disks to fall off the 
table and escape. Next example captures the 'stretch & fold' horseshoe dynamics 
of return maps such as Rtissler's, figure 3.2. 



Example 12.3 A Henon repeller complete horseshoe: 

pie 3.6) Consider 2-dimensional Henon map 

(x„+i,y n +i) = (1 - ax 2 n + by n , x n ) . 



(continued from exam- 

exercise 3.5 
(12.8) 



If you start with a small ball of initial points centered around the fixed point xq, and 
iterate the map, the ball will be stretched and squashed along the unstable manifold 
W'n. Iterated backward in time, 



(x n -i,y n -i) = {y„,-b '(1 -ay 2 n -x n )), 



(12.9) 



this small ball of initial points traces out the stable manifold W 0 9 . Their intersections 
enclose the region M. , figure 12.4(a). Any point outside W£ border of M. escapes 
to infinity forward in time, while -by time reversal- any point outside W£ border arrives 
from infinity back in paste. In this way the unstable - stable manifolds define topologi- 
cally invariant and optimal initial region M . ; all orbits that stay confined for all times are 
confined to M. . 

The Henon map models qualitatively the Poincare section return map of fig- 
ure 11.10. For b = 0 the Henon map reduces to the parabola (11.3), and, as shown in 
sects. 3.3 and 29.1, for b + 0 it is kind of a fattened parabola; by construction, it takes 
a rectangular initial area and returns it bent as a horseshoe. Parameter a controls the 
amount of stretching, while the parameter b controls the amount of compression of the 
folded horseshoe. For definitiveness, fix the parameter values to a = 6, b = -1; the 
map is then strongly stretching but area preserving, the furthest away from the strongly 
dissipative examples discussed in sect. 11.2. The map is quadratic, so it has 2 fixed 
points xo = f(x 0 ), x\ = f(x\) indicated in figure 12.4(a). For the parameter values at 
hand, they are both unstable. 

Iterated one step forward, the region M. is stretched and folded into a Smale 
horseshoe drawn in figure 12.4 (b). Label the two forward intersections f(M.) n At. by 
M s ., with s e {0, 1). The horseshoe consists of the two strips Mo., M\. , and the bent 
segment that lies entirely outside the line. As all points in this segment escape to 
infinity under forward iteration, this region can safely be cut out and thrown away. 
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Figure 12.4: The Henon map (12.8) for a = 6, b = 
- 1 : fixed point 0 with segments of its stable, unsta- 
ble manifolds W\ W", and fixed point 1. (a) Their 
intersection bounds the region At = OBCD which 
contains the non-wandering set CI. (b) The inter- 
section of the forward image /(At) with At. con- 
sists of two (future) strips Ato., Ati., with points 
BCD brought closer to fixed point 0 by the sta- 
ble manifold contraction, (c) The intersection of 
the forward image /(At.) with the backward back- 
ward /"'(At.) is a four-region cover of Q. (d) The 
intersection of the twice-folded forward horseshoe 
/ 2 (At) with backward horseshoe / -1 (At.). (e) 
The intersection of / 2 (At.) with /~ 2 (At.) is a 16- 
region cover of Q.. Iteration yields the complete 
Smale horseshoe non-wandering set Q., i.e., the 
union of all non-wandering points of /, with ev- 
ery forward fold intersecting every backward fold. 
(P. Cvitanovic and Y. Matsuoka) 
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Iterated one step backwards, the region At. is again stretched and folded into 
a horseshoe, figure 12.4 (c). As stability and instability are interchanged under time 
reversal, this horseshoe is transverse to the forward one. Again the points in the horse- 
shoe bend wander off to infinity as n -> -00, and we are left with the two (past) strips 
M.o, Af.i ■ Iterating two steps forward we obtain the four strips Mu., M01., Atoa.Atio., 
and iterating backwards we obtain the four strips M.oo,Mm, M.u,M.w transverse to 
the forward ones just as for 3-disk pinball game figure 12.2. Iterating three steps for- 
ward we get an 8 strips, and so on ad infinitum, (continued in example 12.4) 

What is the significance of the subscript such as on which labels the At.011 
future strip? The two strips M.q,M.\ partition the state space into two regions 
labeled by the two-letter alphabet = {0, 1}. S + = .011 is the future itinerary 
for all x e At.011 - Likewise, for the past strips all x € M s _ m ... s _ lSo . have the past 
itinerary S~ - S- m ••• S-\So . Which partition we use to present pictorially the 
regions that do not escape in m iterations is a matter of taste, as the backward 
strips are the preimages of the forward ones 

Mo.=/(Mo), Ali. =/(Al.i). 

Q, the non-wandering set (2.3) of At., is the union of all points whose forward 
and backward trajectories remain trapped for all time, given by the intersections 
of all images and preimages of At: 

Q. = L : x e lim /'"(At.) H /""(M)) ■ (12.10) 

\ m,n— »oo I I J 

Two important properties of the Smale horseshoe are that it has a complete 
binary symbolic dynamics and that it is structurally stable. 

For a complete Smale horseshoe every forward fold /'(At) intersects transver- 
sally every backward fold f~"'(M), so a unique bi-infinite binary sequence can be 
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A B 




a 




A 



Figure 12.5: Kneading orientation preserving danish 
pastry: mimic the horsheshoe dynamics of figure 12.6 
by: (1) squash the unit square by factor 1/2, (2) stretch 
it by factor 2, and (3) fold the right half back over the 
left half. 



associated to every element of the non-wandering set. A point x e Q is labeled 
by the intersection of its past and future itineraries S(x) = • • • S-2S-1S0.S1S2 ■ ■ •, 
where s n — s if f n {x) e M. s , s e {0, 1} and n e Z. 

The system is said to be structurally stable if all intersections of forward and 
backward iterates of M remain transverse for sufficiently small perturbations / — > 
/ + 6 of the flow, for example, for slight displacements of the disks in the pinball 
problem, or sufficiently small variations of the Henon map parameters a, b. While 
structural stability is exceedingly desirable, it is also exceedingly rare. About this, 
more later. 




section 1 .8 
section 25.2 



12.3 Symbol plane 



Consider a system for which you have succeeded in constructing a covering sym- 
bolic dynamics, such as a well-separated 3-disk system. Now start moving the 
disks toward each other. At some critical separation a disk will start blocking 
families of trajectories traversing the other two disks. The order in which trajec- 
tories disappear is determined by their relative ordering in space; the ones closest 
to the intervening disk will be pruned first. Determining inadmissible itineraries 
requires that we relate the spatial ordering of trajectories to their time ordered 
itineraries. 

So far we have rules that, given a state space partition, generate a temporally 
ordered itinerary for a given trajectory. Our next task is the converse: given a 
set of itineraries, what is the spatial ordering of corresponding points along the 
trajectories? In answering this question we will be aided by Smale's visualization 
of the relation between the topology of a flow and its symbolic dynamics by means 
of 'horseshoes,' such as figure 12.4. 



exercise 12.7 



12.3.1 Kneading danish pastry 

The danish pastry transformation, the simplest baker's transformation appropriate 
to Henon type mappings, yields a binary coordinatization of all possible periodic 
points. 
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Figure 12.6: The dynamics maps two (past) strips 
strips M.o, M.i into two (future) strips Mo., Mi.. 
The corners are labeled to aid visualization. Note 
that the BCGH strip is rotated by 180 degrees. (P. 
Cvitanovic and Y. Matsuoka) 



(e) 



o 




The symbolic dynamics of once-folding map is given by the danish pastry 
transformation. This generates both the longitudinal and transverse alternating 
binary tree. The longitudinal coordinate is given by the head of a symbolic se- 
quence; the transverse coordinate is given by the tail of the symbolic sequence. 
The dynamics on this space is given by symbol shift permutations; volume pre- 
serving, with 2 expansion and 1/2 contraction. 

For a better visualization of 2-dimensional non-wandering sets, fatten the in- 
tersection regions until they completely cover a unit square, as in figure 12.7. 
We shall refer to such a 'map' of the topology of a given 'stretch & fold' dynam- 
ical system as the symbol square. The symbol square is a topologically accurate 
representation of the non-wandering set and serves as a street map for labeling its 
pieces. Finite memory of m steps and finite foresight of n steps partitions the sym- 
bol square into rectangles [s- m+ i • • • sq.s\s 2 ■ ■ ■ s n ], such as those of figure 12.6. In 
the binary dynamics symbol square the size of such rectangle is T m x 2~" ; it cor- 
responds to a region of the dynamical state space which contains all points that 
share common n future and m past symbols. This region maps in a nontrivial way 
in the state space, but in the symbol square its dynamics is exceedingly simple; all 
of its points are mapped by the decimal point shift (1 1 .20) 

Cr(- • • S- 2 S-iS 0 .SiS 2 S 3 ■ • •) = • • • S-2S-lS 0 Sl.S 2 S3 ■ ■ ■ , (12.11) 



Example 12.4 A Henon repeller subshift: (continued from example 12.3) The 
Henon map acts on the binary partition as a shift map. Figure 12.6 illustrates ac- 
tion /(Mo) = Mo.. The square [01.01] gets mapped into the rectangles <x[01.01] = 
[10.1] = {[10.10], [10.11]), see figure 12.4(e). Further examples can be gleaned from 
figure 12.4. 

As the horseshoe mapping is a simple repetitive operation, we expect a simple 
relation between the symbolic dynamics labeling of the horseshoe strips, and their 
relative placement. The symbol square points y(S + ) with future itinerary S + are 
constructed by converting the sequence of Sn 's into a binary number by the algo- 
rithm (11.9). This follows by inspection from figure 12.9. In order to understand 
this relation between the topology of horseshoes and their symbolic dynamics, it 
might be helpful to backtrace to sect. 11.4 and work through and understand first 
the symbolic dynamics of 1 -dimensional unimodal mappings. 
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Figure 12.7: Kneading danish pastry: symbol 
square representation of an orientation preserving 
once-folding map obtained by fattening the Smale 
horseshoe intersections of (a) figure 12.6 (b) fig- 
ure 12.4 into a unit square. Also indicated: the 
fixed points 0, 1 and the 2-cycle points (01,10). In 
the symbol square the dynamics maps rectangles 
into rectangles by a decimal point shift. 
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Figure 12.8: Kneading orientation preserving 
danish pastry: symbol square representation of an 
orientation preserving once-folding map obtained 
by fattening the intersections of two forward iter- 
ates / two backward iterates of Smale horseshoe 
into a unit square. 




Under backward iteration the roles of 0 and 1 symbols are interchanged; At 1 
has the same orientation as At, while At^ 1 has the opposite orientation. We assign 
to an orientation preserving once-folding map the past topological coordinate 
6 = S(S~) by the algorithm: 

/ w n if s n = 0 
W "~ l = \l-w n if ,, = 1 ' WQ = S » 

CO 

5{S') = 0.w 0 W-iW-2 • • • = J^Wi-n/2". (12.12) 

17=1 



exercise 12.4 



Such formulas are best derived by solitary contemplation of the action of a folding 
map, in the same way we derived the future topological coordinate (11.9). 

The coordinate pair (6,y) associates a point (x,y) in the state space Cantor 
set of figure 12.4 to a point in the symbol square of figure 12.9, preserving the 



Figure 12.9: Kneading danish pastry: symbol square 
representation of an orientation preserving once- 
folding map obtained by fattening the Smale horse- 
shoe intersections of figure 12.4(e) into a unit square. 
Also indicated: the fixed points 0, 1, and the 3-cycle 
points (011,110,101). In the symbol square the dynam- 
ics maps rectangles into rectangles by a decimal point 
shift. 
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topological ordering. The symbol square [6, y] serves as a topologically faithful 
representation of the non-wandering set of any once-folding map, and aids us in 
partitioning the set and ordering the partitions for any flow of this type. 

fast track: 
chapter 13, p. 266 

12.4 Prune danish 

Anyone know where I can get a good prune danish in 
Charlotte? I mean a real NY Jewish bakery kind of prune 
danish! 

— Googled 

In general, not all possible symbol sequences are realized as physical trajectories. 
Trying to get from 'here' to 'there' we might find that a short path is excluded by 
some obstacle, such as a disk that blocks the path, or a potential ridge. In order to 
enumerate orbits correctly, we need to prune the inadmissible symbol sequences, 
i.e., describe the grammar of the admissible itineraries. 

The complete Smale horseshoe dynamics discussed so far is rather straight- 
forward, and sets the stage for situations that resembles more the real life. A 
generic once-folding map does not yield a complete horseshoe; some of the horse- 
shoe pieces might be pruned, i.e., not realized for particular parameter values of 
the mapping. In 1 dimension, the criterion for whether a given symbolic sequence 
is realized by a given unimodal map is easily formulated; any orbit that strays 
to the right of the value computable from the kneading sequence (the orbit of 
the critical point (11.13)) is pruned. This is a topological statement, indepen- 
dent of a particular unimodal map. Our objective is to generalize this notion to 
2-dimensional once-folding maps. 

Adjust the parameters of a once-folding map so that the intersection of the 
backward and forward folds is still transverse, but no longer complete, as in fig- 
ure 12.10(a). The utility of the symbol square lies in the fact that the surviving, 
admissible itineraries still maintain the same relative spatial ordering as for the 
complete case. 

In the example of figure 12.10 the rectangles [10.1], [11.1] have been pruned, 
and consequently any trajectory containing blocks h\ - 101, hi — 111 is pruned, 
the symbol dynamics is a subshift of finite type (11.24). We refer to the border 
of this primary pruned region as the pruning front; another example of a pruning 
front is drawn in figure 12. 1 1 (b). We call it a 'front' as it can be visualized as a 
border between admissible and inadmissible; any trajectory whose points would 
fall to the right of the front in figure 12.11 is inadmissible, i.e., pruned. The 
pruning front is a complete description of the symbolic dynamics of once-folding 
maps.For now we need this only as a concrete illustration of how pruning rules 
arise. 
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Figure 12.10: (a) An incomplete Smale horse- 
shoe: the inner forward fold does not intersect the 
outer backward fold, (b) The primary pruned re- 
gion in the symbol square and the corresponding 
forbidden binary blocks. 
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Figure 12.11: (a) An incomplete Smale horseshoe 
which illustrates (b) the monotonicity of the prun- 
ing front: the thick line which delineates the left 
border of the primary pruned region is monotone 
on each half of the symbol square. The backward 
folding in this figure and figure 12.10 is schematic 
- in invertible mappings there are further miss- 
ing intersections, all obtained by the forward and 
backward iterations of the primary pruned region. 
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In the example at hand there are total of two forbidden blocks 101, 111, so 
For now we concentrate on this kind of pruning because it is particularly clean 
and simple. 



fast track: 
chapter 13, p. 266 

Though a useful tool, Markov partitioning is not without drawbacks. One glar- 
ing shortcoming is that Markov partitions are not unique: any of many different 
partitions might do the job. The Cj- and D3- equivariant systems that we discuss 
next offers a simple illustration of different Markov partitioning strategies for the 
same dynamical system. 




12.5 Recoding, symmetries, tilings 

In chapter 9 we made a claim that if there is a symmetry of dynamics, we 
must use it. Here we shall show how to use it, on two concrete examples, and in 
chapter 21 we shall be handsomely rewarded for our labors. First, the simplest 
example of equivariance, a single 'reflection' Ci group of example 9.16. 




Example 12.5 C2 recoded: Assume that each orbit is uniquely labeled by, an c 
infinite string {s,}, s, e {+, -} and that the dynamics is Cj-equivariant under the + <-> - 
interchange. Periodic orbits separate into two classes, the self-dual configurations +-, 

+ + — , + + + , + — + - + +-, ■ ■ ■, with multiplicity m p = 1, and the pairs +,-,+ + -, 

— +,■■■, with multiplicity m p = 2. For example, as there is no absolute distinction 
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Table 12.1: Correspondence between the C2 symmetry reduced cycles p and the full state 
space periodic orbits p, together with their multiplicities m p . Also listed are the two 
shortest cycles (length 6) related by time reversal, but distinct under C2- 



P P 

1 + 2 

0 -+ 1 

01 -- ++ 1 
001 - + + 2 

Oil +++ 1 

0001 - + --+-++ 1 

0011 - + ++ 2 

0111 + + ++ 1 

00001 - + - + - 2 

00011 - + +- + ++ 1 

00101 - + + --+--++ 1 

00111 - + +- + ++ 1 

01011 -- + + + 2 

01111 + + + ++ 1 

001011 - + + +-- + ++ 1 

001101 - + + + --+ ++ 1 



between the 'left' or the 'right' lobe of the Lorenz attractor, figure 3.4(a), the Floquet 
multipliers satisfy A + = A_, A ++ _ = A + __, and so on. 

GX6TCIS0 L- 1 .0 

The symmetry reduced labeling p; e {0, 1} is related to the full state space 
labeling s/ e {+, -} by 

If Si — Si-i then pi — 1 

If si + st-i then pi = 0 (12.13) 

For example, the fixed point + = ■■■ + + + + ■■■ maps into ■ • • 11 1 • • • = 1, and so does 
the fixed point -. The 2-cycle ~ = ■■•- + - + ■■■ maps into fixed point ■ ■ ■ 000 ■ ■ ■ = 0, 

and the 4-cycle - + h — = • • ■ n n — maps into 2-cycle ■ ■ ■ 0101 • • • = 01. A 

list of such reductions is given in table 12.1. 



Next, let us take the old pinball game and 'quotient' the state space by the 
symmetry, or 'desymmetrize.' As the three disks are equidistantly spaced, our 
game of pinball has a sixfold symmetry. For instance, the cycles 12, 23, and 13 in 
figure 12.12 are related to each other by rotation by ±2n/3 or, equivalently, by a 
relabeling of the disks. We exploit this symmetry by recoding, as in (12.7). 

exercise 11.1 
exercise 12.6 

Example 12.6 Recoding ternary symbolic dynamics in binary: Given a ternary 
sequence and labels of 2 preceding disks, rule ( 12. 7) fixes the subsequent binary sym- 
bols. Here we list an arbitrary ternary itinerary, and the corresponding binary sequence: 

ternary : 3121312321231323 

binary : -10101101011010 (12.14) 

The first 2 disks initialize the trajectory and its direction; 3 h 1 h 2 h Due to 
the 3-disk symmetry the six distinct 3-disk sequences initialized by 12, 13, 21, 23, 31, 
32 respectively have the same weights, the same size state space partitions, and are 
coded by a single binary sequence, (continued in example 12. 7) 
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Figure 12.12: The 3-disk game of pinball with the 
disk radius : center separation ratio a:R = 1:2.5. 
(a) 2-cycles 12, 13, 23, and 3-cycles 123 and 132 
(not drawn), (b) The fundamental domain, i.e., the 
small l/6th wedge indicated in (a), consisting of a 
section of a disk, two segments of symmetry axes 
acting as straight mirror walls, and an escape gap. 
The above five cycles restricted to the fundamental 
domain are the two fixed points 0, 1 . See figure 9.3 
for cycle 10 and further examples. 



(a) 




exercise 12.7 
exercise 14.2 



Binary symbolic dynamics has two immediate advantages over the ternary 
one; the prohibition of self-bounces is automatic, and the coding utilizes the sym- 



The 3-disk game of pinball is tiled by six copies of the fundamental domain, a 
one-sixth slice of the full 3-disk system, with the symmetry axes acting as reflect- 
ing mirrors, see figure 12. 12 (b). Every global 3-disk trajectory has a correspond- 
ing fundamental domain mirror trajectory obtained by replacing every crossing 
of a symmetry axis by a reflection. Depending on the symmetry of the full state 
space trajectory, a repeating binary alphabet block corresponds either to the full 
periodic orbit or to a relative periodic orbit (examples are shown in figure 12.12 
and table 12.2). A relative periodic orbit corresponds to a periodic orbit in the 
fundamental domain. 

Table 12.2 lists some of the shortest binary periodic orbits, together with the 
corresponding full 3-disk symbol sequences and orbit symmetries. For a number 
of deep reasons that will be elucidated in chapter 2 1 , life is much simpler in the 
fundamental domain than in the full system, so whenever possible our computa- 
tions will be carried out in the fundamental domain. 

Example 12.7 D 3 recoded - 3-disk game of pinball: (continued from exam- 
ple 12.6) The D 3 recoding can be worked out by a glance at figure 12.12(a) (con- 
tinuation of example 9. 1 7). For the symmetric 3-disk game of pinball the fundamental 
domain is bounded by a disk segment and the two adjacent sections of the symme- 
try axes that act as mirrors (see figure 12.12(b)). The three symmetry axes divide 
the space into six copies of the fundamental domain. Any trajectory on the full space 
can be pieced together from bounces in the fundamental domain, with symmetry axes 
replaced by flat mirror reflections. The binary {0, 1) reduction of the ternary three disk 
{1,2, 3} labels has a simple geometric interpretation, figure 12.2: a collision of type 0 re- 
flects the projectile to the disk it comes from (back-scatter^whereas after a collision of 
type 1 projectile continuesjo the third disk. For example, 23 = • • ■ 232323 • ■ ■ maps into 
... 000 •■ ^0 (and so do 12 and 13), 123 = ••• 12312- •• maps into ■■■ 111 •• ■ = I (and 
so does 132), and so forth. Such reductions for short cycles are given in table 12.2, 
figure 12. 12 and figure 9. 7. 



metry of the 3-disk pinball game in an elegant manner. 



exercise 1 1 .2 
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Table 12.2: D3 correspondence between the binary labeled fundamental domain prime 
cycles p and the full 3-disk ternary labeled cycles p, together with the D 3 transformation 
that maps the end point of the p cycle into the irreducible segment of the p cycle, see 
example 9.1. White spaces in the above ternary sequences mark repeats of the irreducible 
segment; for example, the full space 12-cycle 121231312323 consists of 1212 and its 
symmetry related segments 3131, 2323. The multiplicity of p cycle is m p = 6np/n p . 
The shortest pair of fundamental domain cycles related by time reversal (but no spatial 
symmetry) are the 6-cycles 00101 1 and 001 101. 
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Resume 



In the preceding and this chapter we start with a J-dimensional state space and 
end with a 1 -dimensional return map description of the dynamics. The arc-length 
parametrization of the unstable manifold maintains the 1-to-l relation of the full 
J-dimensional state space dynamics and its 1 -dimensional return-map representa- 
tion. To high accuracy no information about the flow is lost by its 1 -dimensional 
return map description. We explain why Lorenz equilibria are heteroclinically 
connected (it is not due to the symmetry), and how to generate all periodic orbits 
of Lorenz flow up to given length. This we do, in contrast to the rest of the thesis, 
without any group-theoretical jargon to blind you with. 

For 1 -dimensional maps the folding point is the critical point, and easy to 
determine. In higher dimensions, the situation is not so clear - one can attempt 
to determine the (fractal set of) folding points by looking at their higher iterates 
- due to the contraction along stable manifolds, the fold gets to be exponentially 
sharper at each iterate. In practice this set is essentially uncontrollable for the 
same reason the flow itself is chaotic - exponential growth of errors. We prefer to 
determine a folding point by bracketing it by longer and longer cycles which can 
be determined accurately using variational methods of chapter 29, irrespective of 
their period. 

For a generic dynamical system a subshift of finite type is the exception rather 
than the rule. Its symbolic dynamics can be arbitrarily complex; even for the lo- 
gistic map the grammar is finite only for special parameter values. Only some 
repelling sets (like our game of pinball) and a few purely mathematical constructs 
(called Anosov flows) are structurally stable - for most systems of interest an 



smale - 19apr2009 



ChaosBook.org veraionl4.4.1, Apr21 2013 



CHAPTER 12. STRETCH, FOLD, PRUNE 



259 



infinitesimal perturbation of the flow destroys and/or creates an infinity of trajec- 
tories, and specification of the grammar requires determination of pruning blocks 
of arbitrary length. The repercussions are dramatic and counterintuitive; for ex- 
ample, the transport coefficients such as the deterministic diffusion constant of 
sect. 25.2 are emphatically not smooth functions of the system parameters. The 
importance of symbolic dynamics is often under appreciated; as we shall see in 
chapters 20 and 23, the existence of a finite grammar is the crucial prerequisite for 
construction of zeta functions with nice analyticity properties. This generic lack 
of structural stability is what makes nonlinear dynamics so hard. 

The conceptually simpler finite subshift Smale horseshoes suffice to motivate 
most of the key concepts that we shall need for time being. Our strategy is akin 
to bounding a real number by a sequence of rational approximants; we converge 
toward the non-wandering set under investigation by a sequence of self-similar 
Cantor sets. The rule that everything to one side of the pruning front is forbid- 
den is striking in its simplicity: instead of pruning a Cantor set embedded within 
some larger Cantor set, the pruning front cleanly cuts out a compact region in the 
symbol square, and that is all - there are no additional pruning rules. A 'self- 
similar' Cantor set (in the sense in which we use the word here) is a Cantor set 
equipped with a subshift of finite type symbol dynamics, i.e., the corresponding 
grammar can be stated as a finite number of pruning rules, each forbidding a finite 
subsequence s\S2 . . . s n - Here the notation s\S2 ■ ■ ■ s n _ stands for n consecutive 
symbols s\l, S2, . . -, s n , preceded and followed by arbitrary symbol strings. 

The symbol square is a useful tool in transforming topological pruning into 
pruning rules for inadmissible sequences; those are implemented by constructing 
transition matrices and/or graphs, see chapters 14 and 15. 



Commentary 

Remark 12.1 Stable/unstable manifolds. For pretty hand-drawn pictures of invariant 
manifolds, see Abraham and Shaw [12]. Consnuction of invariant manifolds by map 
iteration is described in Simo [34]. Fixed point stable / unstable manifolds and their 
homoclinic and heteroclinic intersections can be computed using DsTool [58, 59, 60]. 
Unstable manifold turning points were utilized in refs. [12, 2, 3, 31, 32, 33] to partition 
state space and prune inadmissible symbol sequences. The arclength parameterized return 
maps were introduced by Christiansen et al. [62], and utilized in ref. [?] Even though no 
dynamical system has been studied more exhaustively than the Lorenz equations, the 
analysis of sect. 11.2 is new. The desymmetrization follows Gilmore and Lettelier [15], 
but the key new idea is taken from Christiansen et al. [62]: the arc-length parametrization 
of the unstable manifold maintains the 1-to-l relation of the full cZ-dimensional state space 
dynamics and its 1 -dimensional return-map representation, in conttast to 1 -dimensional 
projections of the (d— l)-dimensional Poincare section return maps previously deployed 
in the literature. In other words, to high accuracy no information about the flow is lost by 
its 1-dimensional return map description. 

Remark 12.2 Smale horseshoe. S. Smale understood clearly that the crucial ingre- 
dient in the description of a chaotic flow is the topology of its non-wandering set, and he 
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provided us with the simplest visualization of such sets as intersections of Smale horse- 
shoes. In retrospect, much of the material covered here can already be found in Smale's 
fundamental paper [27], but an engineer or a scientist who has run into a chaotic time 
series in his laboratory might not know that he is investigating the action (differentiable) 
of a Lie group G on a manifold M, and that the Lefschetz trace formula is the way to go. 

We have tried to explain the geometric picture the best we could in the static text for- 
mat, but there is no substitute for dynamics but the dynamics itself. We found Demidov's 
"Chaotic maps" [65] simulations of the Henon map particularly helpful in explaining how 
horsheshoes partition the non-wandering sets. 

Remark 12.3 Pruning fronts. The 'partition conjecture' is due to Grassberger and 
Kantz [3], The notion of a pruning front and the 'pruning-front conjecture' was formu- 
lated by Cvitanovic et al. [12], and developed by K.T Hansen for a number of dynamical 
systems in his Ph.D. thesis [22] and a series of papers [23]-[27]. The 'multimodal map 
approximation' is described in the K.T. Hansen thesis [22]. Hansen's thesis is still the 
most accessible exposition of the pruning theory and its applications. Detailed studies of 
pruning fronts are carried out in refs. [ 13, ?, 14]; ref. [5] is the most detailed study carried 
out so far. The rigorous theory of pruning fronts has been developed by Y. Ishii [18, 19] 
for the Lozi map, and A. de Carvalho [16, 17] in a very general setting. Beyond the or- 
bit pruning and its infinity of admissible unstable orbits, an attractor of Henon type may 
also own an infinity of attractive orbits coexisting with the strange attractor [20, 21], We 
offer heuristic arguments and numerical evidence that the coexistence of attractive orbits 
does not destroy the strange attractor/repeller, which is also in this case described by the 
2-dimensional danish pastry plot. 
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Exercises 

12.1. A Smale horseshoe. The Henon map of example 3.6 



x' 




y 





1 — ax" + by 

x 



(12.15) 



maps the [x,y] plane into itself - it was constructed 
by Henon [6] in order to mimic the Poincare section 
of once-folding map induced by a flow like the one 
sketched in figure 11.10. For definitiveness fix the pa- 
rameters to a = 6, b = — 1 . 

a) Draw a rectangle in the (x,y) plane such that its 
«th iterate by the Henon map intersects the rectan- 
gle 2" times. 

b) Construct the inverse of the (12.15). 

c) Iterate the rectangle back in the time; how many 
intersections are there between the n forward and 
m backward iterates of the rectangle? 

d) Use the above information about the intersections 
to guess the {x,y) coordinates for the two fixed 
points, a 2-periodic point, and points on the two 
distinct 3-cycles from table 15.1. The exact peri- 
odic points are computed in exercise 13.13. 

12.2. Kneading Danish pastry. Write down the (x,y) — > 
(x, y) mapping that implements the baker's map 




! ; : 110 



FIG. 4. Itenth 



Figure: Kneading danish pastry: symbol square repre- 
sentation of an orientation reversing once-folding map 
obtained by fattening tlie Smale horseshoe intersections 



12.3. 



12.4. 



12.5. 



of figure 12.4 into a unit square. In the symbol square 
the dynamics maps rectangles into rectangles by a dec- 
imal point shift. together with the inverse mapping. 

Sketch a few rectangles in symbol square and their for- 
ward and backward images. (Hint: the mapping is very 
much like the tent map (11 .4)). 

Kneading danish without flipping. The baker's map 
of exercise 12.2 includes a flip - a map of this type is 
called an orientation reversing once-folding map. Write 
down the (x,y) — > (x,y) mapping that implements an 
orientation preserving baker's map (no flip; Jacobian de- 
terminant = 1). Sketch and label the first few folds of the 
symbol square. 

Orientation reversing once-folding map. By adding 
a reflection around the vertical axis to the horseshoe map 
g we get the orientation reversing map g shown in the 
second Figure above. Qq and Q\ are oriented as Qq and 
<2i, so the definition of the future topological coordi- 
nate y is identical to the y for the orientation preserving 
horseshoe. The inverse intersections (5 0 ' and Q^ 1 are 
oriented so that Q~ l is opposite to Q, while Q~ l has the 
same orientation as Q. Check that the past topological 
coordinate 5 is given by 



w„-i = 



6(x) 



1 — w„ if s n — 0 

W„ if S n = 1 ' 



W 0 = SO 



0.wow_iw_ 



|]w>i_ n /2" (12.16) 



Infinite symbolic dynamics. Let cr be a func- 

tion that returns zero or one for every infinite binary 
string: cr : {0,1 } N — > {0,1}. Its value is represented 
by cr(ei, €2, . . .) where the e, are either 0 or 1. We will 
now define an operator T that acts on observables on the 
space of binary strings. A function a is an observable if 
it has bounded variation, that is, if 



||a|| = sup|a(ei,€2, ■ ■ -)l < 00 • 



For these functions 



Ta(ei,e 2 ,...) 



a(0, 61,62, ...)cr(0, 6i,62,...) 
+a(l,ei,62,...)0-(l,6i,62,...). 
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(a) (easy) Consider a finite version T„ of the operator 
T: 

T„a(ei,e 2 ,...,e u „) = 

a(0, e\,ei,..., e„_i)cr(0, e u 62, . . . , e„_i) 

a(l, ei, e 2 , ■ • ■ , e„-i)(r(l, ei, 62, ■ • ■ , 

Show that T„ is a 2" x 2" matrix. Show that its 
trace is bounded by a number independent of n. 

(b) (medium) With the operator norm induced by the 
function norm, show that T is a bounded operator. 

(c) (hard) Show that T is not trace class. 

12.6. 3-disk fundamental domain cycles. (continued 
from exercise 9.6) Try to sketch 0, T, 67, OOT, 67T, 
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in the fundamental domain, and interpret the symbols 
{0, 1 } by relating them to topologically distinct types of 
collisions. Compare with table 12.2. Then try to sketch 
the location of periodic points in the Poincare section of 
the billiard flow. The point of this exercise is that while 
in the configuration space longer cycles look like a hope- 
less jumble, in the Poincare section they are clearly and 
logically ordered. The Poincare section is always to be 
preferred to projections of a flow onto the configuration 
space coordinates, or any other subset of state space co- 
ordinates which does not respect the topological organi- 
zation of the flow. 

12.7. 3-disk pruning. (Not easy) Show that for 3-disk 
game of pinball the pruning of orbits starts at R : a = 
2.04821419 . . ., figure 11.6. (K.T. Hansen) 
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Chapter 13 



Fixed points, and how to get them 



H 



Cycles. Is there anything they can't do? 

— Mason Porter, channeling Homer Simpson 



aving set up the dynamical context, we now turn to the key and unavoidable 
numerical task in this subject; we must search for the solutions (x, T), 
x e E. d , T e R satisfying the periodic orbit condition 

f nT {x) = f\x), T>0 (13.1) 



for a given flow or map. 

In chapters 18 and 19 we will establish that spectra of evolution operators can 
be extracted from periodic orbit sums: 



(spectral eigenvalues) = ^ (periodic orbits) 



Hence, periodic orbits are the necessary ingredient for evaluation of the spectra 
of evolution operators. We need to know what periodic orbits can exist, and the 
symbolic dynamics developed so far is an invaluable tool toward this end. 

Sadly, searching for periodic orbits will never become as popular as a week 
on Cote d'Azur, or publishing yet another log-log plot in Phys. Rev. Letters. This 
chapter is intended as a hands-on guide to extracting periodic orbits, and should 
be skipped on first reading - you can return to it whenever the need for finding 
actual cycles arises. A serious cyclist will want to also learn about the variational 
methods to find cycles, chapter 29. They are particularly useful when little is 
known about the topology of a flow, such as in high-dimensional periodic orbit 
searches. 



fast track: 
chapter 14, p. 288 



chapter 29 
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A prime cycle p of period T p is a single traversal of the periodic orbit, so our 
task will be to find a periodic point x e M p and the shortest time T p for which 
(13.1) has a solution. A periodic point of a flow f crossing a Poincare section 
n times is a fixed point of P", the nth iterate of P, the return map (3.1); hence, 
we shall refer to all cycles as "fixed points" in this chapter. By cyclic invariance, 
Floquet multipliers and the period of the cycle are independent of the choice of 
the initial point, so it will suffice to solve (13.1) at a single periodic point. 

If the cycle is an attracting limit cycle with a sizable basin of attraction, it 
can be found by integrating the flow for a sufficiently long time. If the cycle is 
unstable, simple integration forward in time will not reveal it, and the methods 
to be described here need to be deployed. In essence, any method for finding 
a cycle is based on devising a new dynamical system which possesses the same 
cycle, but for which this cycle is attractive. Beyond that, there is a great freedom 
in constructing such systems, and many different methods are used in practice. 

Due to the exponential divergence of nearby trajectories in chaotic dynamical 
systems, fixed point searches based on direct solutions of the fixed-point condition 
(13.1) as an initial value problem can be numerically very unstable. Methods that 
start with initial guesses for a number of points along the cycle, such as the mul- 
tipoint shooting method described here in sect. 13.3, and the variational methods 
of chapter 29, are considerably more robust and safer. 

A prerequisite for any exhaustive cycle search is a good understanding of the 
topology of the flow: a preliminary step to any serious periodic orbit calculation is 
preparing a list of all distinct admissible prime periodic symbol sequences, such as 
the list given in table 15.1. The relations between the temporal symbol sequences 
and the spatial layout of the topologically distinct regions of the state space dis- 
cussed in chapters 1 1 and 1 2 should enable us to guess the location of a series of 
periodic points along a cycle. Armed with such an informed guess we proceed 
to improve it by methods such as Newton-Raphson iteration; we show how this 
works by applying Newton method to 1- and J-dimensional maps. But first, where 
are the cycles? 



section 5.3 



chapter 29 



13.1 Where are the cycles? 



Q: What if you choose a really bad initial condition and it 
doesn't converge? A: Well then you only have yourself to 
blame. 

— T.D. Lee 



The simplest and conceptually easiest setting for guessing where the cycles are is 
the case of planar billiards. The Maupertuis principle of least action here dictates 
that the physical trajectories extremize the length of an approximate orbit that 
visits a desired sequence of boundary bounces. 



Example 13.1 Periodic orbits of billiards. Consider how this works for 3-disk 

pinball game of sect. 12.5. . Label the three disks by 1,2 and 3, and associate to every 

H w ' section 12.5 

section 1 .4 
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trajectory an itinerary, a sequence of labels indicating the order in which the disks are 
visited, as in figure 3.8. Given the itinerary, you can construct a guess trajectory by 
taking a point on the boundary of each disk in the sequence, and connecting them by 
straight lines. Imagine that this is a rubber band wrapped through 3 rings, and shake 
the band until it shrinks into the physical trajectory, the rubber band of shortest length. 

Extremization of a cycle length requires variation of n bounce positions s,. 
The computational problem is to find the extremum values of cycle length L(s) where 
s = Oi, . . . , s n ) , a task that we postpone to sect. 29.3. As an example, the short peri: 

ods and stabilities of 3-disk cycles computed this way are listed table 29.3, and some 

exercis 

examples are plotted in figure 3.8. It's a no brainer, and millions of such cycles have 
been computed. 



If we were only so lucky. Real life finds us staring at something like Yang- 
Mills or Navier-Stokes equations, utterly clueless. What to do? 

One, there is always mindless computation. In practice one might be satisfied 
with any rampaging robot that finds "the most important" cycles. The ergodic 
explorations of recurrences sometimes perform admirably well, and we discuss 
this next. 



13.1.1 Cycles from long time series 

Two wrongs don't make a right, but three lefts do. 
— Appliance guru 

(L. Rondoni and R Cvitanovic) 

The equilibria and periodic orbits (with the exception of sinks and stable limit 
cycles) are never seen in simulations and experiments because they are unstable. 
Nevertheless, one does observe close passes to the least unstable equilibria and 
periodic orbits, as in figure 13.1. Ergodic exploration by long-time trajectories (or 
long-lived transients, in case of strange repellers) can uncover state space regions 
of low velocity, or finite time recurrences. In addition, such trajectories preferen- 
tially sample the natural measure of the 'turbulent' flow, and by initiating searches 
within the state space concentrations of natural measure bias the search toward the 
dynamically important invariant solutions. 

The search consists of following a long trajectory in state space, and looking 
for close returns of the trajectory to itself, see figure 13.1. Whenever the trajectory 
almost closes in a loop (within a given tolerance), another point close to this near 
miss of a cycle can be taken as an initial condition. Supplemented by a Newton 
routine described below, a sequence of improved initial conditions may indeed 
rapidly lead to closing a cycle. The method preferentially finds the least unstable 
orbits, while missing the more unstable ones that contribute little to the cycle 
expansions. 

This blind search is seriously flawed: in contrast to the 3-disk example 13.1, 
it is not systematic, it gives no insight into organization of the ergodic sets, and 



remark 13.1 



section 16.1 
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Figure 13.1: An ergodic trajectory can shadow an un- 
stable periodic orbit p for a finite time. 




can easily miss very important cycles. Foundations to a systematic exploration 
of ergodic state space are laid in chapters 11 and 12, but are a bit of work to 
implement. 

13.1.2 Cycles found by thinking 



A systematic charting out of state space starts out by a hunt for equilibrium points. 
If the equations of motion are a finite set of ODEs, setting the velocity field v(x) 
in (2.7) to zero reduces search for equilibria to a search for zeros of a set of al- 
gebraic equations. We should be able, in principle, to enumerate and determine 
all real and complex zeros in such cases, e.g. the Lorenz example 2.2 and the 
Rossler example 2.3. If the equations of motion and the boundary conditions are 
invariant under some symmetry, some equilibria can be determined by symmetry 
considerations: if a function is e.g. antisymmetric, it must vanish at origin, e.g. 
the Lorenz EQq = (0, 0, 0) equilibrium. 

As to other equilibria: if you have no better idea, create a state space grid, 
about 50 Xk across M in each dimension, and compute the velocity field ^ = v(jCjt) 
at each grid point; a few million vt values are easily stored. Plot Xk for which 
\vk\ 2 < e, e « \v max \ 2 but sufficiently large that a few thousand Xk are plotted. 
If the velocity field varies smoothly across the state space, the regions |i^| 2 < e 
isolate the (candidate) equilibria. Start a Newton iteration with the smallest ^| 2 
point within each region. Barring exceptionally fast variations in v(x) this should 
yield all equilibrium points. 

For ODEs equilibria are fixed points of algebraic sets of equations, but steady 
states of PDEs such as the Navier-Stokes flow are themselves solutions of ODEs 
or PDEs, and much harder to determine. 

Equilibria-by definition-do not move, so they cannot be "turbulent." What 
makes them dynamically important are their stable/unstable manifolds. A chaotic 
trajectory can be thought of as a sequence of near visitations of equilibria. Typi- 



Thinking is extra price. 
— Dicho Colombiano 
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Figure 13.2: (a) The y — > P\(y,z) return map for 
the x = 0, y > 0 Poincare section of the Rossler 
flow figure 2.6. (b) The 1 -cycle found by taking 
the fixed point y k+n = y k together with the fixed 
point of the z — * z return map (not shown) as an 
initial guess (0,;y <0) ,z (0) ) for the Newton-Raphson 
search, (c) The third iterate, y k+3 = P\{yk,Zk), 
of the Poincare return map (3.1) together with the 
corresponding plot for Zk+i = P\(yk,Zk), is used 
to pick initial guesses for the Newton-Raphson 
searches for the two 3-cycles: (d) the 001 cycle, 
and (e) the Oil cycle. (G. Simon) 



(a) 



(c) 





(e) 



cally such neighborhoods have many stable, contracting directions and a handful 
of unstable directions. Our strategy will be to generalize the billiard Poincare sec- 
tion maps P Sn+1 <- Sn °f example 3.9 to maps from a section of the unstable manifold 
of equilibrium s n to the section of unstable manifold of equilibrium and thus 
reduce the continuous time flow to a sequence of maps. These Poincare section 
maps do double duty, providing us both with an exact representation of dynamics 
in terms of maps, and with a covering symbolic dynamics. 

We showed in the Lorenz flow example 1 1.4 how to reduce the 3-dimensional 
Lorenz flow to a 1-dimensional return map. In the Rossler flow example 2.3 we 
sketched the attractor by running a long chaotic trajectory, and noting that the 
attractor is very thin, but that otherwise the return maps that we plotted were dis- 
quieting - figure 3.3 did not appear to be a 1-to-l map. In the next example we 
show how to use such information to locate cycles approximately. In the remain- 
der of this chapter and in chapter 29 we shall learn how to turn such guesses into 
highly accurate cycles. 



Example 13.2 Rossler attractor. We run a long simulation of the Rossler flow 
f, plot a Poincare section, as in figure 3.2, and extract the corresponding Poincare 
return map P, as in figure 3.3. Luck is with us, since figure 13.2(a) return map 
y -> P\(y,z) is quite reminiscent of a parabola, we take the unimodal map symbolic 
dynamics, sect. 11.3, as our guess for the covering dynamics. Strictly speaking, the 
attractor is "fractal," but for all practical purposes the return map is 1-dimensional; your 
printer will need a resolution better than 10 14 dots per inch to even begin resolving its 
structure. 

Periodic points of a prime cycle p of cycle length n p for the x - 0, y > 0 Poincare 
section of the Rossler flow figure 2.6 are fixed points (y, z) = P" (y, z) of the n th Poincare 
return map. 

Using the fixed point yt+\ = yk in figure 13.2(a) together with the simultaneous 
fixed point of the z -> P](y,z) return map (not shown) as a starting guess (0,y (0) ,z (0) ) 
for the Newton-Raphson search for the cycle p with symbolic dynamics label 1, we find 
the cycle figure 13.2(b) with the Poincare section point (Q,y p ,z p ), period T p , expand- 
ing, marginal, contracting Floquet multipliers (A ;v , A Ptin , A PtC ), and the corresponding 

Lyapunov exponents (A„ e , A„ „,, A„ c ): 

'' '' exercise 13 
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l-cycle: 



(x,y, z) 
T\ 

(Ai, e ,A ljm ,A ljC ) 

C^l.e, Mjn> ^l,c) 



(0,6.09176832,1.2997319) 
5.88108845586 

(-2.40395353, 1 + 10~ 14 , -1.29 x 10~ 14 ) 
(0.149141556, 10~ 14 , -5.44). 



(13.2) 



The Newton-Raphson method that we used is described in sect. 13.4. 

As an example of a search for longer cycles, we useyk+3 = P\(jk,Zk), the third 
iterate of the Poincare return map (3.1) plotted in figure 13.2(c), together with a cor- 
responding plot for Zk+3 = f 3 (yk,Zk), to pick starting guesses for the Newton-Raphson 
searches for the two 3-cycles plotted in figure 13.2(d), (e). For a listing of the short 
cycles of the Rossler flow, consult exercise 13.10. 

The numerical evidence suggests (though a proof is lacking) that all cycles 
that comprise the strange attractor of the Rossler flow are hyperbolic, each with an 
expanding eigenvalue |A e | > 1, a contracting eigenvalue |A C | < 1, and a marginal 
eigenvalue |AJ = 1 corresponding to displacements along the direction of the flow. 

For the Rossler flow the contracting eigenvalues turn out to be insanely con- 
tracting, a factor of e~ 32 per one par-course of the attractor, so their numerical deter- 
mination is quite difficult. Fortunately, they are irrelevant; for all practical purposes 
the strange attractor of the Rossler flow is 1 -dimensional, a very good realization of a 
horseshoe template. (G. Simon and P. Cvitanovic) 



13.2 One-dimensional maps 



So far we have given some qualitative hints for how to set out on a periodic orbit 
hunt. In what follows, we teach you how to nail down periodic orbits numerically. 



13.2.1 Inverse iteration 

Let us first consider a very simple method to find the unstable cycles of a 1- 
dimensional map such as the logistic map. Unstable cycles of 1 -dimensional maps 
are attracting cycles of the inverse map. The inverse map is not single-valued, so 
at each backward iteration we have a choice of branch to make. By choosing the 
branch according to the symbolic dynamics of the cycle we are trying to find, we 
will automatically converge to the desired cycle. The rate of convergence is given 
by the stability of the cycle, i.e., the convergence is exponentially fast. Figurel3.3 
shows such a path to the 01 -cycle of the logistic map. 



The method of inverse iteration is fine for finding cycles for 1-d maps and 
some 2-dimensional systems such as the repeller of exercise 13.13. It is not par- 
ticularly fast, however, especially if the inverse map is not known analytically. It 
also completely fails for higher dimensional systems where we have both stable 
and unstable directions. Inverse iteration will exchange these, but we will still be 
left with both stable and unstable directions. The best strategy is to directly attack 
the problem of finding solutions of f T (x) = x. 



(F. Christiansen) 



exercise 13.13 
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Figure 13.3: The inverse time path to the 01 -cycle of 
the logistic map f(x) = 4x(l - x) from an initial guess 
of x = 0.2. At each inverse iteration we chose the 0 
(respectively 1) branch. 



Figure 13.4: Convergence of Newton method (0) 
vs. inverse iteration (+). The error after n itera- 
tions searching for the 01 -cycle of the logistic map 
f(x) = 4x(l - x) with an initial starting guess of 
X\ = 0.2, x 2 = 0.8. The y-axis is log 10 of the error. 
The difference between the exponential convergence of 
the inverse iteration method and the super-exponential 
convergence of Newton method is dramatic. 



13.2.2 Newton method 

Newton method for determining a zero x* of a function F(x) of one variable is 
based on a linearization around a starting guess x^: 

F(x) * F(xo) + F'(xo)(x - xo). (13.3) 

An approximate solution x\ of F(x) = 0 is 

xx = xo - F(x 0 )/F'(jco). (13.4) 

The approximate solution can then be used as a new starting guess in an iterative 
process. A fixed point of a map / is a solution to F(x) = x - f(x) = 0. We 
determine x by iterating 

X m = g(Xm-l) = Xm-l ~ F{x m -\)IF'(x m -\) 

= Xm-! - — "(Xm-1 -f(x m -i)). (13.5) 

1 -f'(x m -i) 

Provided that the fixed point is not marginally stable, f{x) + 1 at the fixed point 
x, a fixed point of / is a super-stable fixed point of the Newton-Raphson map g, 
g'(x) = 0, and with a sufficiently good initial guess, the Newton-Raphson iteration 
will converge super-exponentially fast. 

To illustrate the efficiency of Newton method we compare it to the inverse 
iteration method in figure 13.4. Newton method wins hands down: the number 
of significant digits of the accuracy of the x estimate typically doubles with each 
iteration. 




-35 1 1 1 1 1 1 1 1 1 1 1 

0 2 4 6 8 10 12 14 16 18 20 
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Figure 13.5: Newton method: bad initial guess 
leads to the Newton estimate x (b+i) far away from the 
desired zero of F(x). Sequence x {m \ x (m+1) , • ■ ■, 
starting with a good guess converges super- 
exponentially to x*. The method diverges if it iterates 
into the basin of attraction of a local minimum x°. 




In order to avoid jumping too far from the desired x* (see figure 13.5), one 
often initiates the search by the damped Newton method, 

Ax (m) = x ( m+ l) _ x (m) = _£±^> At 0 < AT < 1 , 

F'(jt< m >) 

takes small At steps at the beginning, reinstating to the full At = 1 jumps only 
when sufficiently close to the desired x*. 



13.3 Multipoint shooting method 



(F. Christiansen) 



Periodic orbits of length n are fixed points of f so in principle we could 
use the simple Newton method described above to find them. However, this is 
not an optimal strategy. The function f oscillates wildly, with as many as 2" 
or more closely spaced fixed points, and finding a specific periodic point, such 
as one with a given symbolic sequence, requires a very good starting guess. For 
binary symbolic dynamics we must expect to improve the accuracy of our initial 
guesses by at least a factor of 2" to find orbits of length n. Furthermore, the 
Jacobian of f will be ill-conditioned because eigenvalues go like A". But if the 
map is broken up, the eigenvalues are * A. A better alternative is the multipoint 
or multiple shooting method. While it might very hard to give a precise initial 
guess for a long periodic orbit, if our guesses are informed by a good state space 
partition, a rough guess for each point along the desired trajectory might suffice, 
as for the individual short trajectory segments the errors have no time to explode 
exponentially. And, indeed, in chapter 1 1 we have developed a qualitative theory 
of how these cycle points are laid out topologically. 

A cycle of length n is a zero of the ^-dimensional vector function F: 



F{x) 



X\ 

X2 



' Xi-fiXn) 
X2~f(Xl) 

x n ~ f{x n -\) 



The relationship between the temporal symbol sequences and the spatial layout 
of the topologically distinct regions of state space discussed in chapter 11 enable 
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us to guess the location of a series of periodic points along a cycle. Armed with 
such informed initial guesses, we can initiate a Newton-Raphson iteration. The 
iteration in Newton's method now takes the form 



—F(x)(x - x) - -F(x), 
ax 



(13.6) 



where j^-F(x) is an [n x n] matrix: 



1 

-f'(xi) 1 



-/'(*») 



1 



-f'(Xn-l) 1 



(13.7) 



This matrix can easily be inverted numerically by first eliminating the elements 
below the diagonal. This creates non-zero elements in the nth column. We elimi- 
nate these and are done. 



Example 13.3 Newton inversion for a 3-cycle. Let us illustrate how this works step 
by step for a 3-cycle. The initial setup for a Newton step is: 



( 1 


0 


-fix,) 


\ 


' Ax i ) 




\ Fl ) 


-/'(*l) 


1 


0 




Ax 2 






I o 


-f'(x 2 ) 


1 


/ 


, Ax 3 ) 




{ F 3 ) 



where Ax; = x\ - x, is the correction to our initial guess x u and F t = x t - f{Xi-\) is the 
error at ith periodic point. Eliminate the sub-diagonal elements by adding f'(xi) times 
the first row to the second row, then adding f'(x 2 ) times the second row to the third 
row: 



( 1 0 


-f'(x 3 ) \ 


' Ax, N 


0 1 


-/'(*l)/'(*3) 


Ax 2 


, 0 0 


Wfe)/'(*i)/'ta) ) 


, Ax 3 , 




{ Fl 












I F 3 +f'(x2)F 2 +f'(x2)f'(x l )F 1 



The next step is to invert the last element in the diagonal, i.e., divide the third row by 
1 - f'{x2)f'(x\)f'(xi). If this element is zero at the periodic orbit this step cannot work. 
As f'(x 2 )f'(xi)f'(x3) represents the stability of the cycle (when the Newton iteration 
has converged), this is not a good method to find marginally stable cycles. We now 
have 



( 1 


0 


-Z'fe) ^ 


' Ax, 1 


> 


0 


1 


-/'(xilffe) 


Ax 2 




I o 


0 


1 ) 


, Ax 3 ) 


V 



Fi 

F 1 +f'(x l )F l 

F 3 +f'{x 2 )F 2 +f'Ui)f'MF i 



Finally we add /'(x 3 ) times the third row to the first row and /'(xi)/'(x 3 ) times the third 
row to the second row. The left hand side matrix is now the unit matrix, and the right 
hand side is an explicit formula for the corrections to our initial guess. With this, we 
have gone through one Newton iteration. 
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When one sets up Newton iteration on a computer, it is not necessary to write 
the left hand side as a matrix. All one needs is a vector containing the /(x,)'s and 
a vector containing the rc'th column, i.e., the cumulative product of the /(^;)'s 
and a vector containing the right hand side. After iteration the vector containing 
the right hand side is the correction to the initial guess. 

13.3.1 J-dimensional maps 

Armed with clever initial guesses from a system's symbolic dynamics, we can 
easily extend the Newton-Raphson iteration method to J-dimensional maps. In 
this case /'(xf) is a [d x d] matrix, and j^F(x) is an [nd x nd] matrix. In each 
of the steps above, we are then manipulating d rows of the left-hand-side matrix. 
(Remember that matrices do not commute - always multiply from the left.) In 
inverting the nth element of the diagonal we are inverting a [d x d] matrix (1 - 
EI f'( x i)) which can be done as long as none of the eigenvalues of n f'(xd equals 
1, i.e., if the cycle has no marginally stable eigen-directions. 

Example 13.4 Newton method for time delay maps. Some d-dimensional maps 
(such as the Henon map (3.17)) can be written as 1 -dimensional time delay maps of 
the form 

f(xd = /(xi-uXi-2, ■ Xi- d ). (13.8) 

In this case, ^F(x) is an [n xn] matrix as in the case of usual 1 -dimensional maps but 
with non-zero matrix elements on d off-diagonals. 



13.4 Flows 

(R. Paskauskas and R Cvitanovic) 

For a continuous time flow the periodic orbit the Floquet multiplier (5.15) along 
the flow direction always equals unity; the separation of any two points along 
a cycle remains unchanged after a completion of the cycle. More unit Floquet 
multipliers arise if the flow satisfies conservation laws, such as the symplectic in- 
variance for Hamiltonian flows, or the dynamics is equivariant under a continuous 
symmetry transformation. 

Let us apply the Newton method of (13.4) to search for periodic orbits with 
unit Floquet multipliers, starting with the case of a continuous time flow. Assume 
that the periodic orbit condition (13.1) holds for x + Ax and T + At, with the initial 
guesses x and T close to the desired solution, i.e., with |Ax|, At small. The Newton 
setup (13.4) 

0 = x + Ax - f T+At (x + Ax) 

* x-f T (x) + (l-J(x))-Ax-v(f(x))At (13.9) 




section 5.3.1 



section 10.3 
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suffers from two shortcomings. First, we now need to solve not only for the pe- 
riodic point x, but for the period T as well. Second, the marginal, unit Floquet 
multiplier (5.15) along the flow direction (arising from the time-translation invari- 
ance of a periodic orbit) renders the factor (1 - J) in (13.5) non-invertible: if x 
is close to the solution, f T (x) ~ x, then J(x) ■ v(x) = v(f T (x)) ~ v(x). If Ax is 
parallel to the velocity vector, the derivative term (1 - J) ■ Ax ~ 0, and it becomes 
harder to invert ( 1 - J) as the iterations approach the solution. 

As a periodic orbit p is a 1 -dimensional set of points invariant under dynamics, 
Newton guess is not improved by picking Ax such that the new point lies on the 
orbit of the initial one, so we need to constrain the variation Ax to directions 
transverse to the flow, by requiring, for example, that 

v(x)-Ax = 0. (13.10) 

Combining this constraint with the variational condition (13.9) we obtain a New- 
ton setup for flows, best displayed in the matrix form: 



1 - J(x) v(x) \ Ax | _ _ | x - f(x) . 
v(x) O A/ " 0 | 



This illustrates the general strategy for determining periodic orbits in presence of 
continuous symmetries - for each symmetry, pick a point on the orbit by imposing 
a constraint, and compute the value of the corresponding continuous parameter 
(here the period T) by iterating the enlarged set of Newton equations. Constraining 
the variations to transverse ones thus fixes both of Newton's shortcomings: it 
breaks the time-translation invariance, and the period T can be read off once the 
fixed point has been found (hence we omit the superscript in f for the remainder 
of this discussion). 

More generally, the Poincare surface of section technique of sect. 3. 1 turns 
the periodic orbit search into a fixed point search on a suitably defined surface of 
section, with a neighboring point variation Ax with respect to a reference point x 
constrained to stay on the surface manifold (3.2), 

U(x + Ax) = U(x) = 0 . (13.12) 

The price to pay are constraints imposed by the section: in order to stay on the 
surface, arbitrary variation Ax is not allowed. 



Example 13.5 A hyperplane Poincare section. Let us for the sake of simplicity 
assume that the Poincare surface of section is a (hyper)-plane, i.e., it is given by the 
linear condition (3.6) 

(x - x 0 ) ■ h = 0, (13.13) 
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where h is a vector normal to the Poincare section and x a is any point in the Poincare 
section. The Newton setup is then (derived as (13.11)) 



1-7 



v(jc) 
0 



x — X 
At 



-F(x) 
0 



(13.14) 



The last row in this equation ensures that x will be in the surface of section, and the 
addition of v(x)At, a small vector along the direction of the flow, ensures that such an x 
can be found, at least if x is sufficiently close to a fixed point off. 

To illustrate that the addition of the extra constraint resolves the problem of 
(1-7) non-invertability we consider the particularly simple example of a 3-d flow with 
the (x,y,Q)-plane as the Poincare section, a = (0,0, 1). Let all trajectories cross the 
Poincare section perpendicularly, so that v = (0, 0, v.), which means that the marginally 
stable direction is also perpendicular to the Poincare section. Furthermore, let the 
unstable direction be parallel to the x-axis and the stable direction be parallel to the 
y-axis. The Newton setup is now 



( 1-A„ 0 0 0 

0 1-A V 0 0 

0 0 0 v z 

0 0 10 



\ 


r s * 1 




( ~ F * ) 




6y 




-Fy 








~F Z 


/ 


v 8t ) 




{ 0 J 



(13.15) 



If one considers only the upper-left [3x3] matrix (which we started out with, prior to 
adding the constraint (13.13)) then this matrix is not invertible and the equation does 
not have a unique solution. However, the full [4x4] matrix is invertible, as det(-) = 
-v z det(l - Mj_), where M ± is the [2x2] monodromy matrix for a surface of section 
transverse to the orbit (see sect. 5.5). (F. Christiansen) 



13.4.1 Cost function 



It pays to think in terms of a cost (or error) function I(Ax) = (x+Ax-f(x+Ax)f /2. 
Periodic orbit condition (13.1) corresponds both to a zero of I (Ax), and of its first 

~ 2 

Ax variation. Expand I (Ax) to the second order in Ax, / « Ax 1 2 + (x - f(x)) ■ 
Ax + (x - f(x)) 2 /2, where Ax = (1 - J(x))Ax. To find an extremum, we set the 
derivative with respect to Ax to zero. As the term (x - f(x)) 2 /2 is a constant under 
Ax variation, let us define an unconstrained cost function 

I 0 (Ax) - jAx ■ Ax + (x- f(x)) ■ Ax , (13.16) 

Setting the derivative of this function 
dlo(Ax) 

\ =Ax + x- f(x) = (1 - /(*)) • Ax + x - f(x) (13.17) 
dAx 

to zero recovers the Newton setup (13.4) 

Next, we need to enforce the constraint that curbs the directions in which Ax 
can point. Lagrange multipliers come to help. 
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A local surface of section can be constructed when fix) is "near" the initial 
point x. A natural choice is a hyperplane perpendicular to the velocity vector vix). 
The reference point ;toin(13.13)is;c itself, and the surface of section condition 
is U(x + Ax) = v(x) ■ Ax = 0. Introduce a Lagrange multiplier A, and assemble a 
cost function with the constraint: 



h {Ax, X) = -Ax- Ax + [x- f{x)] ■ Ax + Av{x) -Ax. (13.18) 



Now we differentiate I\ {Ax, A) with respect to each argument and set the deriva- 
tives to zero. We recover the Newton setup (13.1 1), with the Lagrange multiplier 
A - At interpreted as the time increment needed to place f(x) onto the section, 
f{x) -> f{x) + v(f(x))At. 

A global surface of section is a fixed surface Uix + Ax) - U(xty) « dUix)Ax + 
U(x) - U (jco) that hopefully transects all or a significant portion of recurrent parts 
of the flow. It is not as 'natural' as the local section (13.10), but hard to avoid in 
practice, and one is interested not only in the fixed point itself, but in the global 
reach of its unstable manifold as well. The simplest choice is a hyperplane (13. 13). 
The cost function and the variational equations are then 



I 2 iAx,At) - -Ax[\-Jix)]Ax + (x-fix))Ax 

+ At(dU(x)Ax + Uix) - U(xq)) , (13.19) 



l-Jix) dU(x) \ Ax \__ x-fix) 

dUix) 0 II At I" 1 U(x)-U(x 0 ) ) (inv) 



Further continuous symmetries can be handled in the same fashion. Suppose, 
for example, that we are searching for periodic orbits of a Hamiltonian flow. 
There, periodic orbits not only have the time-translation symmetry, but energy- 
translation symmetry as well. What is energy-translation symmetry? If there ex- 
ists a periodic orbit at x with energy Hix) = E, and period T, it is very likely that it 
belongs to a family of orbits (x+eAx(E), T+eAtiE)) continuous under variation of 
E. As with the time-translation symmetry, this implies a unit Floquet multiplier: 
indeed, we know from sect. 7.4 that symplectic eigenvalues come in pairs, so unit 
multiplier in the time direction implies a unit multiplier in its dual, the energy 
direction, (A t , A#, • • •) = (1, 1, • • •)• But extending the number of constraints is no 
longer a problem: add more Lagrange multipliers. Consider the following system 

hiAx, X\ , A 2 ) = Ax[l - J(x)]Ax/2 + (x - /(*)) Ax 

+ Ai iUix + Ax) - U(xq)) + A 2 (H(x + Ax) - E 0 ) (13.21) 



( 1 - J(x) 
dUix) 
dH{x) 



example 13.5 



dU{x) 


dH(xY 




Ax 1 




( x-fix) ) 




0 


0 








Uix) - U(xq) 


(13.22) 


0 


0 , 


V 


*2 J 




v Hix) - E 0 J 
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This is the Newton iteration setup for how to search for a periodic orbit of a Hamil- 
tonian flow with a global surface of section U{x) = U(xq) and fixed energy Eq. 
Note that these instructions do not put every iteration on a surface U(x) = U(xj) 
and energy H(x) = Eq, unless the surface is a plane U(x) = a ■ (x - xq), but 
instead assure that the iterations (provided they converge) will approach super- 
exponentially to the surfaces. 

For periodic orbits multi-point shooting generalizes in the same way as (13.7), 
but with n additional equations - one for each point on a Poincare section. The 
Newton setup looks like this: 



t 1 
-h 1 



~Jn 



-Jn-1 1 



0 



\ 


' Axi ' 




r -Fi \ 




AX2 




-F 2 




Ax n 




-F„ 




Afi 




0 


/ 






, 0 , 



Solving this equation resembles the corresponding task for maps. However, we 
will need to invert a [(d + \)n x (d + \)n\ matrix rather than a [d x d] matrix. 



Resume 



There is no general computational algorithm that is guaranteed to find all solutions 
(up to a given period r ma x) to the periodic orbit condition 

f t+T (x) = f(x), T>0 

for a general flow or mapping. Due to the exponential divergence of nearby trajec- 
tories in chaotic dynamical systems, direct solution of the periodic orbit condition 
can be numerically very unstable. 

A prerequisite for a systematic and complete cycle search is a good (but hard 
to come by) understanding of the topology of the flow. Usually one starts by - 
possibly analytic - determination of the equilibria of the flow. Their locations, sta- 
bilities, stability eigenvectors and invariant manifolds offer skeletal information 
about the topology of the flow. The next step is numerical long-time evolution 
of "typical" trajectories of the dynamical system under investigation. Such nu- 
merical experiments build up the "natural measure" and reveal which regions are 
most frequently visited. Periodic orbit searches can then be initialized by taking 
nearly recurring orbit segments and deforming them into closed orbits. With a 
sufficiently good initial guess, the Newton-Raphson formula 
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yields improved estimate x 1 = x + 6x,T' = T + ST. Iteration then yields the period 
T and the location of a periodic point x p in the Poincare section (x p - xq) • a - 0, 
where a is a vector normal to the Poincare section at x$. 

The problem one faces with high-dimensional flows is that their topology is 
hard to visualize, and that even with a decent starting guess for a point on a peri- 
odic orbit, methods like the Newton-Raphson method are likely to fail. Methods 
that start with initial guesses for a number of points along the cycle, such as the 
multipoint shooting method of sect. 13.3, are more robust. Relaxation (or vari- 
ational) methods take this strategy to its logical extreme, and start by a guess of 
not a few points along a periodic orbit, but a guess of the entire orbit. Just as 
these methods are intimately related to variational principles and path integrals, 
we postpone their introduction until chapter 29. 



Commentary 

Remark 13.1 Close recurrence searches. For low-dimensional maps of flows (for 
high-dimensional flows, forget about it) picking initial guesses for periodic orbits from 
close recurrences of a long ergodic trajectory seems like an obvious idea. Nevertheless, 
ref. [1] is frequently cited. Such methods have been deployed by many, among them 
G. Tanner, L. Rondoni, G. Morris, CP. Dettmann, and R.L. Davidchack [ 2, 15, 16, 11,12] 
(see also sect. 20.6). Sometimes one can determine most of the admissible itineraries and 
their weights without working too hard, but method comes with no guarantee. 

Remark 13.2 Piecewise linear maps. The Lozi map (3.19) is linear, and hundred of 
thousands of cycles can easily be computed by [2x2] matrix multiplication and inversion. 

Remark 13.3 Cycles, searches, and symmetries. A few comments about the role 
of symmetries in actual extraction of cycles. In the ,/V-disk billiard example, a fundamen- 
tal domain is a sliver of the ,/V-disk configuration space delineated by a pair of adjoining 
symmetry axes. The flow may further be reduced to a return map on a Poincare surface 
of section. While in principle any Poincare surface of section will do, a natural choice in 
the present context are crossings of symmetry axes, see example 7.7. In actual numerical 
integrations only the last crossing of a symmetry line needs to be determined. The cycle is 
run in global coordinates and the group elements associated with the crossings of symme- 
try lines are recorded; integration is terminated when the orbit closes in the fundamental 
domain. Periodic orbits with non-trivial symmetry subgroups are particularly easy to find 
since their points lie on crossings of symmetry lines, see example 7.7. 

Remark 13.4 Newton gone wild. Skowronek and Gora [24] offer an interesting 
discussion of Newton iterations gone wild while searching for roots of polynomials as 
simple as x 2 + 1 =0. 

Remark 13.5 Symmetries of the symbol square. For a discussion of symmetry 
lines see refs. [5, 6, 6, 7, 8]. It is an open question (see remark 21.2) as to how time 
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reversal symmetry can be exploited for reduction of cycle expansions of chapter 20. For 
example, the fundamental domain symbolic dynamics for reflection symmetric systems 
is discussed in some detail in sect. 21.5, but how does one recode from time-reversal 
symmetric symbol sequences to desymmetrized 1/2 state space symbols? 
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Exercises 

13.1. Cycles of the Ulam map. Test your cycle-searching 
routines by computing a bunch of short cycles and their 
stabilities for the Ulam map f(x) = 4x( 1 — x) . 

13.2. Cycles stabilities for the Ulam map (exact). In ex- 
ercise 13.1 you should have observed that the numerical 
results for the cycle Floquet multipliers (4.42) are ex- 
ceptionally simple: the Floquet multiplier of the xq — 0 
fixed point is 4, while the eigenvalue of any other n- 
cycle is +2". Prove this. (Hint: the Ulam map can be 
conjugated to the tent map (11 .4). This problem is per- 
haps too hard, but give it a try - the answer is in many 
introductory books on nonlinear dynamics.) 

13.3. Stability of billiard cycles. Compute the stabilities of 
few simple cycles: 

(a) A simple scattering billiard is the two-disk bil- 
liard. It consists of a disk of radius one centered 
at the origin and another disk of unit radius lo- 
cated at distance L + 2. Find all periodic orbits 
for this system and compute their stabilities. (You 
might have done this already in exercise 1.2; at 
least now you will be able to see where you went 
wrong when you knew nothing about cycles and 
their extraction.) 

(b) Find all periodic orbits and their stabilities for a 
billiard ball bouncing between the diagonal y — x 
and one of the hyperbola branches y = — 1 jx. 




13.4. Cycle stability. Add to the pinball simulator of ex- 
ercise 8.1a routine that evaluates the expanding eigen- 
value for a given cycle. 

13.5. Pinball cycles. Determine the stability and length of 
all fundamental domain prime cycles of the binary sym- 
bol string lengths up to 5 (or longer) foiR : a — 6 3-disk 
pinball. 

13.6. Newton-Raphson method. Implement the Newton- 
Raphson method in 2-dimensional and apply it to the 
determination of pinball cycles. 



13.7. Fundamental domain fixed points. Use the for- 
mula (8.11) for billiard Jacobian matrix to compute the 
periods T p and the expanding eigenvalues A /; of the fun- 
damental domain 0 (the 2-cycle of the complete 3-disk 
space) and 1 (the 3-cycle of the complete 3-disk space) 
fixed points: 





T p A p 


0: 


R-2 R-l +/?vr 


-2/R 


I: 




- y/3/R 



(13.23) 



We have set the disk radius to a — 1 . 

13.8. Fundamental domain 2-cycle. Verify that for the 
10-cycle the cycle length and the trace of the Jacobian 
matrix are given by 

L w = 2 a//? 2 - V3fl +1-2, 
trJio = Aio + 1/Au, (13.24) 

1 L w (L lQ + 2) 2 

= ZL W + 1+ - — . 

2 V3A/2 - 1 

The 10-cycle is drawn in figure 12.12. The unstable 
eigenvalue Aio follows from (7.30). 

13.9. A test of your pinball simulator: 10-cycle. Test 
your exercise 8.3 pinball simulator stability evaluation 
by checking numerically the exact analytic 10-cycle sta- 
bility formula (13.24). 

13.10. Rossler flow cycles. (continuation of exer- 

cise 4.4) Determine all cycles for the Rossler flow 
(2.18), as well as their stabilities, up to 

(a) 3 Poincare section returns 

(b) (optional) 5 Poincare section returns (Hint: imple- 
ment (13.14), the multipoint shooting methods for 
flows; you can cross-check your shortest cycles 
against the ones listed in the table.) 

Table: The Rossler flow (2.18): The itinerary p, a peri- 
odic point x p — (0, y p , z p ) and the expanding eigenvalue 
A p for all cycles up to topological length 7. 
( J. Mathiesen, G. Simon, A. Basu) 
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2 
3 



4 
5 



P 

1 

01 
001 

on 

0111 

01011 

01111 

001011 

010111 

011111 

0101011 

0110111 

0101111 

0111111 



6.09 rfm 

3.915804 
2.278281 
2.932877 
3.466759 
4.162799 
3.278914 
2.122094 
4.059211 
3.361494 
3.842769 
3.025957 
4.102256 
3.327986 



1. 2997*32 
3.692833 
7.416481 
5.670806 
4.506218 
3.303903 
4.890452 
7.886173 
3.462266 
4.718206 
3.815494 
5.451444 
3.395644 
4.787463 



A. 
-2.403953 
-3.512007 
-2.341923 

5.344908 
-16.69674 
-23.19958 

36.88633 
-6.857665 

61.64909 
-92.08255 

77.76110 
-95.18388 
-142.2380 

218.0284 



13.11. Cycle stability, helium. Add to the helium integrator 
of exercise 2.10 a routine that evaluates the expanding 
eigenvalue for a given cycle. 

13.12. Colinear helium cycles. Determine the stability 
and length of all fundamental domain prime cycles up 
to symbol sequence length 5 or longer for collinear he- 
lium of figure 7.2. 

13.13. Uniqueness of unstable cycles***. Prove that there 
exists only one 3 -disk prime cycle for a given finite ad- 
missible prime cycle symbol string. Hints: look at the 
Poincare maps; can you show that there is exponen- 
tial contraction to a unique periodic point with a given 
itinerary? Exercise 29.1 might be helpful in this effort. 

13.14. Inverse iteration method for a Henon repeller. 
Table: All periodic orbits up to 6 bounces for the Hamil- 
tonian Henon mapping ( 13.25) with a — 6. Listed are 
the cycle itinerary, its expanding eigenvalue A p , and its 
"center of mass." The "center of mass" is listed be- 
cause it turns out that it is often a simple rational or a 
quadratic irrational. 



p 


A* 


2 x pj 


0 


0.715168x10' 


-0.607625 


1 


-0.295285x10' 


0.274292 


10 


-0.989898x10' 


0.333333 


100 


-0.131907xl0 3 


-0.206011 


110 


0.558970xl0 2 


0.539345 


1000 


-0.104430xl0 4 


-0.816497 


1100 


0.577998xl0 4 


0.000000 


1110 


-0.103688xl0 3 


0.816497 


10000 


-0.760653xl0 4 


-1.426032 


11000 


0.444552xl0 4 


-0.606654 


10100 


0.770202xl0 3 


0.151375 


11100 


-0.710688X10 3 


0.248463 


11010 


-0.589499xl0 3 


0.870695 


11110 


0.390994xl0 3 


1.095485 


100000 


-0.545745xl0 5 


-2.034134 


110000 


0.322221xl0 5 


-1.215250 


101000 


0.513762xl0 4 


-0.450662 


111000 


-0.478461xl0 4 


-0.366025 


110100 


-0.639400X10 4 


0.333333 


101100 


-0.639400xl0 4 


0.333333 


111100 


0.390194xl0 4 


0.548583 


111010 


0.109491xl0 4 


1.151463 


111110 


-0.104338xl0 4 


1.366025 



Consider the Henon map (3.17) for the area-preserving 
("Hamiltonian") parameter value b = -1. The coordi- 
nates of a periodic orbit of length n p satisfy the equation 



Xpj+i + x p ,i-i = 1 - ax 2 pi , i=l, n p , (13.25) 

with the periodic boundary condition x p> o = Xpji ■ Ver- 
ify that the itineraries and the stabilities of the short pe- 
riodic orbits for the Henon repeller ( 13.25) at a — 6 are 
as listed above. 

Hint: you can use any cycle-searching routine you wish, 
but for the complete repeller case (all binary sequences 
are realized), the cycles can be evaluated simply by in- 
verse iteration, using the inverse of (13.25) 



X p,i ~ $ 



1 



"V . 1 



X ■ 1 



i = l,...,n p . 



Here S p j are the signs of the corresponding periodic 
point coordinates, S p j = x p j/\x p j\. (G. Vattay) 



13. 15. Ulam map periodic points. 

cise 11.8) 



(continued from exer- 



(a) compute the five periodic points of cycle 10011 
for the Ulam map ( 1 1 .5) f(x) = 4x(l - x) . using 
your Newton or other routine. 



(b) compute the five periodic points of cycle 10000 

(c) plot the above two cycles on the graph of the Ulam 
map, verify that their topological ordering is as in 
the 'canonical' full tent map exercise 11.8. 

(d) (optional) This works only for the Ulam map: 
compute periodic points by conjugating the full 
tent map periodic points of exercise 11.8 using ex- 
ercise B.4. 

13.16. Newton setups for flows. 

(a) We have formulated three Newton setups for 
flows: the 'local' setup (13.11), the 'hyperplane' 
setup (13.14), and the 'global' setup ( 13.20). De- 
rive (13.20) and verify that if the surface of sec- 
tion is a hyperplane, it reduces to (13.14). (Hint: 
it is not inconceivable that (13.14) is wrong as it 
stands.) 

(b) (optional) Derive (13.22), the Newton setup for 
Hamiltonian flows. 

13.17. "Center of mass" puzzle**. Why is the "center of 
mass," tabulated in exercise 13.14, often a rational num- 
ber? 
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Qunadry: all these cycles, but what to do with them? What you have now is a 
topologically invariant road map of the state space, with the chaotic region pinned 
down by a rigid skeleton, a tree of cycles (periodic orbits) of increasing lengths 
and self-similar structure. In chapter 15 we shall turn this topological dynamics into a 
multiplicative operation on the state space partitions by means of transition matrices of 
chapter 14, the simplest examples of evolution operators. This will enable us to count the 
distinct orbits, and in the process touch upon all the main themes of this book, going the 
whole distance from diagnosing chaotic dynamics to computing zeta functions. 



1. Partition the state space and describe all allowed ways of getting from 'here' to 
'there' by means of transition graphs (transition matrices). These generate the total- 
ity of admissible itineraries. (Chapter 14) 

2. Learn to count (Chapter 15) 

3. Learn how to measure what's important (Chapter 16) 

4. Learn how to evolve the measure, compute averages (Chapter 17) 

5. Learn what a 'Fourier transform' is for a nonlinear world, not a circle (Chapter 18), 

6. and how the short-time / long-time duality is encoded by spectral determinant ex- 
pression for evolution operator spectrum in terms of periodic orbits. (Chapter 19) 

7. Learn how use short period cycles to describe chaotic world at times much beyond 
the Lyapunov time (Chapter 20). 



Next ponder how symmetries simplify spectral determinants (chapter 21), develop some 
feeling for the traces of evolution operators (chapter 22), why all this works (chapter 23), 
when does it not work (chapter 24), what does it have to do with foundations of statistical 
mechanics (chapter 25) and turbulence (chapter 26). 
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Chapter 14 



Walkabout: Transition graphs 



I think I'll go on a walkabout 

find out what it's all about [...] take a ride to the other side 
— Red Hot Chili Peppers, 'Walkabout' 



In chapters 1 1 and 12 we learned that invariant manifolds partition the state 
space in invariant way, and how to name distinct orbits. We have established 
and related the temporally and spatially ordered topological dynamics for a 
class of 'stretch & fold' dynamical systems, and discussed pruning of inadmissi- 
ble trajectories. 

Here we shall use these results to generate the totality of admissible itineraries. 
This task will be particularly easy for repellers with complete Smale horseshoes 
and for subshifts of finite type, for which the admissible itineraries are generated 
by finite transition matrices, and the topological dynamics can be visualized by 
means of finite transition graphs. We shall then turn topological dynamics into a 
linear multiplicative operation on the state space partitions by means of transition 
matrices, the simplest examples of 'evolution operators.' They will enable us - in 
chapter 15 - to count the distinct orbits. 



14.1 Matrix representations of topological dynamics 



The allowed transitions between the regions of a partition {At, M2, ■•• , M m ] are 
encoded in the [mxm] -dimensional transition matrix whose elements take values 

I 1 if the transition At/ — > At; is possible (14 1) 

,; ~~ I 0 otherwise . 

The transition matrix is an explicit linear representation of topological dynam- 
ics. If the partition is a dynamically invariant partition constructed from sta- 
ble/unstable manifolds, it encodes the topological dynamics as an invariant law 
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Figure 14.1: Points from the region M21 reach re- 
gions {Alio, -M12}, and no other regions, in one 
time step. Labeling exemplifies the 'shift map' of ex- 
ample 11.7 and (11.20). 




of motion, with the allowed transitions at any instant independent of the trajectory 
history, requiring no memory. 

Several related matrices as well will be needed in what follows. Often it is 
convenient to distinguish between two or more paths connecting the same two 
regions; that is encoded by the adjacency matrix with non-negative integer entries, 



j k if a transition At/ — > At,- is possible in k ways 
l i ~ I 0 otherwise . 



More generally, we shall encounter [mxm] matrices which assign different real or 
complex weights to different transitions, 



1 Ljj € R or C if At ; — > At,- is allowed 
y ' ~~ 10 otherwise . 



As in statistical physics, we shall refer to these as transfer matrices. 

At,- is accessible from At/ in k steps if (L k )jj t 0. A matrix L is called 
reducible if there exists one or more index pairs {i,j} such that 0)ij = 0 for all 
k, otherwise the matrix is irreducible. This means that a trajectory starting in any 
partition region eventually reaches all of the partition regions, i.e., the partition 
is dynamically transitive or indecomposable, as assumed in (2.3). The notion of 
topological transitivity is crucial in ergodic theory: a mapping is transitive if it 
has a dense orbit. If that is not the case, state space decomposes into disconnected 
pieces, each of which can be analyzed separately by a separate irreducible matrix. 
Region At, is said to be transient if no trajectory returns to it. Region At, is said 
to be absorbing if no trajectory leaves it, Ljj + 0, Ljj = 0 for all i ± j. Hence it 
suffices to restrict our considerations to irreducible matrices. 

If L has strictly positive entries, Ljj > 0, the matrix is called positive; if Ljj > 
0, the matrix is called non-negative. Matrix L is said to be eventually positive or 
Perron-Frobenius if L k is positive for some power k (as a consequence, the matrix 
is transitive as well). A non-negative matrix whose columns conserve probability, 
2, Ljj - 1, is called Markov, probability or stochastic matrix. 
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Figure 14.2: Topological dynamics: shrink each state 
space partition region figure 14.1 to a node, and indi- 
cate the possibility of reaching a region by a directed 
link. The links stand for transition matrix elements 
Tum = T n .ii = T n ,2i = 1; remaining T im = 0. 




Example 14.1 Markov chain. The Google PageRank of a webpage is computed by 
a Markov chain, with a rather large Markov matrix M. 

A subshift (11.22) of finite type is a topological dynamical system (£, cr), 
where the shift cr acts on the space of all admissible itineraries ($) 



2 - {(sidkeZ ■ T Sk+m = 1 for all k) , s k e {a,b,c,- ■ ■ ,z] ■ (14.4) 



The task of generating the totality of admissible itineraries is particularly easy for 
subshifts of finite type, for which the admissible itineraries are generated by finite 
transition matrices, and the topological dynamics can be visualized by means of 
finite transition graphs. 



14.2 Transition graphs: wander from node to node 

Let us abstract from a state space partition such as figure 14.1 its topological 
essence: indicate a partition region A\, by a node, and indicate the possibility of 
reaching the region A%, Lf, a + 0 by a directed link, as in figure 14.2. Do this for 
all nodes. The result is a transition graph. 

A transition graph (or digraph, or simply 'graph') consists of a set of nodes 
(or vertices, or states), one for each letter in the alphabet = \a,b,c,- ■ ■ ,z}, 
connected by a set of directed links (edges, arcs, arrows). A directed link starts 
out from node j and terminates at node i whenever the matrix element (14.3) 
takes value Ljj + 0. A link connects two nodes, or originates and terminates on 
the same node (a 'self-loop'). For example, if a partition includes regions labeled 
{• • • , Afioi, Alno, • • •}, the transition matrix element connecting the two is drawn 
as Liouio - (S)-"* - ©, whereas L^o = \ ®- Here a dotted link indicates that the 
shift cr(^oii -) = *n- involves symbol 0, and a full one a shift o~(x\\o-) = xio- 
that involves 1 . A j —>■•■—> k walk (path, itinerary) traverses a connected set 
of directed links, starting at node j and ending at node k. A loop {periodic orbit, 
cycle) is a walk that ends at the starting node (which can be any node along the 
loop), for example 




(14.5) 
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Our convention for ordering indices is that the successive steps in a visitation se- 
quence j — > i — > k are generated by matrix multiplication from the left, = 
Z TkiTjj. Two graphs are isomorphic if one can be obtained from the other by 
relabeling links and nodes. As we are interested in recurrent (transitive, indecom- 
posable) dynamics, we restrict our attention to irreducible or strongly connected 
graphs, i.e., graphs for which there is a path from any node to any other node. (In 
a connected graph one may reach node j from node k, but not node k from node 
j.) 

A transition graph compactly describes the ways in which the state space re- 
gions map into each other, accounts for finite memory effects in dynamics, and 
generates the totality of admissible trajectories as the set of all possible walks 
along its links. 

Construction of a good transition graph is, like combinatorics, unexplainable. 
The only way to learn is by some diagrammatic gymnastics, so we work our way 
through a sequence of exercises in lieu of plethora of baffling definitions. 

Example 14.2 Full binary shift. Consider a full shift on two-state partition M = 
{0, 1), with no pruning restrictions. The transition matrix and the corresponding transi- 
tion graph are 




(14.6) 



Dotted links correspond to shifts originating in region 0, and the full ones to shifts origi- 
nating in 1. The admissible itineraries are generated as walks on this transition graph, 
(continued in example 14.8) 



Example 14.3 Complete N-ary dynamics: If all transition matrix entries equal 
unity (one can reach any region from any other region in one step), 



T r = 



(I 1 
1 1 

1 1 



l\ 
1 

1 ) 



(14.7) 



the symbolic dynamics is called complete, or a full shift. The corresponding transition 
graph is obvious, but a bit tedious to draw for arbitrary N. 



Example 14.4 Pruning rules for a 3-disk alphabet: As the disks are convex, there 
can be no two consecutive reflections off the same disk, hence the covering symbolic 
dynamics consists of all sequences which include no symbol repetitions 11, 22, 33. 
This is a finite set of finite length pruning rules, hence, the dynamics is a subshift of 
finite type (see (1 1.23) for definition), with the transition matrix I graph given by 



T = 



(0 
1 
1 



1 l\ 

0 1 

1 0 




(14.8) 
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The complete unrestricted symbolic dynamics is too simple to be illuminating, 
so we turn next to the simplest example of pruned symbolic dynamics, the finite 
subshift obtained by prohibition of repeats of one of the symbols, let us say_ 1 L . 
This situation arises, for example, in studies of the circle maps, where this kind of 
symbolic dynamics describes "golden mean" rotations. 



exercise 15.6 
exercise 15.8 



Example 14.5 'Golden mean' pruning. Consider a subshift on two-state partition 
${ = {0, 1), with the simplest grammar Q possible, a single pruned block b = 
(consecutive repeat of symbol 1 is inadmissible): the state Mo maps both onto Mo and 
M\, but the state Mi maps only onto Mo- The transition matrix and the corresponding 
transition graph are 



T = (\ l) = (X">. (14-9) 



Admissible itineraries correspond to walks on this finite transition graph, (continued in 
example 14.9) 



In the complete rV-ary symbolic dynamics case (see example 14.3) the choice 
of the next symbol requires no memory of the previous ones. However, any further 
refinement of the state space partition requires finite memory. 



Example 14.6 Finite memory transition graphs. For the binary labeled repeller with 
complete binary symbolic dynamics, we might chose to partition the state space into 
four regions {Moo, Mm, M\o, Mu),a 1-step refinement of the initial partition {Mo, Mi}. 
Such partitions are drawn in figure 12.3, as well as figure 1.9. Topological^ f acts 
as a left shift (12.11), and its action on the rectangle [.01] is to move the decimal 
point to the right, to [0.1], forget the past, [A], and land in either of the two rectangles 
{[.10], [.11]}. Filling in the matrix elements for the other three initial states we obtain the 
1-step memory transition matrix/graph acting on the 4-regions partition 



exercise 1 1 .7 



' Too.oo 


0 


^00, 10 


0 > 


?o 1,00 


0 


7*01,10 


0 


0 


7*10,01 


0 


710,11 


I 0 


7*11,01 


0 


7*11,11 ' 




— a (oo) V A (uj ). (14.10) 



T = 



(continued in example 15.7) 

By the same token, for M-step memory the only nonvanishing matrix elements 
are of the form T SlS2 „, SM+l!Sm .„ SM , sm+i 6 {0, 1}. This is a sparse matrix, as the 
only non vanishing entries in the a = s^si • • • $M column of 7^ are in the rows 
b = si . . . smO and b - s\ . . . sm^- If we increase the number of remembered 
steps, the transition matrix grows large quickly, as the N-wy dynamics with M- 
step memory requires an [N M+l x N M+i ] matrix. Since the matrix is very sparse, 
it pays to find a compact representation for T. Such a representation is afforded by 
transition graphs, which are not only compact, but also give us an intuitive picture 
of the topological dynamics. 



exercise 15.1 
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Figure 14.3: Transition graph (graph whose links cor- 
respond to the nonzero elements of a transition matrix 
Ti, a ) describes which regions b can be reached from the 
region a in one time step. The 7 nodes correspond to 
the 7 regions of the partition (14.11). The links repre- 
sent non- vanishing transition matrix elements, such as 
r 101>1 io = ©— <-©. Dotted links correspond to a shift 
by symbol 0, and the full ones by symbol 1 . 




Figure 14.4: The self-similarity of the complete bi- 
nary symbolic dynamics represented by a binary tree: 
trees originating in nodes B, C, ■■■ (actually - any 
node) are the same as the tree originating in node A. 
Level m = 4 partition is labeled by 16 binary strings, 
coded by dotted (0) and full (1) links read down the 
tree, starting from A. See also figure 1 1.14. 




A. J 



Example 14.7 A 7-state transition graph. Consider a state space partitioned into 7 
regions 

{Moo, Mon, Mow, Mno, Mnu Mwu Mwo) ■ (14.11) 

Let the evolution in time map the regions into each other by acting on the labels as 
shift(12.11): Mon -» {Muo,Mm}, Moo -» {A1oo,Aion,Atoio} • • ■ , with nonvanish- 
ing Lno.oii, £011,00. ■ ■ ■, etc.. This is compactly summarized by the transition graph of 
figure 14.3. (continued as example 15.6) 



14.3 Transition graphs: stroll from link to link 

exercise 15.1 

What do finite graphs have to do with infinitely long trajectories? To understand 
the main idea, let us construct a graph that enumerates all possible itineraries for 
the case of complete binary symbolic dynamics. In this construction the nodes 
will be unlabeled, links labeled, signifying different kinds of transitions. 



Example 14.8 Complete binary topological dynamics. Mark a dot '■' on a piece of 
paper. Draw two short lines out of the dot, end each with a dot. The full line will signify 
that the first symbol in an itinerary is '1,' and the dotted line will signifying '0.' Repeat 
the procedure for each of the two new dots, and then for the four dots, and so on. The 
result is the binary tree of figure 14.4. Starting at the top node, the tree enumerates 
exhaustively all distinct finite itineraries of lengths n = 1, 2, 3, ■ ■ ■ 

{0,1} {00,01,10,11) 
{000,001,010,011,100,101,111,110} 

The n = 4 nodes in figure 14.4 correspond to the 16 distinct binary strings of length 
4, and so on. By habit we have drawn the tree as the alternating binary tree of fig- 
ure 11.14, but that has no significance as far as enumeration of itineraries is concerned 
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A 



Figure 14.5: The self-similarity of the _00_ pruned bi- 
nary tree: trees originating from nodes C and E are the 
same as the entire tree. 




o 

p 



- a binary tree with labels in the natural order, as increasing binary 'decimals' would 
serve just as well. 

The trouble with an infinite tree is that it does not fit on a piece of paper. On 
the other hand, we are not doing much - at each node we are turning either left or 
right. Hence all nodes are equivalent. In other words, the tree is self-similar; the trees 
originating in nodes B and C are themselves copies of the entire tree. The result of 
identifying B = A, C = A is a single node, 2-link transition graph with adjacency matrix 
(14.2) 




(14.12) 



An itinerary generated by the binary tree figure 14.4, no matter how long, corresponds 
to a walk on this graph. 



This is the most compact encoding of the complete binary symbolic dynamics. 
Any number of more complicated transition graphs such as the 2-node (14.6) and 
the 4-node (14.10) graphs generate all itineraries as well, and might be sometimes 
preferable. 

We turn next to the simplest example of pruned symbolic dynamics, the finite 
subshift obtained by prohibition of repeats of one of the symbols, let us say_ Od . 



exercise 15.6 
exercise 15.5 



Example 14.9 'Golden mean' pruning, (a link-to-link version of example 14.5) Now 
the admissible itineraries are enumerated by the pruned binary tree of figure 14.5. 
Identification of nodes A-C-E leads to the finite 2-node, 3-links transition graph 




(14.13) 



As 0 is always followed by 1, the walks on this graph generate only the admissible 
itineraries. This is the same graph as the 2-node graph (14.9). (continued in exam- 
ple 15.4) 
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1000^ 



(a) 



Figure 14.6: Conversion of the pruning front of 
figure 12.11 (b) into a finite transition graph, (a) 
Starting with the initial node delineate all 
pruning blocks on the binary tree. A solid line 
stands for "1" and a dashed line for "0". The ends 
of forbidden strings are marked with x. Label 
all internal nodes by reading the bits connecting 
the base of the tree, to the node, (b) Indi- 
cate all admissible starting blocks by arrows, (c) 
Recursively drop the leading bits in the admissi- 
ble blocks; if the truncated string corresponds to 
an internal node in (a), connect them, (d) Delete 
the transient, non-circulating nodes; all admissi- 
ble sequences are generated as walks on this finite 
transition graph, (e) Identify all distinct loops and 
construct the determinant (15.20). 



100, 





<■)) 



14.3.1 Converting pruning blocks into transition graphs 

Suppose now that, by hook or crook, you have been so lucky fishing for pruning 
rules that you now know the grammar (11.23) in terms of a finite set of pruning 
blocks Q = {b\, Z?2, • • • bt), of lengths < m. Our task is to generate all admissible 
itineraries. What to do? 

We have already seen the main ingredients of a general algorithm: (1) tran- 
sition graph encodes self-similarities of the tree of all itineraries, and (2) if we 
have a pruning block of length m, we need to descend m levels before we can start 
identifying the self-similar sub-trees. 



Finite grammar transition graph algorithm. 



1 . Starting with the root of the tree, delineate all branches that correspond to 
all pruning blocks; implement the pruning by removing the last node in each 
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pruning block (marked 'x' in figure 14.6 (a)). 

2. Label all nodes internal to pruning blocks by the itinerary connecting the 
root point to the internal node, figure 14.6 (b). Why? So far we have pruned 
forbidden branches by looking w% steps into future for a given pruning 
block, let's say b = 10110. However, the blocks with a right combina- 
tion of past and future [1.0110], [10.110], [101.10] and [1011.0] are also 
pruned. In other words, any node whose near past coincides with the begin- 
ning of a pruning block is potentially dangerous - a branch further down the 
tree might get pruned. 

3. Add to each internal node all remaining branches allowed by the alphabet, 
and label them, figure 14.6(c). Why? Each one of them is the beginning 
point of an infinite tree, a tree that should be similar to another one origi- 
nating closer to the root of the whole tree. 

4. Pick one of the free external nodes closest to the root of the entire tree, 
forget the most distant symbol in its past. Does the truncated itinerary cor- 
respond to an internal node? If yes, identify the two nodes. If not, forget 
the next symbol in the past, repeat. If no such truncated past corresponds to 
any internal node, identify with the root of the tree. 

This is a little bit abstract, so let's say the free external node in question is 
[1010.]. Three time steps back the past is [010.]. That is not dangerous, as 
no pruning block in this example starts with 0. Now forget the third step in 
the past: [10.] is dangerous, as that is the start of the pruning block [10.1 10]. 
Hence the free external node [1010.] should be identified with the internal 
node [10.]. 

5. Repeat until all free nodes have been tied back into the internal nodes. 

6. Clean up: check whether every node can be reached from every other node. 
Remove the transient nodes, i.e., the nodes to which dynamics never returns. 

7. The result is a transition graph. There is no guarantee that this is the 
smartest, most compact transition graph possible for given pruning (if you 
have a better algorithm, teach us), but walks around it do generate all ad- 
missible itineraries, and nothing else. 



Example 14.10 Heavy pruning. 

We complete this training by examples by implementing the pruning of fig- 
ure 12. 1 1 (b). The pruning blocks are 

[100.10], [10.1], [010.01], [011.01], [11.1], [101.10]. (14.14) 

Blocks 01 101, 101 10 contain the forbidden block 101, so they are redundant as pruning 
rules. Draw the pruning tree as a section of a binary tree with 0 and 1 branches and 
label each internal node by the sequence of 0's and 1 's connecting it to the root of the 
tree (figure 14.6 (a). These nodes are the potentially dangerous nodes - beginnings of 
blocks that might end up pruned. Add the side branches to those nodes (figure 14.6 (b). 
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As we continue down such branches we have to check whether the pruning imposes 
constraints on the sequences so generated: we do this by knocking off the leading bits 
and checking whether the shortened strings coincide with any of the internal pruning 
tree nodes: 00 -> 0; 110 -> 10; Oil -> 11, :0101 -> 101 (pruned); 1000 -> 00 -> 00 -> 0; 
10011 -> 0011 -> Oil -> 1 1; 01000 -> 0. 

The frees originating in identified nodes are identical, so the tree is "self-similar." 
Now connect the side branches to the corresponding nodes, figure 14.6(d). Nodes "." 
and 1 are transient nodes; no sequence returns to them, and as you are interested here 
only in infinitely recurrent sequences, delete them. The result is the finite transition 
graph of figure 14.6 (d); the admissible bi-infinite symbol sequences are generated as 
all possible walks on this graph. 



Resume 

The set of all admissible itineraries is encoded multiplicatively by transition ma- 
trices, diagrammatically by transition graphs. Pruning rules for inadmissible se- 
quences are implemented by constructing corresponding transition matrices and/or 
transition graphs. 



Commentary 

Remark 14.1 Transition graphs. We enjoyed studying Lind and Marcus [ ] intro- 
duction to symbolic dynamics and transition graphs. Finite transition graphs or finite 
automata are discussed in refs. [2, 3, 4]. They belong to the category of regular lan- 
guages. Transition graphs for unimodal maps are discussed in refs. [8, 9, 10]. (see also 
remark 11.1) 

Remark 14.2 Inflating transition graphs. In the above examples the symbolic dy- 
namics has been encoded by labeling links in the transition graph. Alternatively one can 
encode the dynamics by labeling the nodes, as in example 14.6, where the 4 nodes refer 
to 4 Markov partition regions {Moo, Atoi» Alio, Aln), and the 8 links to the 8 non-zero 
entries in the 2-step memory transition matrix ( 14.10). 

Remark 14.3 The unbearable growth of transition graphs. A construction of finite 
Markov partitions is described in refs. [11, 12], as well as in the innumerably many other 
references. 

If two regions in a Markov partition are not disjoint but share a boundary, the bound- 
ary trajectories require special treatment in order to avoid overcounting, see sect. 21.3.1. 
If the image of a trial partition region cuts across only a part of another trial region and 
thus violates the Markov partition condition (11 .2), a further refinement of the partition is 
needed to distinguish distinct trajectories. 
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The finite transition graph construction sketched above is not necessarily the minimal 
one; for example, the transition graph of figure 14.6 does not generate only the "funda- 
mental" cycles (see chapter 20), but shadowed cycles as well, such as fooon m (15.20). 
For methods of reduction to a minimal graph, consult refs. [ 8, 48, 9]. Furthermore, when 
one implements the time reversed dynamics by the same algorithm, one usually gets a 
graph of a very different topology even though both graphs generate the same admissible 
sequences, and have the same determinant. The algorithm described here makes some 
sense for 1 -dimensional dynamics, but is unnatural for 2-dimensional maps whose dy- 
namics it treats as 1 -dimensional. In practice, generic pruning grows longer and longer, 
and more plentiful pruning rules. For generic flows the refinements might never stop, 
and almost always we might have to deal with infinite Markov partitions, such as those 
that will be discussed in sect. 15.5. Not only do the transition graphs get more and more 
unwieldy, they have the unpleasant property that every time we add a new rule, the graph 
has to be constructed from scratch, and it might look very different form the previous 
one, even though it leads to a minute modification of the topological entropy. The most 
determined effort to construct such graphs may be the one of ref. [ 13]. Still, this seems to 
be the best technology available, unless the reader alerts us to something superior. 
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Exercises 

14.1. Time reversibility.** Hamiltonian flows are time re- 
versible. Does that mean that their transition graphs are 
symmetric in all node — > node links, their transition 
matrices are adjacency matrices, symmetric and diago- 
nalizable, and that they have only real eigenvalues? 

14.2. Alphabet {0,1}, prune _1000^ .00100^ DllOO^ 
This example is motivated by the pruning front descrip- 
tion of the symbolic dynamics for the Henon-type map- 
sremark 12.3. 

step 1. _1000_ prunes all cycles with a _000_ subse- 
quence with the exception of the fixed point 0; hence we 
factor out (1 - frj) explicitly, and prune D00_ from the 
rest. This means that xq is an isolated fixed point - no 
cycle stays in its vicinity for more than 2 iterations. In 
the notation of sect. 14.3.1, the alphabet is {1, 2, 3; 0}, 



and the remaining pruning rules have to be rewritten in 
terms of symbols 2=10, 3=100: 

step 2. alphabet {1, 2, 3; 0), prune 33^ .213^313- 
This means that the 3-cycle 3 = 100 is pruned and no 
long cycles stay close enough to it for a single _100_ 
repeat. Prohibition of 33 _ is implemented by drop- 
ping the symbol "3" and extending the alphabet by the 
allowed blocks 13, 23: 

step 3. alphabet {1, 2, 13, 23; 0), prune ^13_„ _23_13_ , 
_13 13 _, where 13 = 13, 23_ = 23 are now used as single 
letters. Pruning of the repetitions _13_13__ (the 4-cycle 
13 = 1 100 is pruned) yields the 

result: alphabet {1, 2, 23, 113; 0), unrestricted 4-ary 
dynamics. The other remaining possible blocks , 213 -, 
_2313 _ are forbidden by the rules of step 3. 
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Chapter 15 



Counting 



I'm gonna close my eyes 
And count to ten 
I'm gonna close my eyes 
And when I open them again 
Everything will make sense to me then 
— Tina Dico, 'Count To Ten' 



We are now in a position to apply the periodic orbit theory to the first and 
the easiest problem in theory of chaotic systems: cycle counting. This 
is the simplest illustration of the raison d'etre of periodic orbit theory; 
we derive a duality transformation that relates local information - in this case the 
next admissible symbol in a symbol sequence - to global averages, in this case 
the mean rate of growth of the number of cycles with increasing cycle period. In 
chapter 14 we have transformed, by means of the transition matrices / graphs, the 
topological dynamics of chapter 1 1 into a multiplicative operation. Here we show 
that the nth power of a transition matrix counts all itineraries of length n. The 
asymptotic growth rate of the number of admissible itineraries is therefore given 
by the leading eigenvalue of the transition matrix; the leading eigenvalue is in turn 
given by the leading zero of the characteristic determinant of the transition matrix, 
which is - in this context - called the topological zeta function. 

For flows with finite transition graphs this determinant is a finite topological 
polynomial which can be read off the graph. However, (a) even something as 
humble as the quadratic map generically requires an infinite partition (sect. 15.5), 
but (b) the finite partition approximants converge exponentially fast. 

The method goes well beyond the problem at hand, and forms the core of the 
entire treatise, making tangible the abstract notion of "spectral determinants" yet 
to come. 
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15.1 How many ways to get there from here? 



In the 3-disk system of example 11.1 the number of admissible trajectories dou- 
bles with every iterate: there are K n = 3-2" distinct itineraries of length n. If disks 
are too close and a subset of trajectories is pruned, this is only an upper bound and 
explicit formulas might be hard to discover, but we still might be able to establish 
a lower exponential bound of the form K,^ > Ce nh . Bounded exponentially by 



3e" > K n > Ce , the number of trajectories must grow exponentially as a 
function of the itinerary length, with rate given by the topological entropy: 



h = lim -lnK n . 

n— >oo n 



(15.1) 



We shall now relate this quantity to the spectrum of the transition matrix, with 
the growth rate of the number of topologically distinct trajectories given by the 
leading eigenvalue of the transition matrix. 



The transition matrix element e {0, 1} in (14. 1) indicates whether the tran- 
sition from the starting partition j into partition i in one step is allowed or not, and 
the (i, j) element of the transition matrix iterated n times 



exercise 15.1 



( T ")ij = XI Tik[ Tk ^--- Tk «- 1 J 

ki,k 2 ,...,k„-l 



(15.2) 



receives a contribution 1 from every admissible sequence of transitions, so (7"),y 
is the number of admissible n symbol itineraries starting with j and ending with i. 



Example 15.1 3-disk itinerary counting. 

the 3-disk transition matrix ( 14.8) 



The(T 2 ) l3 = T 12 T 2 3 = 1 element of T 2 for 



(° 


1 


1 \ 


2 


(2 1 1\ 


1 


0 


1 




1 2 1 


, 1 


1 


0) 




,112, 



(15.3) 



corresponds to path 3 -> 2 -> 1, the only2-step path fronn 3 to l^while (T 2 ) 33 = T 3i T 13 + 
T 3 iT 23 = 2 counts the two returning, periodic paths 31 and 32. Note that the trace 
trT 2 = (r 2 )nj- (7^)22_+ (r 2 ) 33 = 2T X3 T M + 2T 2x T n + 2T 32 T 23 has a contribution from 
each 2-cycle 12, 13, 23 twice, one contribution from each periodic point. 



The total number of admissible itineraries of n symbols is 



K n = Y i cni J = (i,i,...,i)T" 



( i 
i 



(15.4) 
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We can also count the number of prime cycles and pruned periodic points, but 
in order not to break up the flow of the argument, we relegate these pretty results 
to sect. 15.7. Recommended reading if you ever have to compute lots of cycles. 

A finite [N x N] matrix T has eigenvalues {Aq, A\ , • • • , A m -\} and (right) eigen- 
vectors {ifQ, (pi, • • • , ifm-i] satisfying Tip a = A a tp a . Expressing the initial vector in 
(15.4) in this basis (which might be incomplete, with m < N eigenvectors), 



1 

1 / 



m-1 



m— 1 



a=0 



a=0 



and contracting with ( 1 , 1 , . . . , 1 ), we obtain 



m— 1 



a=0 



exercise 15.3 



The constants c a depend on the choice of initial and final partitions: In this ex- 
ample we are sandwiching T" between the vector ( 1, 1 1 ) and its transpose, 

but any other pair of vectors would do, as long as they are not orthogonal to the 
leading eigenvector (p$. In an experiment the vector ( 1, 1, . . . , 1 ) would be re- 
placed by a description of the initial state, and the right vector would describe the 
measurement time n later. 

Perron theorem states that a Perron-Frobenius matrix has a nondegenerate 
(isolated) positive real eigenvalue Ao > 1 (with a positive eigenvector) which 
exceeds the moduli of all other eigenvalues. Therefore as n increases, the sum 
is dominated by the leading eigenvalue of the transition matrix, A) > IRe^alj 
a = 1,2, • • • ,m - 1, and the topological entropy (15.1) is given by 



lim - In co/ln 

n— *oo yi 



l + £i(^ 

Co \A 0 



+ 



= In Aq + lim 

n— »oc 

= In ^0 • 



In co 1 c\ lA\ 
n n co \Aq 



+ 



(15.5) 



What have we learned? The transition matrix T is a one-step, short time operator, 
advancing the trajectory from one partition to the next admissible partition. Its 
eigenvalues describe the rate of growth of the total number of trajectories at the 
asymptotic times. Instead of painstakingly counting K\, K2, K3, . . . and estimating 
(15.1) from a slope of a log-linear plot, we have the exact topological entropy if 
we can compute the leading eigenvalue of the transition matrix T. This is reminis- 
cent of the way free energy is computed from transfer matrices for 1 -dimensional 
lattice models with finite range interactions. Historically, it is this analogy with 
statistical mechanics that led to introduction of evolution operator methods into 
the theory of chaotic systems. 
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15.2 Topological trace formula 

There are two standard ways of computing eigenvalues of a matrix - by evaluating 
the trace tr T" = £ X^, or by evaluating the determinant det(l - zT). We start by 
evaluating the trace of transition matrices. The main lesson will be that the trace 
receives contributions only from itineraries that return to the initial partition, i.e., 
periodic orbits. 

Consider an M-step memory transition matrix, like the 1-step memory exam- 
ple (14.10). The trace of the transition matrix counts the number of partitions that 
map into themselves. More generally, each closed walk through n concatenated 
entries of T contributes to trT" the product (15.2) of the matrix entries along the 
walk. Each step in such a walk shifts the symbolic string by one symbol; the trace 
ensures that the walk closes on a periodic string c. Define to be the local trace, 
the product of matrix elements along a cycle c, each term being multiplied by a 
book keeping variable z. In chapters that follow, the 'local trace' {■ will take a con- 
tinuum of values, so for the remainder of this chapter we stick to the notation 
rather than to the 0 or f values specific to the counting problem. 

The quantity z"tt T" is then the sum of t c for all cycles of period n. The t c 
= (product of matrix elements along cycle c is manifestly cyclically invariant, 
^ioo - tow = tool, so a prime cycle p of period n p contributes n p times, once for 
each periodic point along its orbit. For the purposes of periodic orbit counting, 
the local trace takes values 



i.e., (setting z — 1) the local trace is t p - 1 if the cycle is admissible, and t p - 0 
otherwise. 

Example 15.2 Traces for binary symbolic dynamics. For example, for the [8x8] 
transition matrix T SjS2S3tSoSjS2 version of (14.10), or any refined partition [2"x2 n ] transition 
matrix, n arbitrarily large, the periodic point 100 contributes t m = z 3 r l00010 7 ' 01000i r 001 100 
to z 3 trT 3 . This product is manifestly cyclically invariant, fioo = foio = fooi, so a prime 
cycle p = 001 of period 3 contributes 3 times, once for each periodic point along its 
orbit. 

exercisi 

For the binary labeled non-wandering set the first few traces are given by (con- 
sult tables 15.1 and 15.2) 

ztiT = t 0 + t u 

zhrT 2 = tl + 1\ + 2%, 

z 3 trT 3 = f 3 + f 3 + 3fioo + 3fioi, 

z 4 tr T 4 = t 4 0 + t 4 + 2t 2 w + 4/iooo + 4f K )oi + 4fiou- (15.7) 

In the binary case the trace picks up only two contributions on the diagonal, r 0 ...o,o-o + 
T\...\,i...\, no matter how much memory we assume. We can even take infinite memory 




z n P if p is an admissible cycle 
0 otherwise, 



(15.6) 
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Table 15.1: Prime cycles for the binary symbolic dynamics up to length 9. The numbers 
of prime cycles are given in table 15.3. 



Hp 


P 


n p p 


Hp p 


n p p 


n p p 


1 
1 


A 

U 


7 000 1 00 1 
/ UUU1UU1 


8 00001 1 1 1 
o UUUU1 111 


Q 00000 1101 
y UUUUU 1 1 U 1 


Q 00 1 00 1111 
y UU 1 UU 1111 




1 

1 


00001 1 1 
UUUU 111 


0001 01 1 1 
UUU1U1 1 1 


00001 001 1 
UUUU 1 UUl 1 


00 10101 1 1 
UU1U1U1 1 1 


z 


01 

Ul 


000 1011 
UUU1U1 1 


000 11011 
UUUl 1U11 


0000 10101 
UUUU1U1U1 


00 1011011 
UU 1 U 1 1 U 1 1 


2 


001 
UUl 


000 1101 
UUUl 1U1 


000 11101 
UUU 1 1 1 U 1 


0000 1 1 00 1 
UUUU 1 1 UU 1 


00 1 0 1 1 1 0 1 
UU 1 U 1 1 1 U 1 




01 1 
Ul 1 


00 1 00 1 1 
UUl UUl 1 


001 001 1 1 
UU 1 UU 1 1 1 


0001 0001 1 
UUUl UUUl 1 


00 1 1 00 1 1 1 
UU 1 1 UU 111 


A 
H 


0001 
UUUl 


00 1 0 1 0 1 
UU1U1U1 


00 1 0 1 0 1 1 
UU 1 U 1 U 1 1 


0001 00101 
UUU1UU1U1 


001 1 01 01 1 
UUl 1U1U1 1 




001 1 
UU 1 1 


0001 1 1 1 
UUUl 111 


00 1 0 1 101 
UU 1 U 1 1 U 1 


0001 01 001 
UUU 1 U 1 UU 1 


001 1 01 1 01 
UUl 1U1 1U1 




01 1 1 


00 10111 
UU1U1 1 1 


001 10101 
UUl 1U1U1 


000001 1 1 1 
UUUUU 1111 


00 1 1 10101 
UUl 1 1U1U1 


J 


00001 


00 1 1 0 1 1 
UU 1 1 U 1 1 


000 11111 
UUU 11111 


0000 10111 
UUUU 1 U 1 1 1 


01 01 01 01 1 
U1U1U1U1 1 




0001 1 
UUUl 1 


00 1 1 1 0 1 
UU 1 1 1 U 1 


00 1 0 1 1 1 1 
UU 1 U 1 1 1 1 


00001 1 01 1 
UUUU 1 1 U 1 1 


0001 11111 
UUU 111111 




001 01 
UU1U1 


010101 1 
U1U1U1 1 


001 10111 
UU 1 1 U 1 1 1 


00001 1 101 
UUUUl 1 1U1 


00 1011111 
UU 1 U 1 1 1 1 1 




001 1 1 
UUl 1 1 


001 1111 
UU 11111 


00 111011 
UU 1 1 1 U 1 1 


0001 001 1 1 
UUU1UU1 1 1 


001 101111 
UUl 1U1 1 1 1 




01 01 1 
U1U1 1 


0101111 
U1U1 111 


00 1 1 1 1 0 1 
UU 1 1 1 1 U 1 


0001 01 01 1 
UUU1U1U1 1 


00 1 1 1 0 1 1 1 
UUl 1 1U1 1 1 




0 1 1 1 1 

Ul 1 1 1 


0110111 
Ul 1U1 1 1 


01 01 01 1 1 
U1U1U1 1 1 


0001 01 101 
UUU1U1 1U1 


00 1 1 1 1 0 1 1 
UU 1 1 1 1 U 1 1 


6 


000001 


0111111 


01011011 


000110011 


001111101 




000011 


8 00000001 


00111111 


000110101 


010101111 




000101 


00000011 


01011111 


000111001 


010110111 




000111 


00000101 


01101111 


001001011 


010111011 




001011 


00001001 


01111111 


001001101 


001111111 




001101 


00000111 


9 000000001 


001010011 


010111111 




001111 


00001011 


000000011 


001010101 


011011111 




010111 


00001101 


000000101 


000011111 


011101111 




011111 


00010011 


000001001 


000101111 


011111111 


7 


0000001 


00010101 


000010001 


000110111 






0000011 


00011001 


000000111 


000111011 






0000101 


00100101 


000001011 


000111101 





Table 15.2: The total numbers N„ of periodic points of period n for binary symbolic dy- 
namics. The numbers of contributing prime cycles illustrates the preponderance of long 
prime cycles of period n over the repeats of shorter cycles of periods n p , where n - rn p . 
Further enumerations of binary prime cycles are given in tables 15.1 and 15.3. (L. Ron- 
doni) 



n N„ # of prime cycles of period n 



1234567 8 9 10 
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2 


4 
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2 


2 






4 


16 


2 ] 




3 




5 


32 


2 






6 


6 


64 


2 ] 


2 




9 


7 


128 


2 






18 


8 


256 


2 ] 




3 


30 


9 


512 


2 


2 




56 


10 


1024 


2 ] 
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M — > oo, in which case the contributing partitions are shrunk to the fixed points, trT = 

* 0,0 + *T,T- 

If there are no restrictions on symbols, the symbolic dynamics is complete, and 
all binary sequences are admissible (or allowable) itineraries. As this type of symbolic 
dynamics pops up frequently, we list the shortest binary prime cycles in table 15.1. 

©X6I"CIS6 1 1 -t— 

Hence tr T" - N„ counts the number of admissible periodic points of period 
n. The nth order trace (15.7) picks up contributions from all repeats of prime 
cycles, with each cycle contributing n p periodic points, so N n , the total number of 
periodic points of period n is given by 



z n N n = z"tr T" = J] n/J"" = £ n p £ 6 n ,„ pr f p . (15.8) 

n p \n p r=l 

Here m\n means that m is a divisor of n. An example is the periodic orbit counting 
in table 15.2. 

In order to get rid of the awkward divisibility constraint n = j^r in the above 
sum, we introduce the generating function for numbers of periodic points 



T 

n=\ 



The right hand side is the geometric series sum of N n - tr T". Substituting (15.8) 
into the left hand side, and replacing the right hand side by the eigenvalue sum 
tr T" = 2 A n a , we obtain our first example of a trace formula, the topological trace 
formula 

y _ZA«_ = yn f t JL 



A trace formula relates the spectrum of eigenvalues of an operator - here the tran- 
sition matrix - to the spectrum of periodic orbits of a dynamical system. It is a 
statement of duality between the short-time, local information - in this case the 
next admissible symbol in a symbol sequence - to long-time, global averages, in 
this case the mean rate of growth of the number of cycles with increasing cycle 
period. 

The z n sum in (15.9) is a discrete version of the Laplace transform (see sect. 18.1.2), 
and the resolvent on the left hand side is the antecedent of the more sophisticated 
trace formulas (18.10) and (18.23).We shall now use this result to compute the 
spectral determinant of the transition matrix. 
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15.3 Determinant of a graph 



Our next task is to determine the zeros of the spectral determinant of an [m x m] 
transition matrix 



m— 1 

det(l -zT) = -zA a ) . (15.11) 

We could now proceed to diagonalize T on a computer, and get this over with. It 
pays, however, to dissect det (1 -zT) with some care; understanding this computa- 
tion in detail will be the key to understanding the cycle expansion computations of 
chapter 20 for arbitrary dynamical averages. For T a finite matrix, (15.11) is just 
the characteristic polynomial for T . However, we shall be able to compute this ob- 
ject even when the dimension of T and other such operators becomes infinite, and 
for that reason we prefer to refer to (15. 1 1) loosely as the "spectral determinant." 

There are various definitions of the determinant of a matrix; we will view the 
determinant as a sum over all possible permutation cycles composed of the traces 
tr T k , in the spirit of the determinant-trace relation (1.16): 



det ( 1 - zT) = exp (tr ln(l - zT)) = exp I - J] -tr T" I 

V n=l / 

2 

= \-ztrT- — ((tr T) 2 - tr 7 2 ) - . . . (15.12) 



This is sometimes called a cumulant expansion. Formally, the right hand is a 
Taylor series in z about z = 0. If T is an [ntxm] finite matrix, then the characteristic 
polynomial is at most of order m. In that case the coefficients of i must vanish 
exactly for n > m. 

We now proceed to relate the determinant in (15.12) to the corresponding 
transition graph of chapter 14: toward this end, we start with the usual textbook 
expression for a determinant as the sum of products of all permutations 

detM = J] (-lfM Ul M 2 ,n 2 ■ ■ ■ M m , nm (15.13) 



where M = 1 - zT is a [m X m] matrix, {n\ denotes the set of permutations of m 
symbols, n\ is the permutation n applied to k, and (-If = +1 is the parity of 
permutation n. The right hand side of (15.13) yields a polynomial in T of order m 
in z: a contribution of order n in z picks upm-n unit factors along the diagonal, 
the remaining matrix elements yielding 

(-zT(-l) n T Sinsr --T SnnSn (15.14) 



exercise 4.1 
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where n is the permutation of the subset of n distinct symbols s± • • • s„ indexing T 
matrix elements. As in (15.7), we refer to any combination t c = T Sl Sk T S3S2 ■ ■ • T S2Sl , 
for a given itinerary c - si S2 • • • Sk, as the local trace associated with a closed loop 
c on the transition graph. Each term of the form (15.14) may be factored in terms 
of local traces t Cl t C2 ■ ■ ■ t Ck , i.e., loops on the transition graph. These loops are non- 
intersecting, as each node may only be reached by one link, and they are indeed 
loops, as if a node is reached by a link, it has to be the starting point of another 
single link, as each sj must appear exactly once as a row and column index. 

So the general structure is clear, a little more thinking is only required to get 
the sign of a generic contribution. We consider only the case of loops of length 
1 and 2, and leave to the reader the task of generalizing the result by induction. 
Consider first a term in which only loops of unit length appear in (15.14), i.e., 
only the diagonal elements of T are picked up. We have k — m loops and an even 
permutation n so the sign is given by (-if, where k is the number of loops. Now 
take the case in which we have i single loops and j loops of length n - 2j + i. 
The parity of the permutation gives (-1) 7 and the first factor in (15.14) gives 
(-1)" = (-l) 2j+l . So once again these terms combine to (-if, where k = i + j is 
the number of loops. Let f be the maximal number of non-intersecting loops. We 

exercise 15.4 

may summarize our findings as follows: 

The characteristic polynomial of a transition matrix is given by 
the sum of all possible partitions n of the corresponding transi- 
tion graph into products of k non-intersecting loops, with each loop 
trace t p carrying a minus sign: 

/ 

det(l-zr>=2 J^(-if tpi ---t Pk (15.15) 

k=0 n 

Any self-intersecting loop is shadowed by a product of two loops that share the 
intersection point. As both the long loop and its shadow t a tb in the case at hand 
carry the same weight ^"" + "*, the cancelation is exact, and the loop expansion 
(15.15) is finite. In the case that the local traces count prime cycles (15.6), t p = 0 
or z", we refer to det (1 - zT) as the topological polynomial. 

We refer to the set of all non-self-intersecting loops \t px , t P2 , ■ ■ ■ t Pf \ as the fun- 
damental cycles (for an explicit example, see the loop expansion of examplel5.6). 
This is not a very good definition, as transition graphs are not unique -the most we 
know is that for a given finite-grammar language, there exist transition graph(s) 
with the minimal number of loops. Regardless of how cleverly a transition graph 
is constructed, it is always true that for any finite transition graph the number of 
fundamental cycles / is finite. If the graph has m nodes, no fundamental cycle is 
of period longer than m, as any longer cycle is of necessity self-intersecting. 

The above loop expansion of a determinant in terms of traces is most easily 
grasped by working through a few examples. The complete binary dynamics tran- 
sition graph of figure 14.4 is a little bit too simple, but let us start humbly and 
consider it anyway. 
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Example 15.3 Topological polynomial for complete binary dynamics: (continu- 
ation of example 14.2) There are only two non-intersecting loops, yielding 

det(l-zr) = 1 -to-h -(% -t 0 h) = l-2z (15.16) 

{ (sf^~y^y = i - ( ® - (^y - ( (§c^~~js - c O ( -®) • 

Due to the symmetry under 0 <-> 1 interchange, this is a redundant graph (the 2-cycle 
foi is exactly shadowed by the 1 -cycles). Another way to see is that itineraries are 
labeled by the {0, 1} links, node labels can be omitted. As both nodes have 2 in-links 
and 2 out-links, they can be identified, and a more economical presentation is in terms 
of the [lxl] adjacency matrix ( 14. 12) 

det(l-zA) = l-f 0 -fi = l-2z (15.17) 

oe = l -0>-$ 



The leading (and only) zero of this characteristic polynomial yields the topological en- 
tropy e h = 2. As there are K„ = 2" binary strings of length N, this comes as no 
surprise. 



Similarly, for the complete symbolic dynamics of ,/V symbols the transition graph 
has one node and /V links, yielding 

det(l -zT) = 1 -Nz, (15.18) 

which gives the topological entropy h = In N. 

Example 15.4 Golden mean pruning: The "golden mean" pruning of example 14.5 
has one grammar rule: the substring _1 1 _ is forbidden. The corresponding transition 

0^Q|-Q|g0 15 5 

graph non-intersecting loops are of length 1 and 2, so the topological polynomial is 
given by 

det(l-zr) = 1 -to-kn = 1 -z-z 2 (15.19) 

i' (of Ol) = 1 - {"® - (of 



The leading root of this polynomial is the golden mean, so the entropy (15.5) is the 
logarithm of the golden mean, h = In 



fast track: 

sect. 15.4, p. 311 
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Figure 15.1: (a) The region labels in the nodes 
of transition graph figure 14.3 can be omitted, as 
the links alone keep track of the symbolic dynam- 
ics. (b)-(j) The fundamental cycles (15.23) for the 
transition graph (a), i.e., the set of its non-self- 
intersecting loops. Each loop represents a local 
trace t p , as in (14.5). 




Example 15.5 Nontrivial pruning: The non-self-intersecting loops of the transition 
graph of figure 14.6(d) are indicated in figure 14.6 (e). The determinant can be written 
down by inspection, as the sum of all possible partitions of the graph into products of 
non-intersecting loops, with each loop carrying a minus sign: 



det(l-zr) - 1 - fo - foon - foooi _ *ooon 
+fo?ooii + root lfoooi ■ 



(15.20) 



With t p = z"", where n„ is the period of the p -cycle, the smallest root of 



0 = 1 - z - 2z 4 + z 8 



(15.21) 



yields the topological entropy h = - lnz, z = 0.658779 ...,h = 0.417367 . . ., significantly 
smaller than the entropy of the covering symbolic dynamics, the complete binary shift 
with topological entropy h = In 2 = 0.693 . . . 



exercise 15.9 



Example 15.6 Loop expansion of a transition graph. (continued from exam- 
ple 14. 7) Consider a state space covered by 7 neighborhoods (14.11 ), with the topo- 
logical time evolution given by the transition graph of figure 14.3. 
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The determinant det(l - zT) of the transition graph in figure 14.3 can be read 
off the graph, and expanded as a polynomial in z, with coefficients given by products of 
non-intersecting loops (traces of powers ofT)of the transition graph figure 15. 1 : 

det(l - zT) = 1 - {t 0 + h)z- (foi - toh)z 2 - (t m +t ou - foifo - foifi)z 3 

- (/ooii + f om _ f ooi f i - fon^o _ tonh + foi f o f i)^ 4 

-(fooin - touito — toouh +toutoti)z 5 (15.22) 

- (fooion + foonoi _ f oon f oi - fooifoii)^ 6 

-(fooioin + foonioi _ fooionfi - foonoifi - foomtoi + foonfoifi + fooifoiifi)^ 7 • 

Twelve cycles up to period 7 are fundamental cycles: 

o,T, oT, ooT, oTT, ooTT, oTTT, ooTTT, ooion, oonoi, ooiom, oomoi , (15.23) 

out of the total of 41 prime cycles (listed in table 15.1) up to cycle period 7. The 
topological polynomial t p -> z"" 

l/^op(z) = l-2z-z 7 

is interesting; the shadowing fails first at the cycle length n = 7, so the topological 
entropy is only a bit smaller than the binary h = In 2. Not exactly obvious from the 
partition (14.11). 



15.4 Topological zeta function 



What happens if there is no finite-memory transition matrix, if the transition graph 
is infinite? If we are never sure that looking further into the future will reveal no 
further forbidden blocks? There is still a way to define the determinant, and this 
idea is central to the whole treatise: the determinant is then defined by its cumulant 
expansion (15.12) 

CO 

det(l -zT) = 1 ~Yj £ nZ n ■ (15.24) 

n=\ 



exercise 4.1 



Example 15.7 Complete binary det(l - zT) expansion. (continuation of exam- 
ple 14.6) consider the loop expansion of the binary 1-step memory transition graph 
(14.10) 




= 1 - T ® - 6 



-(<^>-<OC>) 



1 - t Q - h - [fan - fi?o)] - [from - foi*o) + Oon - foi'i)] 
-[(foooi - frjfooi) + (fom _ fonfi) 
+(foon _ f ooifi ~ fofon + fo%*i)] 

1-J//-Z e " = l ~ 2z - (15 ' 25) 
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For finite dimensional matrices the expansion is a finite polynomial, and (15.24) 
is an identity; however, for infinite dimensional operators the cumulant expansion 
coefficients c„ define the determinant. 

Let us now evaluate the determinant in terms of traces for an arbitrary transi- 
tion matrix. In order to obtain an expression for the spectral determinant (15.11) 
in terms of cycles, substitute (15.8) into (15.24) and sum over the repeats of prime 
cycles using ln( 1 - x) = -2r*'7 r ; 



det(l -zT) 



exp 



CO f \ 



p r=l 



-XX f =exp X 11 * 1 -^ 



V p 



~]a-zA a ) = ~[a-t P ), 



(15.26) 



where for the topological entropy the weight assigned to a prime cycle p of period 
n p is t p - z" p if the cycle is admissible, or t p = 0 if it is pruned. This determinant 
is called the topological or the Artin-Mazur zeta function, conventionally denoted 
by 



i/<rto P (z) = n (1 - z " p) = 1_ X 



(15.27) 



Counting cycles amounts to giving each admissible prime cycle p weight tp - z Hp 
and expanding the Euler product (15.27) as a power series in z. The number of 
prime cycles p is infinite, but if T is an [mXm] finite matrix, then the number of 
roots A a is at most m, the characteristic polynomial is at most of order m, and the 
coefficients of z" vanish for n > m. As the precise expression for the coefficients q, 
in terms of local traces t p is more general than the current application to counting, 
we postpone its derivation to chapter 20. 

The topological entropy h can now be determined from the leading zero z = 
e~ h of the topological zeta function. For a finite [m x m] transition matrix, the 
number of terms in the characteristic equation (15.15) is finite, and we refer to 
this expansion as the topological polynomial of order < m. The utility of defining 
the determinant by its cumulant expansion is that it works even when the partition 
is infinite, m — > oo; an example is given in sect. 15.5, and many more later on. 



fast track: 

sect. 15.5, p. 313 



15.4.1 Topological zeta function for flows 



We now apply the method that we shall use in deriving (18.23) to the 
problem of deriving the topological zeta functions for flows. The time-weighted 
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density of prime cycles of period t is 

m = Z Z T P 5{t - rT p ] ■ (15 - 28) 



p r=l 



The Laplace transform smooths the sum over Dirac delta spikes (see (18.22)) 
and yields the topological trace formula 

Xoo °° 
dte~" 6(t - rT p ) = £ T p £ e"*V (15.29) 
-+ p r=l 



and the topological zeta function for flows: 

llbo V {s) = \\(l-e- sT o) , (15.30) 

p 

related to the trace formula by 



CO „ 

p r=\ 



This is the continuous time version of the discrete time topological zeta function 
(15.27) for maps; its leading zero s = -h yields the topological entropy for a flow. 



15.5 Topological zeta function for an infinite partition 

(K.T Hansen and P. Cvitanovic) 



To understand the need for topological zeta function (15.24), we turn a 
dynamical system with (as far as we know - there is no proof) an infinite partition, 
or an infinity of ever-longer pruning rules. Consider the 1 -dimensional quadratic 
map (11.3) 

/(*) = Ax(l -x), A = 3.8 . 

Numerically the kneading sequence (the itinerary of the critical point x — 1/2 

exercise 15.20 

K = 1011011110110111101011110111110... 
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Figure 15.2: The logarithm lnlzj, - zo\ of the dif- 
ference between the leading zero of the n-th polyno- 
mial approximation to topological zeta function and 
our best estimate (15.33), as a function of order of the 
polynomial n (the topological zeta function evaluated 
for the closest value of A to A = 3.8 for which the 
quadratic map has a stable cycle of period n). (from 
K.T. Hansen [22]) 




20 10 60 80 
length 



Figure 15.3: The 90 zeroes of the topological zeta 
function for the quadratic map for A = 3.8 approxi- 
mated by the nearest topological zeta function with a 
stable cycle of length 90. (from K.T Hansen [22]) 




-1.5 -1 -0.5 0 0.5 1 1.5 
ReU) 



where the symbolic dynamics is defined by the partition of figure 11.12. How this 
kneading sequence is converted into a series of pruning rules is a dark art.For the 
moment it suffices to state the result, to give you a feeling for what a "typical" 
infinite partition topological zeta function looks like. For example, approximating 
the dynamics by a transition graph corresponding to a repeller of the period 29 
attractive cycle close to the A = 3.8 strange attractor yields a transition graph with 
29 nodes and the characteristic polynomial 



1/4 



(29) 
top 



l-z 

+ z n 



2 3 
Z +Z 



, 5 _1_ , 6 

z +z 



12 



,13 , 14 

z + z. 



,15 , ,16 

z + z 



,10 



18 19 , 20 
Z + Z + Z 



,21 , ,22 ,23 , 24 , 25 26 , 27 ,28 

-z + z - z + z + z - z + z - z 



(15.31) 



The smallest real root of this approximate topological zeta function is 



z = 0.62616120... 



(15.32) 



Constructing finite transition graphs of increasing length corresponding to A —> 
3.8 we find polynomials with better and better estimates for the topological en- 
tropy. For the closest stable period 90 orbit we obtain our best estimate of the 
topological entropy of the repeller: 



h = - In 0.62616130424685 ... - 0.46814726655867 



(15.33) 



Figure 15.2 illustrates the convergence of the truncation approximations to the 
topological zeta function as a plot of the logarithm of the difference between the 
zero of a polynomial and our best estimate (15.33), plotted as a function of the 
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period of the stable periodic orbit. The error of the estimate (15.32) is expected 
to be of order z 29 » e~ 14 because going from period 28 to a longer truncation 
typically yields combinations of loops with 29 and more nodes giving terms +r 9 
and of higher order in the polynomial. Hence the convergence is exponential, 
with an exponent of -0.47 = -h, the topological entropy itself. In figure 15.3 
we plot the zeroes of the polynomial approximation to the topological zeta func- 
tion obtained by accounting for all forbidden strings of length 90 or less. The 
leading zero giving the topological entropy is the point closest to the origin. Most 
of the other zeroes are close to the unit circle; we conclude that for infinite state 
space partitions the topological zeta function has a unit circle as the radius of 
convergence. The convergence is controlled by the ratio of the leading to the 
next-to-leading eigenvalues, which is in this case indeed A/Aq = \je h = e~ h . 



15.6 Shadowing 

The topological zeta function is a pretty function, but the infinite product (15.26) 
should make you pause. For finite transition matrices the left hand side is a deter- 
minant of a finite matrix, therefore a finite polynomial; so why is the right hand 
side an infinite product over the infinitely many prime periodic orbits of all peri- 
ods? 

The way in which this infinite product rearranges itself into a finite polynomial 
is instructive, and crucial for all that follows. You can already take a peek at the 
full cycle expansion (20.7) of chapter 20; all cycles beyond the fundamental 
and t\ appear in the shadowing combinations such as 

t Si S 2 -S„ ~ tsiS 2 -S m ts m +l-S„ ■ 

For subshifts of finite type such shadowing combinations cancel exactly, if we are 
counting cycles as we do in (15.16) and (15.25), or if the dynamics is piecewise 
linear, as in exercise 19.3. As we argue in sect. 1.5.4, for nice hyperbolic flows 
whose symbolic dynamics is a subshift of finite type, the shadowing combina- 
tions almost cancel, and the spectral determinant is dominated by the fundamental 
cycles from (15.15), with longer cycles contributing only small "curvature" cor- 
rections. 

These exact or nearly exact cancelations depend on the flow being smooth and 
the symbolic dynamics being a subshift of finite type. If the dynamics requires 
an infinite state space partition, with pruning rules for blocks of increasing length, 
most of the shadowing combinations still cancel, but the few corresponding to new 
forbidden blocks do not, leading to a finite radius of convergence for the spectral 
determinant, as depicted in figure 15.3. 

One striking aspect of the pruned cycle expansion (15.31) compared to the 
trace formulas such as (15.9) is that coefficients are not growing exponentially - 
indeed they all remain of order 1, so instead having a radius of convergence e h , in 
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the example at hand the topological zeta function has the unit circle as the radius 
of convergence. In other words, exponentiating the spectral problem from a trace 
formula to a spectral determinant as in (15.24) increases the analyticity domain: 
the pole in the trace (15.10) at z = e~ h is promoted to a smooth zero of the spectral 
determinant with a larger radius of convergence. 

This sensitive dependence of spectral determinants on whether or not the sym- 
bolic dynamics is a subshift of finite type is bad news. If the system is generic and 
not structurally stable (see sect. 12.2), a smooth parameter variation is in no sense 
a smooth variation of topological dynamics - infinities of periodic orbits are cre- 
ated or destroyed, and transition graphs go from being finite to infinite and back. 
That will imply that the global averages that we intend to compute are generi- 
cally nowhere differentiable functions of the system parameters, and averaging 
over families of dynamical systems can be a highly nontrivial enterprise; a simple 
illustration is the parameter dependence of the diffusion constant computed in a 
remark in chapter 25. 

You might well ask: What is wrong with computing the entropy from (15.1)? 
Does all this theory buy us anything? An answer: If we count K„ level by level, we 
ignore the self-similarity of the pruned tree - examine for example figure 14.5, or 
the cycle expansion of (15.35) - and the finite estimates of h n = In K n /n converge 
nonuniformly to h, and on top of that with a slow rate of convergence, \h - fa] * 
0(l/n) as in (15.5). The determinant (15.11) is much smarter, as by construction 
it encodes the self-similarity of the dynamics, and yields the asymptotic value of 
h with no need for any finite n extrapolations. 

fast track: 
sect. 1 6, p. 327 



15.7 Counting cycles 

Chaos is merely order waiting to be deciphered 
— Jose Saramago, The Double 

i 

In what follows, we shall occasionally need to compute all cycles up to 
topological period n, so it is important to know their exact number. The formulas 
are fun to derive, but a bit technical for plumber on the street, and probably best 
skipped on the first reading. 
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15.7.1 Counting periodic points 

The number of periodic points of period n is denoted N n . It can be computed from 
(15.24) and (15.9) as a logarithmic derivative of the topological zeta function 

Yj n ^ = tt r z J; ln(1 ~ zT) ) = ~ z Y z lndet (1 ~ zT) 



(15.34) 



<>top 



Observe that the trace formula (15.10) diverges at z — > e h , because the denomi- 
nator has a simple zero there. 



Example 15.8 Complete N-ary dynamics: To check formula (15.34) for the finite- 
grammar situation, consider the complete N-ary dynamics (14.7) for which the number 
of periodic points of period n is simply tiT" = N". Substituting 

(zN)" 



z ^vr = _ ln(1 _ zA0) 



into (15.24) we verify (15.18). The logarithmic derivative formula (15.34) in this case 
does not buy us much either, it simply recovers 

2->" nZ l-Nz' 

n=l 



Example 15.9 Nontrivial pruned dynamics: Consider the pruning of figure 14.6(e). 
Substituting (15.34) we obtain 



)N n e = - — r. (15.35) 

^ 1 - z - 2z 4 + z 8 



The topological zeta function is not merely a tool for extracting the asymptotic growth 
ofN n ; it actually yields the exact numbers of periodic points. In case at hand it yields 
a nontrivial recursive formula Ni = N2 = = 1, N„ = 2n + 1 for n = 4,5,6,7,8, and 
N„ = N„-\ + 2N„-4 - N n -s forn > 8. 



15.7.2 Counting prime cycles 

Having calculated the number of periodic points, our next objective is to evaluate 
the number of prime cycles M n for a dynamical system whose symbolic dynamics 
is built from ,/V symbols. The problem of finding M n is classical in combinatorics 
(counting necklaces made out of n beads of ,/V different kinds) and is easily solved. 
There are ^V" possible distinct strings of length n composed of N letters. These 
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Table 15.3: Number of prime cycles for various alphabets and grammars up to period 
10. The first column gives the cycle period, the second gives the formula ( 15.37) for the 
number of prime cycles for complete A^-symbol dynamics, and columns three through five 
give the numbers of prime cycles for N -2,3 and 4. 



n 


M„(N) 


M„(2) 


M„(3) 


M„(4) 


1 


N 


2 


3 


4 


2 


N(N - l)/2 


1 


3 


6 


3 


N(N 2 - l)/3 


2 


8 


20 


4 


N 2 (N 2 - l)/4 


3 


18 


60 


5 


(N 5 - AO/5 


6 


48 


204 


6 


(N b - N 3 - N 2 + AO/6 


9 


116 


670 


7 


(N 1 - AO/7 


18 


312 


2340 


8 


N\N 4 - l)/8 


30 


810 


8160 


9 


N\N 6 - l)/9 


56 


2184 


29120 


10 


(/V 10 -^ 5 -N 2 +N)/ 10 


99 


5880 


104754 



N n strings include all Mj prime <i-cycles whose period d equals or divides n. A 
prime cycle is a non-repeating symbol string: for example, p — 011 = 101 = 
TlO = ... 01 101 1 ... is prime, but 0101 = 010101 ... = Ol is not. A prime d- 
cycle contributes d strings to the sum of all possible strings, one for each cyclic 
permutation. The total number of possible periodic symbol sequences of period n 
is therefore related to the number of prime cycles by 

Ar„ = 2]</M rf , (15.36) 

d\n 

where N n equals tr T". The number of prime cycles can be computed recursively 



M„ = - 



d<n \ 

N„ - ^ dM d 

d\n 



or by the Mobius inversion formula 

M n = n 1 Yj^ Nd - (1537) 

d\n 

where the Mobius function //(l) = 1, /u(n) = 0 if n has a squared factor, and 
HiPiPi ■ ■ -Pk) = ( _ 1)^ if all prime factors are different. 

We list the number of prime cycles up to period 10 for 2-, 3- and 4-letter 
complete symbolic dynamics in table 15.3, obtained by Mobius inversion (15.37). 



exercise 15.10 



exercise 15.1 1 



Example 15.10 Counting N-disk periodic points: A simple example of 

pruning is the exclusion of "self-bounces" in the N-disk game of pinball. The number of 
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points that are mapped back onto themselves after n iterations is given by N„ = trT". 
The pruning of self-bounces eliminates the diagonal entries, T N - disk = T c - 1, so the 
number of the N-disk periodic points is 

N„ = *T n N _ disk =(N- 1)" + (-l)"(N - 1) . (15.38) 

Here T c is the complete symbolic dynamics transition matrix (14.7). For the N-disk 
pruned case (15.38), Mobius inversion (15.37) yields 

d\n d\n 

= M ( ^- V) for n > 2 . (15.39) 

There are no fixed points, so M?~ d,sk = 0. The number of periodic points of period 2 
is N 2 - N, hence there are M% sk = N(N - l)/2 prime cycles of period 2; for periods 
n > 2, the number of prime cycles is the same as for the complete (N - 1 )-ary dynamics 
of table 15.3. 



Example 15.11 Pruning individual cycles: Consider the 3-disk game 

of pinball. The prohibition of repeating a symbol affects counting only for the fixed 
points and the 2-cycles. Everything else is the same as counting for a complete binary 
dynamics (15.39). To obtain the topological zeta function, just divide out the binary 1- 
and 2-cycles (1 - zfo)(l - zh)(l - z 2 foi) and multiply with the correct 3-disk 2-cycles 

(1-Z 2 fl2)(l-Z 2 fl3)(l-Z 2 f23)-' 



(1 -z ) 

Vb-disk = (l-2z) 



exercise 15.14 
exercise 15.15 



(l-z) 2 (l-z 2 ) 

= (1 - 2z)(l + z) 2 = 1 - 3r - 2z 3 . (15.40) 

The factorization reflects the underlying 3-disk symmetry; we shall rederive it in (21.25). 
As we shall see in chapter 21, symmetries lead to factorizations of topological polyno- 
mials and topological zeta functions. 



Example 15.12 Alphabet {a, cb k ; b}: (continuation of exercise 15.16) In the cycle 
counting case, the dynamics in terms of a — > z, cb k — > z + z 2 + z 3 + ■ ■ ■ = z/(l - z) is a 
complete binary dynamics with the explicit fixed point factor (1 - tb) = (1 - z) 



exercise 15.19 



l/?top = (l-z)(l-z - 737) = l-3z + z 2 



Resume 



The main result of this chapter is the cycle expansion (15.27) of the topological 
zeta function (i.e., the spectral determinant of the transition matrix): 

l/<rtop(z) - l-J] 2 ***- 
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Table 15.4: List of 3-disk prime cycles up to period 10. Here n is the cycle period, M„ is 
the number of prime cycles, N„ is the number of periodic points, and S „ the number of 
distinct prime cycles under D3 symmetry (see chapter 21 for further details). Column 3 
also indicates the splitting of N„ into contributions from orbits of periods that divide n. 
The prefactors in the fifth column indicate the degeneracy m p of the cycle; for example, 
3-12 stands for the three prime cycles 12, 13 and 23 related by 27r/3 rotations. Among 
symmetry-related cycles, a representative p which is lexically lowest is listed. The cycles 
of period 9 grouped with parentheses are related by time reversal symmetry, but not by 
any D3 transformation. 



n 


M„ 


N„ 


s„ 


trip ■ p 


1 


0 


0 


0 




2 


3 


6=3-2 


1 


3-12 


3 


2 


6=2-3 


1 


2-123 


4 


3 


18=3-2+3-4 


1 


3-1213 


5 


6 


30=6-5 


1 


6-12123 


6 


9 


66=3-2+2-3+9-6 


2 


6-121213 + 3-121323 


7 


18 


126=18-7 


3 


6-1212123 + 6-1212313 + 6-1213123 


8 


30 


258=3-2+3-4+30-8 


6 


6-12121213 + 3-12121313 + 6-12121323 
+ 6 12123123 + 6-12123213 + 3-12132123 


9 


56 


510=2-3+56-9 


10 


6-121212123 + 6(121212313 + 121212323) 
+ 6 (121213123 + 121213213) + 6-121231323 
+ 6(121231213 + 121232123) + 2-121232313 
+ 6-121321323 


10 


99 


1022 


18 





Table 15.5: The 4-disk prime cycles up to period 8. The symbols is the same as shown 
in table 15.4. Orbits related by time reversal symmetry (but no C\ v symmetry) already 
appear at cycle period 5. Cycles of period 7 and 8 have been omitted. 



n 


M n 


N„ 


s„ 


m p ■ p 


1 


0 


0 


0 




2 


6 


12=6-2 


2 


4-12 + 2-13 


3 


8 


24=8-3 


1 


8-123 


4 


18 


84=6-2+18-4 


4 


8-1213 + 4-1214 + 2-1234 + 4-1243 


5 


48 


240=48-5 


6 


8 (12123 +- 12124) + 8-12313 










+ 8-(12134+ 12143) + 8-12413 


6 


116 


732=6-2+8-3+116-6 


17 


8-121213 + 8-121214 + 8-121234 










+ 8-121243 + 8-121313 + 8-121314 










+ 4-121323 + 8-(121324 + 121423) 










+ 4-121343 + 8-121424 + 4-121434 










+ 8-123124 + 8-123134 + 4-123143 










+ 4-124213 + 8-124243 


7 


312 


2184 


39 




8 


810 


6564 


108 
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For subshifts of finite type, the transition matrix is finite, and the topological zeta 
function is a finite polynomial evaluated by the loop expansion (15.15) of det(l - 
zT). For infinite grammars the topological zeta function is defined by its cycle 
expansion. The topological entropy h is given by the leading zero z - e h . This 
expression for the entropy is exact; in contrast to the initial definition (15.1), no 
n — > oo extrapolations of In K n /n are required. 

What have we accomplished? We have related the number of topologically 
distinct paths from one state space region to another region to the leading eigen- 
value of the transition matrix T. The spectrum of T is given by topological zeta 
function, a certain sum over traces tr T\ and in this way the periodic orbit theory 
has entered the arena through the trace formula (15.10), already at the level of the 
topological dynamics. 

The main lesson of learning how to count well, a lesson that will be constantly 
reaffirmed, is that while trace formulas are a conceptually essential step in deriving 
and understanding periodic orbit theory, the spectral determinant is the right object 
to use in actual computations. Instead of summing all of the exponentially many 
periodic points required by trace formulas at each level of truncation, spectral det- 
erminants incorporate only the small incremental corrections to what is already 
known - and that makes them a more powerful tool for computations. 

Contrary to claims one all too often encounters in the literature, "exponential 
proliferation of trajectories" is not the problem; what limits the convergence of 
cycle expansions is the proliferation of the grammar rules, or the "algorithmic 
complexity," as illustrated by sect. 15.5, and figure 15.3 in particular. Nice, finite 
grammar leads to nice, discrete spectrum; infinite grammar leads to analyticity 
walls in the complex spectral plane. 

Historically, these topological zeta functions were the inspiration for applying 
the transfer matrix methods of statistical mechanics to the problem of computation 
of dynamical averages for chaotic flows. The key result was the dynamical zeta 
function to be derived in chapter 18, a weighted generalization of the topological 
zeta function. 



Commentary 

Remark 15.1 Artin-Mazur zeta functions. Motivated by A. Weil's zeta function for 
the Frobenius map [8], Artin and Mazur [11] introduced the zeta function (15.27) that 
counts periodic points for diffeomorphisms (see also ref. [ 9] for their evaluation for maps 
of the interval). Smale [ 1 0] conjectured rationality of the zeta functions for Axiom A 
diffeomorphisms, later proved by Guckenheimer [11] and Manning [12]. See remark 19.4 
on page 392 for more zeta function history. 

Remark 15.2 "Entropy." The ease with which the topological entropy can be motivated 
obscures the fact that our construction does not lead to an invariant characterization of the 
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dynamics, as the choice of symbolic dynamics is largely arbitrary: the same caveat ap- 
plies to other entropies. In order to obtain invariant characterizations we will have to work 
harder. Mathematicians like to define the (impossible to evaluate) supremum over all pos- 
sible partitions. The key point that eliminates the need for such searches is the existence 
of generators, i.e., partitions that under the dynamics are able to probe the whole state 
space on arbitrarily small scales. A generator is a finite partition M = {M \ . . . Mn) with 
the following property: consider the partition built upon all possible intersections of sets 
f"{Mi), where / is dynamical evolution and n takes all possible integer values (positive 
as well as negative), then the closure of such a partition coincides with the 'algebra of all 
measurable sets.' For a thorough (and readable) discussion of generators and how they 
allow a computation of the Kolmogorov entropy, see ref . [ 1 ] . 

Remark 15.3 Perron-Frobenius matrices. For a proof of the Perron theorem on the 
leading eigenvalue see ref. [26]. Appendix A4. 1 of ref. [2] offers a clear discussion of the 
spectrum of the transition matrix. 

Remark 15.4 Determinant of a graph. Many textbooks offer derivations of the 
loop expansions of characteristic polynomials for transition matrices and their transition 
graphs, see for example refs. [3, 4, 5]. 

Remark 15.5 Ordering periodic orbit expansions. In sect. 20.6 we will introduce 
an alternative way of hierarchically organizing cumulant expansions, in which the order 
is dictated by stability rather than cycle period: such a procedure may be better suited to 
perform computations when the symbolic dynamics is not well understood. 

Remark 15.6 T is not trace class. Note to the erudite reader: the transition matrix 
T (in the infinite partition limit (15.24)) is not trace class. Still the trace is well defined in 
the n — > oo limit. 

Remark 15.7 Counting prime cycles. Duval has an efficient algorithm for generating 
Lyndon words (non-periodic necklaces, i.e., prime cycle itineraries). 



Exercises 



15.1. A transition matrix for 3-disk pinball. 



to the graph. Show that iteration of the transi- 
tion matrix results in two coupled linear differ- 
ence equations, - one for the diagonal and one for 
the off diagonal elements. (Hint: relate XiT" to 
u-r"- 1 + ....) 



a) Draw the transition graph corresponding to the 3- 
disk ternary symbolic dynamics, and write down 
the corresponding transition matrix corresponding 
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b) Solve the above difference equation and obtain the 
number of periodic orbits of length n. Compare 
your result with table 15.4. 

c) Find the eigenvalues of the transition matrix T for 
the 3-disk system with ternary symbolic dynamics 
and calculate the topological entropy. Compare 
this to the topological entropy obtained from the 
binary symbolic dynamics {0, 1). 

15.2. 3-disk prime cycle counting. A prime cycle p 
of length tip is a single traversal of the orbit; its label is 
a non-repeating symbol string of n p symbols. For ex- 
ample, 12 is prime, but 2121 is not, since it is 21 = 12 
repeated. 

Verify that a 3-disk pinball has 3, 2, 3, 6, 9, ■ ■ ■ prime 
cycles of length 2, 3, 4, 5, 6, ■ ■ -. 

15.3. Sum of Aij is like a trace. Let A be a matrix with 
eigenvalues A^. Show that 

r„ : = J> n :b = Z c ^ • 

u * 

(a) Under what conditions do In |tr A " | and In |F„ | have 
the same asymptotic behavior as n — > oo, i.e., their 
ratio converges to one? 

(b) Do eigenvalues At need to be distinct, Au + At for 
k + 11 How would a degeneracy A^ — A/ affect 
your argument for (a)? 

15.4. Loop expansions. Prove by induction the sign rule in 
the determinant expansion (15.15): 

det(l- Z T) = £ Z 

k>0 P1+-+/J* 

15.5. Transition matrix and cycle counting. Suppose you 
are given the transition graph 

b . 

- -- 




This diagram can be encoded by a matrix T, where the 
entry 7y means that there is a link connecting node i to 
node j. The value of the entry is the weight of the link. 

a) Walks on the graph are given a weight that is the 
product of the weights of all links crossed by the 
walk. Convince yourself that the transition matrix 
for this graph is: 



a c 
b 0 



b) Enumerate all the walks of length three on the 
transition graph. Now compute T 3 and look at the 
entries. Is there any relation between the terms in 
T 3 and all the walks? 



c) Show that T". is the number of walks from point 
i to point j in n steps. (Hint: one might use the 
method of induction.) 

d) Estimate the number K n of walks of length n for 
this simple transition graph. 

e) The topological entropy /; measures the rate of ex- 
ponential growth of the total number of walks K„ 
as a function of n. What is the topological entropy 
for this transition graph? 

15.6. Alphabet {0,1}, prune _00_ . The transition graph ex- 
ample 14.9 implements this pruning rule which implies 
that "0" must always be bracketed by "l"s; in terms of a 
new symbol 2 := 10, the dynamics becomes unrestricted 
symbolic dynamics with with binary alphabet {1,2}. The 
cycle expansion (15.15) becomes 

l/{ = (l-ti)(l-fe)(l-»i2)(l-*m)... 
= 1 - h - 1 2 - (hi - ht 2 ) (15.41) 

_ (*112 _ fl2*l) — ( f 122 - tnh) ■ ■ ■ 

In the original binary alphabet this corresponds to: 

l/C = l-ti-t w -(t m -ht w ) (15.42) 
-(fmo - hwh) - (?i ioio _ fno f io) • ■ • 

This symbolic dynamics describes, for example, circle 
maps with the golden mean winding number. For uni- 
modal maps this symbolic dynamics is realized by the 
tent map of exercise 1 1 .6. 

15.7. "Golden mean" pruned map. (continuation of exer- 
cise 11.6) Show that the total number of periodic orbits 
of length n for the "golden mean" tent map is 

(1 + V5)" + (1 - V5)" 
2" ' 

Continued in exercise 19.2. See also exercise 15.8. 

15.8. A unimodal map with golden mean pruning. Con- 
sider the unimodal map 





for which the critical point maps into the right hand fixed 
point in three iterations, S + = 1001. Show that the ad- 
missible itineraries are generated by the above transition 
graph, with transient neighborhood of 0 fixed point, and 
_00_ pruned from the recurrent set. (K.T. Hansen) 
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15.9. Glitches in shadowing. (medium difficulty) Note 
that the combination fooon minus the "shadow" f 0*0011 in 
(15.20) cancels exactly, and does not contribute to the 
topological zeta function (15.21). Are you able to con- 
struct a smaller transition graph than figure 14.6 (e)? 

15.10. Whence Mobius function? To understand the origin 
of the Mobius function ( 15.37), consider the function 



f(n) = Y j g(d) 



(15.43) 



d\n 



where d\n stands for sum over all divisors d of «. Invert 
recursively this infinite tower of equations and derive the 
Mobius inversion formula 



g(n) = J]n(n/d)f(d). 

d\n 



(15.44) 



15.11. Counting prime binary cycles. In order to get com- 
fortable with Mobius inversion reproduce the results of 
the second column of table 15.3. 

Write a program that determines the number of prime 
cycles of length n. You might want to have this program 
later on to be sure that you have missed no 3-pinball 
prime cycles. 

15.12. Counting subsets of cycles. The techniques de- 
veloped above can be generalized to counting subsets 
of cycles. Consider the simplest example of a dynami- 
cal system with a complete binary tree, a repeller map 
(1 1.4) with two straight branches, which we label 0 and 
1. Every cycle weight for such map factorizes, with a 
factor to for each 0, and factor t\ for each 1 in its sym- 
bol string. Prove that the transition matrix traces ( 15.7) 
collapse to tr{T k ) = (to + t\) k , and 1/f is simply 



no 



t P ) = 1 - *o - *i 



(15.45) 



p 



Substituting (15.45) into the identity 

ne*<j-n^£ 

p p 1 

we obtain 



1 - tl - t\ 



no-,) - 



to-h 



1 + to + h + 



1 - *o - *i 



= 1 + to + h 



n=2 A- 1 

Hence for n > 2 the number of terms in the cumulant 
expansion with k 0's and n - k l's in their symbol se- 
quences is 2 (^zfj- 



In order to count the number of prime cycles in each 
such subset we denote with M„^ (n = l,2,...;k 
10, 1 } for n 1; k — 1, . . . ,n — 1 for n > 2) the number 
of prime n-cycles whose labels contain k zeros. Show 
that 



nM„, k 



M 



i.i 



1 



n>2,k=l, 



1 



n/m 
k/m 



where the sum is over all m which divide both n and k. 
(continued as exercise 20.7) 

15.13. Logarithmic periodicity of In jV„. (medium diffi- 
culty) Plot (lnN n ,nli) for a system with a nontrivial fi- 
nite transition graph. Do you see any periodicity? If yes, 
why? 

15.14. Symmetric 4-disk pinball topological zeta function. 

Show that the 4-disk pinball topological zeta function 
(the pruning affects only the fixed points and the 2- 
cycles) is given by 



i I yA—disk 
i '^top 



= (l-3z) 



(l-z 1 ) 



2,6 



(1 -z) 3 (l -z 2 ) 



2\3 



(l-3z)(l+z) 3 
1 - 6z 2 - 8z 3 - 3z 4 



(15.46) 



15.15. Symmetric A^-disk pinball topological zeta function. 

Show that for an A^-disk pinball, the topological zeta 
function is given by 



■N-disk 

top 



(l-(N-l)z)x 

(1 _ z 2yV(2V-l)/2 
(1 _ z )N-l(\ _ z 2)(JV-l)(JV-2)/2 

= (l-{N-l)z)(l+zf- 1 .(15.47) 

The topological zeta function has a root z~' = N — 1, 
as we already know it should from (15.38) or (15.18). 
We shall see in sect. 21.4 that the other roots reflect the 
symmetry factorizations of zeta functions. 

15.16. Alphabet {a,b, c], prune jub_ . Write down the 
topological zeta function for this pruning rule. 

15.17. Alphabet {0,1}, prune n repeats of "0" D00 . 00_. 

This is equivalent to the n symbol alphabet {1, 2, . . ., 
«} unrestricted symbolic dynamics, with symbols corre- 
sponding to the possible 10. . .00 block lengths: 2: = 10, 
3: = 100, . . ., «:=100. . .00. Show that the cycle expansion 
(15.15) becomes 



1/f 



1 - *1 - * 2 . . 
—(tin ~ *l*n) ■ 



*„-(*12-*l*2). 
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15.18. Alphabet {0,1}, prune _1000_, DOIOO^, DllOO^ 
Show that the topological zeta function is given by 

1/f = (1 - f 0 )(l -t x -h- t 23 - t m ) (15.48) 

with the unrestricted 4-letter alphabet {1, 2, 23, 113} . 
Here 2 and 3 refer to 10 and 100 respectively, as in ex- 
ercise 15.17. 

15.19. Alphabet {0,1}, prune _1000^ DOIOO^, D1100., 
_1001 1 _. (This grammar arises from Henon map 
pruning, see remark 12.3.) The first three pruning rules 
were incorporated in the preceeding exercise. 

(a) Show that the last pruning rule _1 00 1 1 _ leads (in a 
way similar to exercise 15.18) to the alphabet { 21* , 23 , 
21 A 1 13 ; 1,0), and the cycle expansion 

1/f = (l-f 0 )(l-fl-f2-f23+flf23-f2113) -(15.49) 

Note that this says that 1, 23, 2, 21 13 are the fundamen- 
tal cycles; not all cycles up to length 7 are needed, only 
2113. 

(b) Show that the topological zeta function is 

= (1 - z)(l - z - z 2 - z 5 + z 6 - z 1 ) (15.50) 



and that it yields the entropy h = 0.522737642 . . .. 

15.20. Alphabet {0,1}, prune only the fixed point 0 . This 
is equivalent to the infinite alphabet {1, 2, 3, 4, . . .} 
unrestricted symbolic dynamics. The prime cycles are 
labeled by all non-repeating sequences of integers, or- 
dered lexically: t n ,n > 0; t mn , t mmn , . . . , n > m > 0; 
tmnr, r > n > m > 0, . . . (see sect. 24.3). Now the num- 
ber of fundamental cycles is infinite as well: 

n>0 n>m>0 

(jmmn ^mhnn) 



-z 

«>m>0 



(15.51) 



«>»!>() 

r>n>m>Q 

. As shown in table 24.1, this grammar plays an im- 
portant role in description of fixed points of marginal 
stability. 
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Chapter 16 



Transporting densities 



Paulina: I'll draw the curtain: 

My lord's almost so far transported that 

He'll think anon it lives. 

— W. Shakespeare: The Winter's Tale 

(P. Cvitanovic, R. Artuso, L. Rondoni, and E.A. Spiegel) 

In chapters 2, 3, 7 and 8 we learned how to track an individual trajectory, and 
saw that such a trajectory can be very complicated. In chapter 4 we stud- 
ied a small neighborhood of a trajectory and learned that such neighborhood 
can grow exponentially with time, making the concept of tracking an individual 
trajectory for long times a purely mathematical idealization. 

While the trajectory of an individual representative point may be highly con- 
voluted, as we shall see, the density of these points might evolve in a manner that 
is relatively smooth. The evolution of the density of representative points is for 
this reason (and other that will emerge in due course) of great interest. So are 
the behaviors of other properties carried by the evolving swarm of representative 
points. 

We shall now show that the global evolution of the density of representative 
points is conveniently formulated in terms of linear action of evolution operators. 
We shall also show that the important, long-time "natural" invariant densities are 
unspeakably unfriendly and essentially uncomputable everywhere singular func- 
tions with support on fractal sets. Hence, in chapter 17 we rethink what is it that 
the theory needs to predict ("expectation values" of "observables"), relate these 
to the eigenvalues of evolution operators, and in chapters 18 to 20 show how to 
compute these without ever having to compute a "natural" invariant density 
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Figure 16.1: (a) First level of partitioning: A 
coarse partition of M into regions Mo, Mi, and 
M2- (b) n = 2 level of partitioning: A refinement 
of the above partition, with each region Mi subdi- 
vided into Mm, Mn , and Ma- 



(a) 




(b) 



16.1 Measures 



Do I then measure, O my God, and know not what I mea- 
sure? 

— St. Augustine, The confessions of Saint Augustine 



A fundamental concept in the description of dynamics of a chaotic system is that 
of measure, which we denote by dp.{x) = p(x)dx. An intuitive way to define and 
construct a physically meaningful measure is by a process of coarse-graining. 
Consider a sequence 1, 2, n, ... of increasingly refined partitions of state space, 
figure 16.1, into regions Mi defined by the characteristic function 



A coarse-grained measure is obtained by assigning the "mass," or the fraction of 
trajectories contained in the ith region A\ ■ <z M at the nth level of partitioning of 
the state space: 



The function p{x) - p(x, t) denotes the density of representative points in state 
space at time t. This density can be (and in chaotic dynamics, often is) an ar- 
bitrarily ugly function, and it may display remarkable singularities; for instance, 
there may exist directions along which the measure is singular with respect to the 
Lebesgue measure (namely the uniform measure on the state space). We shall 
assume that the measure is normalized 



where the sum is over subregions i at the nth level of partitioning. The infinites- 
imal measure p{x) dx can be thought of as an infinitely refined partition limit of 




(16.1) 




(16.2) 




(16.3) 
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Figure 16.2: The evolution rule /'can be used to map 

a region \ I of the state space into the region /'(At/ ). \ "ijjjl ^ — — i / i(M. ) 



A//,- = \Mi\p(xj), where \Aij\ is the volume of subregion A% and Xj e At,-; also 
p(x) is normalized 

\ dxp(x) = 1 . (16.4) 
JM 

Here |A1;| is the volume of region A%, and all |A1,| — > 0 as n — > oo. 

So far, any arbitrary sequence of partitions will do. What are intelligent ways 
of partitioning state space? We already know the answer from chapter 1 1 , but let 
us anyway have another look at this, in order to develop some intuition about how 
the dynamics transports densities. 



chapter 1 1 



16.2 Perron-Frobenius operator 



Given a density, the question arises as to what it might evolve into with time. 
Consider a swarm of representative points making up the measure contained in a 
region Mi at time t - 0. As the flow evolves, this region is earned into f{Mi), 
as in figure 16.2. No trajectory is created or destroyed, so the conservation of 
representative points requires that 



dxp(x,t) = I dxop(xo,0). 
Jf'(Mi) J Mi 



Transform the integration variable in the expression on the left hand side to the 
initial points xq = f~\x), 

I dxop(f t (x 0 ),t)\detf(x 0 )\= I dx 0 p(x 0 ,0). 

The density changes with time as the inverse of the Jacobian (4.28) 

p{x,t)= x = /*(*>), (16.5) 

|det J'(x 0 )\ 

which makes sense: the density varies inversely with the infinitesimal volume 
occupied by the trajectories of the flow. 
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0.2 0.4 0.6 0.8 1 



The relation (16.5) is linear in p, so the manner in which a flow transports 
densities may be recast into the language of operators, by writing 

p{x,t) = (£ t op){x)= f dx 0 d(x-f(x 0 ))p(x 0 ,0). (16.6) 



exercise 16.1 



Let us check this formula. As long as the zero is not smack on the border of dM, 
integrating Dirac delta functions is easy: f M dx6(x) = lifOeM, zero otherwise. 
The integral over a 1 -dimensional Dirac delta function picks up the Jacobian of its 
argument evaluated at all of its zeros: 



(x—x )h'(x ) 



f 



dx 6(h(x)) 




{x:h(x)=Q\ 



\h'(x)\ ' 



(16.7) 



and in d dimensions the denominator is replaced by 



J dx5(h(x)) = J]J dx6(h(x)) = 



Idet 

J \ ael dx 



(16.8) 



where Aij is any open neighborhood that contains the single Xj zero of h. Now 
you can check that (16.6) is just a rewrite of (16.5): 



exercise 16.2 



p(xo) 



, 0 =A) |detyt(Xo)l 



(1 -dimensional) 
(J-dimensional) . 



(16.9) 



For a deterministic, invertible flow x has only one preimage xq; allowing for mul- 
tiple preimages also takes account of noninvertible mappings such as the 'stretch 
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& fold' maps of the interval, to be discussed briefly in example 16.1, and in more 
detail in sect. 11.3. 

We shall refer to the integral operator with singular kernel (16.6) as the Perron- 
Frobenius operator. 

£\x,y) = 8(x-f , (y)). (16.10) 

The PeiTon-Frobenius operator assembles the density p(x, t) at time t by going 
back in time to the density p(xo, 0) at time t = 0. If you do not like the word "ker- 
nel" you might prefer to think of £(x, y) as a matrix with indices x, y, and index 
summation in matrix multiplication replaced by an integral over y, {£? o p) (x) = 
J dy £, ! (x, y)p(y) ■ In the next example Perron-Frobenius operator is a matrix, and 
(16.14) illustrates a matrix approximation to the Perron-Frobenius operator. 



exercise 16.3 
example 23.7 



remark 19.4 



Example 16.1 Perron-Frobenius operator for a piecewise-linear map: Consider 
the expanding I -dimensional map f(x) of figure 16.3, a piecewise-linear 2-branch map 
with slopes A 0 > 1 and Ai = -A 0 /(A 0 - 1) < -1 : 



exercise 16.7 



ffr \ - J Mx) = A 0 x , x e Mo = [0, 1/A 0 ) rifi in 

i /i(jr) = Ai(l-je), x e Mi = (1/A 0) 1] . 

Both f(Mo) and f(M\) map onto the entire unit interval M - [0,1]. We shall refer to 
any unimodal map whose critical point maps onto the "left" unstable fixed point xq as 
the "Ulam" map. Assume a piecewise constant density 

i po if x e Mo , l( - n , 
**>H p\ ifxeMi ■ (16 ' 12) 

As can be easily checked using (16.9), the Perron-Frobenius operator acts on this 
piecewise constant function as a [2x2] Markov matrix L with matrix elements 



P0 \ _» l.n = l \fa \k \i P ° 



exercise 16.1 
exercise 16.5 



Lp= |A i°' ^ r . (16.13) 

Pi I \ |Ad W\ l\Pll 

stretching both p 0 and p\ over the whole unit interval A. In this example the density is 
constant after one iteration, so L has only a unit eigenvalue e so = 1/|A 0 | + l/|Ai| = 1, 
with constant density eigenvector p 0 = p\. The quantities 1/|A 0 |, l/|Ai| are, respec- 
tively, the fractions of state space taken up by the \Mo\, \Mi\ intervals. This simple 
explicit matrix representation of the Perron-Frobenius operator is a consequence of the 
piecewise linearity of f, and the restriction of the densities p to the space of piece- 
wise constant functions. The example gives a flavor of the enterprize upon which we 
are about to embark in this book, but the full story is much subtler: in general, there 
will exist no such finite-dimensional representation for the Perron-Frobenius operator, 
(continued in example 1 7.4) 



fast track: 

sect. 16.4, p. 334 
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16.3 Why not just leave it to a computer? 



Another subtlety in the [dynamical systems ] theory is that 
topological and measure-theoretic concepts of genericity 
lead to different results. 



— John Guckenheimer 



(R. Artuso and P. Cvitanovic) 



To a student with a practical bent the above Example 16.1 suggests a strategy for 
constructing evolution operators for smooth maps, as limits of partitions of state 
space into regions Mi, with a piecewise-linear approximations fi to the dynamics 
in each region, but that would be too naive; much of the physically interesting 
spectrum would be missed. As we shall see, the choice of function space for p is 
crucial, and the physically motivated choice is a space of smooth functions, rather 
than the space of piecewise constant functions. 

All of the insight gained in this chapter and in what is to follow is nothing but 
an elegant way of thinking of the evolution operator, X, as a matrix (this point of 
view will be further elaborated in chapter 23). There are many textbook methods 
of approximating an operator X by sequences of finite matrix approximations X, 
but in what follows the great achievement will be that we shall avoid construct- 
ing any matrix approximation to X altogether. Why a new method? Why not 
just run it on a computer, as many do with such relish in diagonalizing quantum 
Hamiltonians? 

The simplest possible way of introducing a state space discretization, fig- 
ure 16.4, is to partition the state space M with a non-overlapping collection of 
sets Mi, i - I, . . . ,N, and to consider densities (16.2) piecewise constant on each 



where Xi( x ) is the characteristic function (16.1) of the set Mi. This piecewise 
constant density is a coarse grained presentation of a fine grained density p(x), 
with (16.2) 



The Perron-Frobenius operator does not preserve the piecewise constant form, but 
we may reapply coarse graining to the evolved measure 



Mi. 





Mi 
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BTR.UTO INSENSlTIVO METHOD; 



T 

Figure 16.4: State space discretization approach to /%3*>-s^S> 
computing averages. 

exmd" if «£hw>i ^o, iij '»j ' 




or 



7=1 



\Mj\ 



In this way 



\Mi nr l (Mj) 
\Mi\ 



Uj = 1 frrr 3 . p'=pL (16.14) 



is a matrix approximation to the Perron-Frobenius operator, and its leading left 
eigenvector is a piecewise constant approximation to the invariant measure. 

The problem with such state space discretization approaches is that they are 
blind, the grid knows not what parts of the state space are more or less important. 
This observation motivated the development of the invariant partitions of chaotic 
systems undertaken in chapter 1 1 , we exploited the intrinsic topology of a flow to 
give us both an invariant partition of the state space and a measure of the partition 
volumes, in the spirit of figure 3.7. 

Furthermore, a piecewise constant p belongs to an unphysical function space, 
and with such approximations one is plagued by numerical artifacts such as spu- 
rious eigenvalues. In chapter 23 we shall employ a more refined approach to 
extracting spectra, by expanding the initial and final densities p, p" in some basis 
<^o> <f\, <P2>"' (orthogonal polynomials, let us say), and replacing £(y, x) by its 
(p a basis representation L ff( g = {(p a \£\(pp). The art is then the subtle art of finding 
a "good" basis for which finite truncations of I^g give accurate estimates of the 
eigenvalues of X- 

Regardless of how sophisticated the choice of basis might be, the basic prob- 
lem cannot be avoided - as illustrated by the natural measure for the Henon map 



remark 16.3 



chapter 23 
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(3.17) sketched in figure 16.5, eigenfunctions of X are complicated, singular func- 
tions concentrated on fractal sets, and in general cannot be represented by a nice 
basis set of smooth functions. We shall resort to matrix representations of £, and 
the tp a basis approach only insofar this helps us prove that the spectrum that we 
compute is indeed the correct one, and that finite periodic orbit truncations do 
converge. 

in depth: 
chapter 1 , p. 3 

16.4 Invariant measures 

A stationary or invariant density is a density left unchanged by the flow 

p(x, t) = p(x, 0) = p(x) . (16.15) 

Conversely, if such a density exists, the transformation fix) is said to be measure- 
preserving. As we are given deterministic dynamics and our goal is the compu- 
tation of asymptotic averages of observables, our task is to identify interesting 
invariant measures for a given fix). Invariant measures remain unaffected by dy- 
namics, so they are fixed points (in the infinite-dimensional function space of p 
densities) of the Perron-Frobenius operator (16.10), with the unit eigenvalue: 

©xsrcis© 1 6.3 

£'p(x)= f dy8{x-f t iy))p{y)=p{x). (16.16) 
Jm 

We will construct explicitly such eigenfunction for the piecewise linear map in 
example 17.4, with p(y) = const and eigenvalue 1. In general, depending on the 
choice of f{x) and the function space for p(x), there may be no, one, or many 
solutions of the eigenfunction condition (16.16). For instance, a singular measure 
dp(x) = 6(x - x q )dx concentrated on an equilibrium point Xq = f{x q ), or any 
linear combination of such measures, each concentrated on a different equilib- 
rium point, is stationary. There are thus infinitely many stationary measures that 
can be constructed. Almost all of them are unnatural in the sense that the slightest 
perturbation will destroy them. 

From a physical point of view, there is no way to prepare initial densities 
which are singular, so we shall focus on measures which are limits of transforma- 
tions experienced by an initial smooth distribution p(x) under the action of /, 

po(Jc) = lim f dyS(x-f(y))p(y,0), f dyp{y,0) = \. (16.17) 
JM JM 

Intuitively, the "natural" measure should be the measure that is the least sensitive 
to the (in practice unavoidable) external noise, no matter how weak, or round-off 
errors in a numerical computation. 
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16.4.1 Natural measure 



Huang: Chen-Ning, do you think ergodic theory gives us 
useful insight into the foundation of statistical mechanics? 
Yang: I don't think so. 

— Kerson Huang, C.N. Yang interview 

In computer experiments, as the Henon example of figure 16.5, the long time evo- 
lution of many "typical" initial conditions leads to the same asymptotic distribu- 
tion. Hence the natural measure (also called equilibrium measure, SRB measure, 
Sinai-Bowen-Ruelle measure, physical measure, invariant density, natural density, 
or even "natural invariant") is defined as the limit 



p*.oo = 



lim^oo y £ dr 5(y - f T (x 0 )) flows 

(16.18) 

lim^oo i YICl 6 {y ~ f k ( x o)) ma P s > 



exercise 16.8 
exercise 16.9 



where xq is a generic initial point. Generated by the action of /, the natural 
measure satisfies the stationarity condition (16.16) and is thus invariant by con- 
struction. 

Staring at an average over infinitely many Dirac deltas is not a prospect we 
cherish. From a computational point of view, the natural measure is the visitation 
frequency defined by coarse-graining, integrating (16.18) over the Mi region 

A/L = lira - , (16.19) 

>oo t 

where ?, is the accumulated time that a trajectory of total duration t spends in the 
Mi region, with the initial point xq picked from some smooth density p{x). 

Let a = a{x) be any observable. In the mathematical literature a(x) is a func- 
tion belonging to some function space, for instance the space of integrable func- 
tions L 1 , that associates to each point in state space a number or a set of numbers. 
In physical applications the observable a(x) is necessarily a smooth function. The 
observable reports on some property of the dynamical system. Several examples 
will be given in sect. 17.1. 

The space average of the observable a with respect to a measure p is given by 
the G?-dimensional integral over the state space M: 

{a) p = r—. f dxp(x)a(x) 
\pm\Jm 

\Pm\ - I dx p(x) = mass in M . (16.20) 
JM 

For now we assume that the state space M has a finite dimension and a finite 
volume. By its construction, (a) p is a function(al) of p. For p = po natural measure 
we shall drop the subscript in the definition of the space average; (a) p = (a). 
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Inserting the right-hand- side of (16.18) into (16.20), we see that the natural 
measure corresponds to a time average of the observable a along a trajectory of 
the initial point xq, 

1 r* 

a XQ = lim - dTa(f T (xo)). (16.21) 

f^oo t Jo 

Analysis of the above asymptotic time limit is the central problem of ergodic 
theory. The Birkhoff ergodic theorem asserts that if an invariant measure p ex- 



remark 16.1 



ists, the limit a(xo) for the time average (16.21) exists for (almost) all initial xq. a pp en dix A 
Still, Birkhoff theorem says nothing about the dependence on xo of time averages 
a~^ (or, equivalently, that the construction of natural measures (16.18) leads to a 
"single" density, independent of xq). This leads to one of the possible definitions 
of an ergodic evolution: / is ergodic if for any integrable observable a in (16.21) 
the limit function is constant. If a flow enjoys such a property the time averages 
coincide (apart from a set of p measure 0) with space averages 



lim - f dTa(f(x 0 )) = (a) . (16.22) 

?-»°° t Jo 

For future reference, we note a further property that is stronger than ergodicity: 
if the space average of a product of any two variables decorrelates with time, 

lim (a(xMf(x))) = (a) (b) , (16.23) 

the dynamical system is said to be mixing. The terminology may be understood 
better once we consider as the pair of observables in (16.23) characteristic func- 
tions of two sets J{ and B: then (16.23) may be written as 

t^oo p(J{) 

so that the set S spreads "uniformly" over the whole state space as t increases. 
Mixing is a fundamental notion in characterizing statistical behavior for dynam- 
ical systems: suppose we start with an arbitrary smooth nonequilibrium distribu- 
tion p(x)v(x): the after time t the average of an observable a is given by 

I dxp(x)v(f(x))a(x) 
JM 

and this tends to the equilibrium average (a) p if / is mixing. 

Example 16.2 The Henon attractor natural measure: A numerical calculation of 
the natural measure (16.19) for the Henon attractor (3.17) is given by the histogram 
in figure 16.5. The state space is partitioned into many equal-size areas Mi, and the 
coarse grained measure (16.19) is computed by a long-time iteration of the Henon map, 
and represented by the height of the column over area Mi. What we see is a typical 
invariant measure - a complicated, singular function concentrated on a fractal set. 



section 22.3 
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Figure 16.5: Natural measure (16.19) for the Henon 
map (3.17) strange attractor at parameter values 
(a,b) = (1.4,0.3). See figure 3.6 for a sketch of the 
attractor without the natural measure binning. (Cour- 
tesy of J.-P. Eckmann) 




remark A. 2 



If an invariant measure is quite singular -for instance a Dirac S concentrated 
on a fixed point or a cycle- it is most likely of no physical import. No smooth 
initial density will converge to this measure if its neighborhood is repelling. In 
practice the average (16.18) is problematic and often hard to control, as generic 
dynamical systems are neither uniformly hyperbolic nor structurally stable: it is 
not known whether even the simplest model of a strange attractor, the Henon 
attractor of figure 16.5, is "strange," or merely a transient to a very long stable 
cycle. 



exercise 6.4 



16.4.2 Determinism vs. stochasticity 

While dynamics can lead to very singular p's, in any physical setting we cannot 
do better than to measure p averaged over some region A4; the coarse-graining is 
not an approximation but a physical necessity. One is free to think of a measure 
as a probability density, as long as one keeps in mind the distinction between de- 
terministic and stochastic flows. In deterministic evolution the evolution kernels 
are not probabilistic; the density of trajectories is transported detenninistically . 
What this distinction means will became apparent later: for deterministic flows 
our trace and determinant formulas will be exact, while for quantum and stochas- 
tic flows they will only be the leading saddle point (stationary phase, steepest 
descent) approximations. 

Clearly, while deceptively easy to define, measures spell trouble. The good 
news is that if you hang on, you will never need to compute them, at least not 
in this book. How so? The evolution operators to which we next turn, and the 
trace and determinant formulas to which they will lead us, will assign the correct 
weights to desired averages without recourse to any explicit computation of the 
coarse-grained measure Ap,-. 



16.5 Density evolution for infinitesimal times 

Consider the evolution of a smooth density p(x) = p(x, 0) under an infinitesimal 
step St, by expanding the action of J^ T to linear order in St: 

£ Sr p{y) = [ dxS(y-f T (x))p(x) 
JM 
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- I dx 8{y - x - Stv(x)) p(x) 
JM 

p(y - frrvQQ) _ p(y) - 6TVj(y)dip(y) 

| det (l +(5T 5gO)| " \+5TY? i=l dm(y) 

d 

p(x,dr) = p(x,0) -6t— (v(x)p{x,0)) . (16.24) 

ox 

Here we have used the infinitesimal form of the flow (2.7), the Dirac delta Jaco- 

exercise4.1 

bian (16.9), and the In det = tr In relation. By the Einstein summation conven- 
tion, repeated indices imply summation, v;(y)d; = 2lf =1 Vj(y)<9;. Moving p(y, 0) to 
the left hand side and dividing by 5t, we discover that the rate of the deformation 
of p under the infinitesimal action of the Perron-Frobenius operator is nothing but 
the continuity equation for the density: 

d,p + d-(pv) = 0. (16.25) 

The family of Perron-Frobenius operators operators |X f } feR forms a semigroup 
parameterized by time 

(a) £° = / 

(b) £l£! = £! +t t,t' > 0 (semigroup property) . 

From (16.24), time evolution by an infinitesimal step 5t forward in time is gener- 
ated by 

Sipix) = + lim ^ LL ST - l)p{x) = -3,(v,(x)p(x)) . (16.26) 
We shall refer to 



SI = -d-v + ^vtWdi (16.27) 



as the time-evolution generator. If the flow is finite-dimensional and invertible, 
Si is a generator of a full-fledged group. The left hand side of (16.26) is the 
definition of time derivative, so the evolution equation for p{x) is 

lj f -Sl\fKx) = 0. (16.28) 



The finite time Perron-Frobenius operator (16.10) can be formally expressed 
by exponentiating the time evolution generator SI as 

£ = e m . (16.29) 
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The generator J{ is reminiscent of the generator of translations. Indeed, for a con- 
stant velocity field dynamical evolution is nothing but a translation by (time x velocity): 

exercise 16.10 

e~ tv &a{x) = a{x-tv). (16.30) 



16.6 Liouville operator 



A case of special interest is the Hamiltonian or symplectic flow defined 
by Hamilton's equations of motion (7.1). A reader versed in quantum mechan- 
ics will have observed by now that with replacement — > —tH , where H is 
the quantum Hamiltonian operator, (16.28) looks rather like the time dependent 
Schrodinger equation, so this is the right moment to figure out what all this means 
for Hamiltonian flows. 

The Hamilton's evolution equations (7.1) for any time-independent quantity 
Q = Qiq, P) are given by 



m = ^dm + dQdPi = dtI_dQ_dQ_dH 

dt dqi dt dpi dt dpi dqi dpi dqi ' 



where (p,-, qO span the full state space, which for Hamiltonian flows we shall refer 
to as the phase space. As equations with this structure arise frequently for sym- 
plectic flows, it is convenient to introduce a notation for them, the Poisson bracket 



remark 16.4 



tA m OA dB OA dB 

dpi dqt dqi dp,- 



In terms of Poisson brackets the time-evolution equation (16.31) takes the compact 
form 



dQ 

J±={H,Q}. (16.33) 
dt 



The discussion of sect. 16.5 applies to any deterministic flow. The full phase 
space flow velocity is x = v = (q, p), where the dot signifies time derivative. 



section 28.1 



If the density itself is a material invariant, combining 



d t I + v • dl = 0 . 
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remark 16.4 



and (16.25) we conclude that 3,-v; = 0 and det/(xo) = 1. An example of such 
incompressible flow is the Hamiltonian flow. For incompressible flows the con- 
tinuity equation (16.25) becomes a statement of conservation of the phase space 
volume (see sect. 7.3), or the Liouville theorem 

dtp + Vjdip = 0. (16.34) 

The symplectic structure of Hamilton's equations (7.1) implies that the flow 
is incompressible, <9,V; = 0, so for Hamiltonian flows the equation for p reduces to 
the continuity equation for the phase-space density: 

d t p + dtipvi) = 0 , i=l,2...,Z>. (16.35) 

Consider the evolution of the phase-space density p of an ensemble of nonin- 
teracting particles; the particles are conserved, so 

d , x l d ■ 9 9 \ r \ A 

—p(q,p,t) = \—+q i —+p i —\p{q,p,t) = 0. 

Inserting Hamilton's equations (7.1) we obtain the Liouville equation, a special 
case of (16.28): 

d 

-p(q,p,t) = -ftp{q,p,t) = {H,p(q,p,t)\, (16.36) 

where { , } is the Poisson bracket (16.32). The generator of the flow (16.27) is in 
this case a generator of infinitesimal symplectic transformations, 

d d dH d dH d „,„^ 

& = 4ijr + PiT- = -jrir- irir ■ (16 - 37) 

dqi dpi dpidqt dq t dpi 

For example, for separable Hamiltonians of form H = p 2 /2m+ V(q), the equations 
of motion are 

qt = — , Pi = — z — ■ (16.38) 
m dqi 

and the action of the generator . 

exercise 16.1 1 

ft = --— + diV(q)—. (16.39) 
m dqi dpi 

Looking back at (16.30) we see that the first term generates a translation in the 
configuration space, f(q, p) — > f{q - dt q, p), and the second generates acceler- 
ation by force dV(q) in the momentum space. They do not commute, hence the 
time integration is not trivial. 
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The time-evolution generator (16.27) for the case of symplectic flows is called 
the Liouville operator. You might have encountered it in statistical mechanics, 
while discussing what ergodicity means for 1CF 3 hard balls. Here its action will 
be very tangible; we shall apply the Liouville operator to systems as small as 1 or 
2 hard balls and to our surprise learn that this suffices to already get a bit of a grip 
on foundations of the nonequilibrium statistical mechanics. 



Resume 

In physically realistic settings the initial state of a system can be specified only to 
a finite precision. If the dynamics is chaotic, it is not possible to calculate the long 
time trajectory of a given initial point. Depending on the desired precision, and 
given a deterministic law of evolution, the state of the system can then be tracked 
for a finite time only. 

The study of long-time dynamics thus requires trading in the evolution of a 
single state space point for the evolution of a measure, or the density of repre- 
sentative points in state space, acted upon by an evolution operator. Essentially 
this means trading in nonlinear dynamical equations on a finite dimensional space 
x - (x\ , X2 • ■ ■ Xd) for a linear equation on an infinite dimensional vector space of 
density functions p(x). For finite times and for maps such densities are evolved by 
the Perron-Frobenius operator, 

p(x,t) =(£ , op){x), 

and in a differential formulation they satisfy the continuity equation: 

d t p + d ■ (pv) = 0 . 

The most physical of stationary measures is the natural measure, a measure robust 
under perturbations by weak noise. 

For long times the dynamics is described in terms of stationary measures, i.e., 
fixed points of the appropriate evolution operators. Reformulated this way, clas- 
sical dynamics takes on a distinctly quantum-mechanical flavor. If the Lyapunov 
time (1.1), the time after which the notion of an individual deterministic trajectory 
loses meaning, is much shorter than the observation time, the "sharp" observables 
are those dual to time, the eigenvalues of evolution operators. This is very much 
the same situation as in quantum mechanics; as atomic time scales are so short, 
what is measured is the energy, the quantum-mechanical observable dual to the 
time. Both in classical and quantum mechanics one has a choice of implementing 
dynamical evolution on densities ("Schrodinger picture," sect. 16.5) or on observ- 
ables ("Heisenberg picture," sect. 17.2 and chapter 18). 

In what follows we shall find the second formulation more convenient, but the 
alternative is worth keeping in mind when posing and solving invariant density 
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problems. However, as classical evolution operators are not unitary, their eigen- 
function can be quite singular and difficult to work with. In what follows we 
shall learn how to avoid dealing with these eigenstates altogether. As a matter of 
fact, what follows will be a labor of radical deconstruction; after having argued so 
strenuously here that only smooth measures are "natural," we shall merrily pro- 
ceed to erect the whole edifice of our theory on periodic orbits, i.e., objects that 
are ^-functions in state space. The trick is that each comes with an interval, its 
neighborhood - periodic points only serve to pin these intervals, just as millime- 
ter markings on a measuring rod are used to partition a continuum into intervals. 

Commentary 

Remark 16.1 Ergodic theory: An overview of ergodic theory is outside the scope of 
this book: the interested reader may find it useful to consult refs. [ 1, 3, 4, 5]. The exis- 
tence of time average (16.21) is the basic result of ergodic theory, known as the Birkhoff 
theorem, see for example refs. [ 1, 25], or the statement of theorem 7.3.1 in ref. [ 12]. The 
natural measure (16.19) of sect. 16.4.1 is often referred to as the SRB or Sinai-Ruelle- 
Bowen measure [29, 28, 32]. 

There is much literature on explicit form of natural measure for special classes of 1- 
dimensional maps [ 1 9, 14, 15] - J. M. Aguirregabiria [16], for example, discusses several 
families of maps with known smooth measure, and behavior of measure under smooth 
conjugacies. As no such explicit formulas exist for higher dimensions and general dy- 
namical systems, we do not discuss such measures here. 

Remark 16.2 Time evolution as a Lie group: Time evolution of sect. 16.5 is an 
example of a 1-parameter Lie group. Consult, for example, chapter 2. of ref. [ 13] for a 
clear and pedagogical introduction to Lie groups of transformations. For a discussion of 
the bounded semigroups of page 358 see, for example, Marsden and Hughes [6]. 

Remark 16.3 Discretization of the Perron-Frobenius operator operator It is an old 

idea of Ulam [ 1 8] that such an approximation for the Perron-Frobenius operator is a 
meaningful one. The piecewise-linear approximation of the Perron-Frobenius operator 
(16.14) has been shown to reproduce the spectrum for expanding maps, once finer and 
finer Markov partitions are used [19, 23, 20]. The subtle point of choosing a state space 
partitioning for a "generic case" is discussed in ref. [21, 22]. 



Remark 16.4 The sign convention of the Poisson bracket: The Poisson bracket 
is antisymmetric in its arguments and there is a freedom to define it with either sign 
convention. When such freedom exists, it is certain that both conventions are in use 
and this is no exception. In some texts [8, 7] you will see the right hand side of (16.32) 
defined as \B,A) so that (16.33) is ^ = \Q,H). Other equally reputable texts [24] employ 
the convention used here. Landau and Lifshitz [8] denote a Poisson bracket by [A, B], 
notation that we reserve here for the quantum-mechanical commutator. As long as one is 
consistent, there should be no problem. 
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Remark 16.5 "Anon it lives"? "Anon it lives" refers to a statue of King Leontes's wife, 
Hermione, who died in a fit of grief after he unjustly accused her of infidelity. Twenty 
years later, the servant Paulina shows Leontes this statue of Hermione. When he repents, 
the statue comes to life. Or perhaps Hermione actually lived and Paulina has kept her 
hidden all these years. The text of the play seems deliberately ambiguous. It is probably 
a parable for the resurrection of Christ. (John F. Gibson) 



Exercises 



16.1. Integrating over Dirac delta functions. Check the 
delta function integrals in 

(a) 1 dimension (16.7), 

fdxS(h(x)) = £ * (16.40) 

(b) and in d dimensions ( 16.8), h : R d -> R d , 



dx6(h(x)) 



?1; 



dx 8(h(x)) 



E 

\x:h(x)=Q] 



det 



.41) 



/e 2<r 
dx ; f(x) . 
V2^ 



Use this approximation to see whether the formal 
expression 



Jit 



dxd(x z ) 
makes sense. 

16.2. Derivatives of Dirac delta functions. 

a* 



Consider 



Using integration by parts, determine the value of 



fdxb(x)6^(y) = J] -1 

•J i ,.\-m " ' 



|a:v(a')=0) 



Vy" 



+b\3 



//\2 



y 



0") 3 

L6.44) 



16.3. 



CvO (y'> 

These formulas are useful for computing effects of weak 
noise on deterministic dynamics [ ]. 

X' generates a semigroup. Check that the Perron- 
Frobenius operator has the semigroup property, 

dzL h (y, z) £" (z, x) = £ h - +h (y,x), t , , t 2 > 0 .( 1 



Jm 



where Mj are arbitrarily small regions enclosing 
the zeros Xj (with Xj not on the boundary dMj). 
For a refresher on Jacobian determinants, read, for ^ 
example, Stone and Goldbart Sect. 12.2.2. 

(c) The delta function can be approximated by a se- 
quence of Gaussians 



As the flows in which we tend to be interested are in- 
vertible, the _£'s that we will use often do form a group, 
with t\ , t2 e R. 

Escape rate of the tent map. 

(a) Calculate by numerical experimentation the log of 
the fraction of trajectories remaining trapped in 
the interval [0, 1] for the tent map 

/(jc)=a(l-2|jc-0.5|) 

for several values of a. 

(b) Determine analytically the a dependence of the es- 
cape rate y{a). 

(c) Compare your results for (a) and (b). 

16.5. Invariant measure. We will compute the invariant 
measure for two different piecewise linear maps. 



dx 8'{y) 
dx6 {2) (y) 



where y = f(x) - x (16.42) 



s 

lx:y(x)=0) 



i LV'f 



\y'\ { (y'f V) 
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(a) Verify the matrix £, representation (16.13). 

(b) The maximum value of the first map is 1 . Com- 
pute an invariant measure for this map. 

(c) Compute the leading eigenvalue of X for this map. 

(d) For this map there is an infinite number of in- 
variant measures, but only one of them will be 
found when one carries out a numerical simula- 
tion. Determine that measure, and explain why 
your choice is the natural measure for this map. 

(e) In the second map the maximum occurs at a — 
(3 - V5)/2 and the slopes are +( V5 + l)/2. Find 
the natural measure for this map. Show that it is 
piecewise linear and that the ratio of its two values 
is(V5 + l)/2. 

(medium difficulty) 

16.6. Escape rate for a flow conserving map. Adjust A 0 , 
Ai in (16.1 1) so that the gap between the intervals Mo, 
M\ vanishes. Show that the escape rate equals zero in 
this situation. 

16.7. Eigenvalues of the Perron-Frobenius operator for the 
skew full tent map. Show that for the skew full tent 
map 




the eigenvalues are available analytically, compute the 
first few. 

16.8. "Kissing disks"* (continuation of exercises 8.1 and 
8.2) Close off the escape by setting R — 2, and look 
in real time at the density of the Poincare section iter- 
ates for a trajectory with a randomly chosen initial con- 
dition. Does it look uniform? Should it be uniform? 
(Hint - phase-space volumes are preserved for Hamil- 
tonian flows by the Liouville theorem). Do you notice 
the trajectories that loiter near special regions of phase 
space for long times? These exemplify "intermittency," 
a bit of unpleasantness to which we shall return in chap- 
ter 24. 



16.9. Invariant measure for the Gauss map. 

the Gauss map: 



16.10. 



/(*) = 



fo(x) = A Q x , 



16.11. 



x € Mo = [0, 1/A 0 ) 
xeMi =(1/A 0) 1]. 



fix) 



Hi] 



0 



x + 0 
x = 0 



Consider 



(16.47) 



where [ ] denotes the integer part. 

(a) Verify that the density 
1 1 



P(x) = 



log 2 1 + x 
is an invariant measure for the map. 
(b) Is it the natural measure? 

J{ as a generator of translations. Verify that for 
a constant velocity field the evolution generator J{ in 
(16.30) is the generator of translations, 

e fv ^a(x) = a(x + tv) . 

Incompressible flows. Show that (16.9) implies that 
Po(x) = 1 is an eigenfunction of a volume-preserving 
flow with eigenvalue so = 0. In particular, this im- 
plies that the natural measure of hyperbolic and mixing 
Hamiltonian flows is uniform. Compare this results with 
ft§-#6merical experiment of exercise 16.8. 
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Chapter 17 
Averaging 



Why think when you can compute? 
— Maciej Zworski 

We discuss first the necessity of studying the averages of observables in 
chaotic dynamics. A time average of an observable is computed by inte- 
grating its value along a trajectory. The integral along trajectory can be 
split into a sum of over integrals evaluated on trajectory segments; if exponenti- 
ated, this yields a multiplicative weight for successive trajectory segments. This 
elementary observation will enable us to recast the formulas for averages in a mul- 
tiplicative form that motivates the introduction of evolution operators and further 
formal developments to come. The main result is that any dynamical average mea- 
surable in a chaotic system can be extracted from the spectrum of an appropriately 
constructed evolution operator. In order to keep our toes closer to the ground, in 
sect. 17.4 we try out the formalism on the first quantitative diagnosis whether a 
system is chaotic, the Lyapunov exponent. 



17.1 Dynamical averaging 

In chaotic dynamics detailed prediction is impossible, as any finitely specified ini- 
tial condition, no matter how precise, will fill out the entire accessible state space 
after a finite Lyapunov time (1.1). Hence for chaotic dynamics one cannot follow 
individual trajectories for a long time; what is attainable, however, is a description 
of the geometry of the set of possible outcomes, and the evaluation of long-time 
averages. Examples of such averages are transport coefficients for chaotic dynam- 
ical flows, such as escape rates, mean drifts and diffusion rates; power spectra; and 
a host of mathematical constructs such as generalized dimensions, entropies, and 
Lyapunov exponents. Here we outline how such averages are evaluated within the 
evolution operator framework. The key idea is to replace the expectation values of 
observables by the expectation values of generating functionals. This associates 
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an evolution operator with a given observable, and relates the expectation value of 
the observable to the leading eigenvalue of the evolution operator. 

17.1.1 Time averages 

Let a = a{x) be any observable, a function that associates to each point in state 
space a number, a vector, or a tensor. The observable reports on a property of 
the dynamical system. The observable is a device, such as a thermometer or laser 
Doppler velocitometer. The device itself does not change during the measurement. 
The velocity field dj{x) = Vi(x) is an example of a vector observable; the length 
of this vector, or perhaps a temperature measured in an experiment at instant t 
are examples of scalar observables. We define the integrated observable A as the 
time integral of the observable a evaluated along the trajectory of the initial point 
xq, 

A\x 0 )= f dra[x(T)], x(t)=f(xo). (17.1) 
Jo 

If the dynamics are given by an iterated mapping and the time is discrete, the 
integrated observable after n iterations is given by 

n-\ 

A"(x 0 ) = J] a(x k ) , x k = f k (x 0 )) (17.2) 
k=o 

(we suppress vectorial indices for the time being). 

Example 17.1 Integrated observables. (a) If the observable is the velocity, a^x) = 
Vi(x), its time integral A'^xo) is the trajectory A'^xq) - xt(t). 

(b) For Hamiltonian flows the action associated with a trajectory x(t) = [q(t), p(i)] 
passing through a phase-space point x Q = [q(Q),p(Q)] is: 

A'(x 0 )= f drq(T)-p(r). (17.3) 
Jo 

The time average of the observable along an orbit is denned by 



a(x 0 ) = lim -A\xq) . (17.4) 

r^oo f 

If a does not behave too wildly as a function of time - for example, if a(x) is the 
Chicago temperature, bounded between -S(fF and +130°F for all times - A\xq) 
is expected to grow no faster than t, and the limit (17.4) exists. For an example of 
a time average - the Lyapunov exponent - see sect. 17.4. 
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Figure 17.1: (a) A typical chaotic trajectory ex- 
plores the state space with the long time visitation 
frequency building up the natural measure f>o(x). 
(b) time average evaluated along an atypical tra- 
jectory such as a periodic orbit fails to explore the 
entire accessible state space. (A. Johansen) 




The time average is a property of the orbit, independent of the initial point on 
that orbit: if we start at a later state space point f{xo) we get a couple of extra 
finite contributions that vanish in the t — > oo limit: 



a[f T (x 0 )] 



i r +1 

lim - dTa[f T (xo)] 

a(x 0 ) - lim - ( I dra[f(x 0 )] - \ dTa[f{x 0 )] 



= a(x 0 ) . 



The integrated observable A'(xq) and the time average ci(xq) take a particularly 
simple form when evaluated on a periodic orbit. Define 



exercise 4.6 



cipTp 



J 0 p dr alfixo)] for a flow 
Z"=! a[f'(xo)] for a map 



x Q eM p , (17.5) 



where p is a prime cycle, T p is its period, and n p is its discrete time period in the 
case of iterated map dynamics. The quantity A p is a loop integral of the observable 
along a single traversal of a prime cycle p, so it is an intrinsic property of the cycle, 
independent of the starting point xo € M p . (If the observable a is not a scalar but 
a vector or matrix we might have to be more careful in defining an average which 
is independent of the starting point on the cycle). If the trajectory retraces itself 
r times, we just obtain A p repeated r times. Evaluation of the asymptotic time 
average (17.4) therefore requires only a single traversal of the cycle: 



p ■ 



(17.6) 



However, a(xo) is in general a wild function of ^3; for a hyperbolic system it 
takes the same value (a) for almost all initial ^o, but a different value (17.6) on 
any periodic orbit, i.e., on a dense set of points (figure 17.1 (b)). 



Example 17.2 Deterministic diffusion. The phase space of an open system such as 
the Sinai gas (an infinite 2-dimensional periodic array of scattering disks, see sect. 25. 1 ) 
is dense with initial points that correspond to periodic runaway trajectories. The mean 
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distance squared traversed by any such trajectory grows as x(t) 2 ~ t 2 , and its contri- 
bution to the diffusion rate D oc x(t) 2 /t, (17.4) evaluated with a(x) = x(t) 2 , diverges. 
Seemingly there is a paradox; even though intuition says the typical motion should be 
diffusive, we have an infinity of ballistic trajectories. 

For chaotic dynamical systems, this paradox is resolved by also averaging over 
the initial x and worrying about the measure of the 'pathological' trajectories, (contin- 
ued in example 1 7.3) 



section 25.1 



17.1.2 Spatial averages 

The space average of a quantity a evaluated over all state space trajectories x(t) at 
time t is given by the J-dimensional integral over all initial points xy at time t = 0: 



< a >W = 777, f dx 0 «M ? )] , x(t) = f(x 0 ) 

\M\ = I dx = volume of M . (17.7) 
JM 

The space M is assumed to have finite volume - open systems like the 3-disk game 
of pinball are discussed in sect. 17.3. 

What is it we really do in experiments? We cannot measure the time average 
(17.4), as there is no way to prepare a single initial condition with infinite preci- 
sion. The best we can do is prepare an initial density p(x), perhaps concentrated on 
some small (but always finite) neighborhood. Then we can abandon the uniform 
space average (17.7) and consider instead the weighted spatial average 



(a)Jt) 



= TTj-: \ dx 0 p(xo)a[x(t)], \M P \= \ dxp(x) . (17.8) 
\An p \J M J M 



For ergodic mixing systems, any smooth initial density will tend to the asymptotic 
natural measure in the t — > oo limit p(x, t) — > po(x). This allows us to take any 
smooth initial p(x) and define the expectation value (a) of an observable a as the 
asymptotic time and space average over the state space At 



(a) 



f dxa[x] = lim -J- f dx 0 \ f dra[x(t)]. (17.9) 
\M\ Jm r-*» \M\ J M t J Q 



We use the same (• • •) notation as for the space average (17.7) and distinguish the 
two by the presence of the time variable in the argument: if the quantity (a)(t) 
being averaged depends on time, then it is a space average; if it is the infinite time 
limit, it is the expectation value (a). 

The expectation value is a space average of time averages, with every x e M 
used as a starting point of a time average. The advantage of averaging over space 
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is that it smears the starting points which were problematic for the time average 
(such as periodic points). While easy to define, the expectation value (a) turns out 
not to be particularly tractable in practice. 

Here comes a simple idea that is the basis of all that follows: Such averages 
are more conveniently studied by investigating instead of (a) the space averages 
of form 



In the present context /3 is an auxiliary variable of no physical significance whose 
role is to enable us to recover the desired space average by differentiation, 



In most applications J3 is a scalar, but if the observable is a <i-dimensional vector 
a(x) € R d , then /? e W 1 ; if the observable is a [d x d] tensor, (3 is also a rank-2 
tensor, and so on. Here we will mostly limit the considerations to scalar fi. 

If the time average limit a(xo) (17.4) exists for 'almost all' initial xq's and 
the system is ergodic and mixing (in the sense of sect. 1.3.1), we expect the 
time average along almost all trajectories to tend to the same valued, and the 
integrated observable A' to tend to ta. The space average (17.10) is an integral 
over exponentials and hence also grows exponentially with time. So as t — > oo we 
would expect the space average of exp(/3 • A\x)) to grow exponentially with time 



Now we understand one reason for why it is smaller to compute (exp(yS • A')) 
rather than (a): the expectation value of the observable (17.9) and the moments of 
the integrated observable (17.1) can be computed by evaluating the derivatives of 




(17.10) 





and its rate of growth to be given by the limit 




(17.11) 




(17.12) 
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exercise 17.1 



and so forth. We have explicitly written out the formulas for a scalar observable; 
the vector case is worked out in exercise 17.1 (we could have used full derivative 
notation ds/dfi in (17.12), but for vector observable we do need partial derivatives 
dsldfii). If we can compute the function s(J3), we have the desired expectation 
value without having to estimate any infinite time limits from finite time data. 

Suppose we could evaluate s{f3) and its derivatives. What are such formulas 
good for? A typical application arises in the problem of determining transport 
coefficients from underlying deterministic dynamics. 



Example 17.3 Deterministic diffusion. (continued from example 17.2) Con- 
sider a point particle scattering elastically off a d-dimensional array of scatterers. If 
the scatterers are sufficiently large to block any infinite length free flights, the particle 
will diffuse chaotically and the transport coefficient of interest is the diffusion constant 
(*(f) 2 ) ~ 4Dr. In contrast to D estimated numerically from trajectories x(t) for finite 
but large t, the above formulas yield the asymptotic D without any extrapolations to the 
t — > oo limit. For example, for a -, = v, and zero mean drift (v;) = 0, in d dimensions the 
diffusion constant is given by the curvature of s(J3) at/3 -0, 



section 25.1 



D = lim — U(f) } = — > — * 
f-«° 2dt \ / 2d t-t dBr 

1=1 ^ I 



(17.13) 



0=0 



so if we can evaluate derivatives of s(j3), we can compute transport coefficients that 
characterize deterministic diffusion. As we shall see in chapter 25, periodic orbit theory 
yields an exact and explicit closed form expression for D. 



We turn to the problem of evaluating (^ A ) in sect. 17.2, but first we review 
some elementary notions of statistics that will be useful later on. 



fast track: 

sect. 17.2, p. 354 



17.1.3 Moments, cumulants 



Given a set of N data points, the unbiased empirical estimates for the 
empirical mean and the sample variance are 



j N l N 

a = — > a; , o~ 2 = > (a: - a) 

i=\ i=\ 



(17.14) 



(The N — 1 divisor in sample invariance formula has to do with ensuring that a 
minimizes & .) 
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The exact mean (or expectation or expected value) is the integral of the random 
variable with respect to its probability measure p, commonly denoted (• • •), E[- ■ •], 
or — , 



(a) = E[a] - a - I dx p(x)a(x) . (17.15) 
JM 



(In ChaosBook we use (• • -) p to denote an integral over state space weighted by 
p, and ~ to denote a time average.) The nth moment is the expectation (d 1 ). 
The moments about the mean, ((a - (a)y), are called central moments. The 
second central moment is the variance, and its positive square root is the stan- 
dard deviation cr. Standardized moment is the nth central moment divided by 
cr", ((a - (a))") /cr", a dimensionless representation of the distribution, indepen- 
dent of translations and linear changes of scale. Moments can be collected in the 
moment-generating function 



(^) = 1+ ZS<* n) ' (17 - 16) 



or, with some hindsight, in a more natural representation, the cumulant-generating 
function 



lo s( /fl } = Z!r< a ">- (17 - 17) 
1 ■ 

n=\ 



where the subscript c indicates a cumulant, or, in statistical mechanics and quan- 
tum field theory contexts, the 'connected Green's function'. Expanding log(g-) 
it is easy to check that the first cumulant is the mean, the second is the variance, 

(a 2 ) = a 2 = {{a - (a)) 2 ) - (a 2 ) - {a) 2 , (17.18) 
and ^a 3 ^ is the third central moment, or the skewness, 

(a 3 ) c = {(a - (a)) 3 ) = {a 3 ) - 3 {a 2 ) (a) + 2 (a) 3 . (17.19) 

The higher cumulants are neither moments nor central moments. The fourth cu- 
mulant, 

(a 4 ) c = ((a -{a)) 4 } -3 ((a -(a)) 2 ) 2 

= (a 4 )-4(a 3 ){a)-3(a 2 ) 2 + \2(a 2 ){a) 2 -6{a) A . (17.20) 
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Figure 17.2: Space averaging pieces together the 
time average computed along the / — > oo orbit 
of figure 17.1 by a space average over infinitely 
many short / trajectory segments starting at all ini- 
tial points at once. 




rewritten in terms of standardized moments, is known as the kurtosis: 

^< fl4 > c = ^4^-<^ 4 >- 3 - < 17 " 21 ) 

One of the reasons why cumulants are preferable to moments is that for a normal- 
ized Gaussian distribution all cumulants beyond the second one vanish, so they 
are a measure of deviation of statistics from the Gaussian one (see example25.3). 

In statistical mechanics and field theory, the partition function and the Helmholtz 
free energy have form 

ZQ3) = exp(-/?£) , F(fi) = - - lnZ(£) , (17.22) 

analogous to {e^ A and s(fi) in (17. 1 1). The derivatives s(J3) with respect to [3 then 
yield cumulants, or the Burnett coefficients (25.22). 



17.2 Evolution operators 



For it, the mystic evolution; 

Not the right only justified 

- what we call evil also justified. 

— Walt Whitman, 
Leaves of Grass: Song of the Universal 

The above simple shift of focus, from studying (a) to studying (exp (/? • A')) is 
the key to everything that follows. Make the dependence on the flow explicit by 
rewriting this quantity as 




(17.23) 



Here 5{y - f(x)) is the Dirac delta function: for a deterministic flow an initial 
point x maps into a unique point y at time t. Formally, all we have done above is 
to insert the identity 

1= f dyS(y-f\x)) , (17.24) 

J M 
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into (17.10) to make explicit the fact that we are averaging only over the trajec- 
tories that remain in M for all times. However, having made this substitution 
we have replaced the study of individual trajectories f(x) by studying the evolu- 
tion of the density of the totality of initial conditions. Instead of trying to extract a 
temporal average from an arbitrarily long trajectory which explores the state space 
ergodically, we can now probe the entire state space with short (and controllable) 
finite time pieces of trajectories originating from every point in At 

As a matter of fact (and that is why we went to the trouble of denning the gen- 
erator (16.27) of infinitesimal transformations of densities) infinitesimally short 
time evolution induced by the generator of (16.27) suffices to determine the 
spectrum and eigenvalues of £. 

We shall refer to the kernel of the operation (17.23) as the evolution operator 

£\y,x) = 6(y-f\x))^ A '^. (17.25) 

The simplest example is the ft = 0 case, i.e., the Perron-Frobenius operator intro- 
duced in sect. 16.2. Another example - designed to deliver the Lyapunov exponent 
- will be the evolution operator (17.42) discussed below. The action of the evolu- 
tion operator on a function <p is given by 

[-C>]G0 = f dx6(y-f t (x))^ A ' {x) 4>(x)- (17-26) 
Jm 

The evolution operator is different for different observables, as its definition 
depends on the choice of the integrated observable A in the exponential. Its job is 
to deliver the expectation value of a, but before showing that it accomplishes that, 
we need to verify the semigroup property of evolution operators. 

By its definition, the integral over the observable a is additive along the tra- 
jectory 

X(tl+t2) ^t»x(ti+t2) 
x(ti) x(tiK 
x(0) + 

dra[f(x)] + dra[f(x)] 
= AHxq) + A&(f (*>))• 

As A\x) is additive along the trajectory, the evolution operator generates a semi- 
group 

£<^(y,x)= f dz£' 2 (y,z)£ tl (z,x), (17.27) 
Jm 

as is easily checked by substitution 

[X f2 £ f '£?] (y)= f dx 5(y - f t2 (x))^ A ' Hx) [£ tl a] (x) = [£ tl+ ' 2 a] (y) . 



exercise 16.3 



section 16.5 
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This semigroup property is the main reason why (17.23) is preferable to (17.9) as 
a starting point for evaluation of dynamical averages: it recasts averaging in form 
of operators multiplicative along the flow. 

In terms of the evolution operator, the space average of the generating function 
(17.23) is given by 

(^ A ') = rLf dx f dycl>(y)-C!(y,x)<f>{x). 
x 1 \M\J M Jm 

where <p(x) is the constant function <f){x) - 1. If the linear operator Jl can be 
thought of as a matrix, high powers of a matrix are dominated by its fastest grow- 
ing matrix elements, and the limit (17.11) 

s(P) = lim - In (£) . (17.28) 

f— >oo t \ I 



yields the leading eigenvalue sq(J3), and, through it, all desired expectation values 
(17.12). 

In what follows we shall learn how to extract not only the leading eigenvalue 
of £} ', but much of the dominant part of its spectrum. Clearly, we are not interested 
into the eigenvalues of X' for any particular finite time t, but their behavior as 
t — > oo. That is achieved via a Laplace transform, see sect. 17.2.3. 



17.2.1 Spectrum of an evolution operator 

This operator is strange: it is not self-adjoint, so it is noth- 
ing good 

— Jean Bellissard 

An exposition of a subject is of necessity sequential and one cannot explain ev- 
erything at once. As we shall actually never use eigenfunctions of evolution oper- 
ators, we postpone their discussion to sect. 23.6. For the time being we ask the 
reader to accept uncritically the following sketch: 

Schematically, a linear operator has a spectrum of eigenvalues ^ and eigen- 
functions <p a (x) 

[ify«](jc) = e*"W*). a = 0,1,2,... (17.29) 

ordered so that Re s a > Res a+ i. For continuous time flow eigenvalues cannot 
depend on time, they are eigenvalues of the time-evolution generator (16.26) we 
always write the eigenvalues of an evolution operator in exponentiated form i a 
rather than as multipliers We find it convenient to write them this way both for 
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the continuous time £! and the discrete time £, - £} cases, and we shall assume 
that spectrum of X is discrete. 

is a lineal - operator acting on a density of initial conditions p(x), x e At, so 
the t — > oo limit will be dominated by so = s(y3), the leading eigenvalue of J2, 

[£ f P/?]()0 := f dxS(y-fix))^ A,< - 3i >p fi (x) = ^ ) p fi (y), (17.30) 

where Pyg(x) is the corresponding eigenfunction. For yS = 0 the evolution operator 
(17.25) is the Perron-Frobenius operator (16.10), with po(x) the natural measure. 

From now on we have to be careful to distinguish the two kinds of linear 
operators. In chapter 5 we have characterized the evolution of the local linear 
neighborhood of a state space trajectory by eigenvalues and eigenvalues of the 
linearized flow Jacobian matrices. Evolution operators described in this chapter 
are global, and they act on densities of orbits, not on individual trajectories. As 
we shall see, ne of the wonders of chaotic dynamics is that the more unstable 
individual trajectories, the nicer are the corresponding global density functions. 



17.2.2 Evolution for infinitesimal times 

For infinitesimal time 5t, the evolution operator (17.7) acts as 

p(y,5t) = J dxeP A5 ' (x) 8(y- f'(x))p{x,0) 



= J dxeP a{x)6t 5(y- x - 6tv(x))p(x,0) 
= (1+»j8oG0) 



p(y,0)-6tv^p(y,0) 



(the denominator arises from the 5t linearization of the jacobian) giving the conti- 
nuity equation (16.25) a source term 

^ + ^-(v iP )=pap. (17.31) 
at oxi 

The evolution generator (16.27) eigenfunctions now satisfy 

(s(/3)-J{)p(x,j3) =/3a(x)p{x,f3). (17.32) 
Differentiating with respect to ft 

d did 
s'(J3)p(x,p) + s(p)—p(x,p) + —Ux)—p(x,p) 

d 

= a(x) p(x, P)+/3a(x)— p(x, p) 
dp 
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In the vanishing auxiliary parameter limit — > 0, we have s(0) = 0, p(x, 0) = po{x) 

s'(0)p Q (x) + Li(x) p(x, 0)j = a(*)po(X) ■ 
By integrating, the second term vanishes by Gauss' theorem 

s'(0) - J" dx a(x) po(x) = (a) , 

verifying equation (17.8): spatial average of the observable a is given by the 
derivative of the leading eigenvalue / (0). 



fast track: 
sect. 1 8, p. 366 



17.2.3 Resolvent of £ 

Here we limit ourselves to a brief remark about the notion of the 'spectrum' of a 
linear operator. 

The Perron-Frobenius operator X acts multiplicatively in time, so it is reason- 
able to suppose that there exist constants M > 0, so > 0 such that ||X f || < Me ts ° for 
all t > 0. What does that mean? The operator norm is defined in the same spirit in 
which one defines matrix norms: We are assuming that no value of Jlp{x) grows 
faster than exponentially for any choice of function p{x), so that the fastest pos- 
sible growth can be bounded by £ So , a reasonable expectation in the light of the 
simplest example studied so far, the escape rate (1.3). If that is so, multiplying 
£! by e~ ts ° we construct a new operator e~ tSa £! - e"- s °' which decays exponen- 
tially for large t, \\e' m ' So) \\ < M. We say that e^L* is an element of a bounded 
semigroup with generator - sqI. Given this bound, it follows by the Laplace 
transform 



f 

Jo 



dte- st £' = — !— , Re s > s 0 , (17.33) 
s - y\ 



that the resolvent operator {s - tA) 1 is bounded 
1 



dte- s, Me tS0 = . (17.34) 

Jo s - s 0 



If one is interested in the spectrum of X, as we will be, the resolvent operator is a 
natural object to study; it has no time dependence, and it is bounded. It is called 
'resolvent' because it separates the spectrum of X into individual constituents, one 
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for each spectral 'line'. From (17.28), it is clear that the leading eigenvalue sq(J3) 
corresponds to the pole in (17.34); as we shall see in chapter 18, the rest of the 
spectrum is similarly resolved into further poles of the Laplace transform. 

The main lesson of this brief aside is that for continuous time flows, the 
Laplace transform is the tool that brings down the generator in (16.29) into the 
resolvent form (17.33) and enables us to study its spectrum. 



17.3 Averaging in open systems 



If At is a compact region or set of regions to which the dynamics is con- 
fined for all times, (17.9) is a sensible definition of the expectation value. How- 
ever, if the trajectories can exit At without ever returning, 



\ dy 8{y - f(x 0 )) = 0 for t > t exit , x 0 e At , 
JM 



we might be in trouble. In particular, a repeller is a dynamical system for which 
the trajectory f(xo) eventually leaves the region At, unless the initial point is 
on the repeller, so the identity 



I dy 6(y - /'(xq)) = 1 , t > 0, iff xq e non-wandering set (17.35) 
JM 



might apply only to a fractal subset of initial points of zero Lebesgue measure 
(non-wandering set is denned in sect. 2.1.1). Clearly, for open systems we need 
to modify the definition of the expectation value to restrict it to the dynamics on 
the non-wandering set, the set of trajectories which are confined for all times. 

Denote by At a state space region that encloses all interesting initial points, 
say the 3-disk Poincare section constructed from the disk boundaries and all pos- 
sible incidence angles, and denote by |At| the volume of At. The volume of state 
space containing all trajectories, which start out within the state space region At 
and recur within that region at time t, is given by 

|At(0l= f dxdydiy-fXxj) ~ \M\e' yt . (17.36) 

JM 

As we have already seen in sect. 1.4.3, this volume is expected to decrease ex- 
ponentially, with the escape rate y. The integral over x takes care of all possible 
initial points; the integral over y checks whether their trajectories are still within 
At by the time t. For example, any trajectory that falls off the pinball table in 
figure 1.1 is gone for good. 



section 22.1 
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Figure 17.3: A piecewise-linear repeller (16.11): All 
trajectories that land in the gap between the fo and f\ 
branches escape (Ao = 4, Ai = -2). 




If we expand an initial distribution p(x) in (17.29), the eigenfunction basis 
p( x ) - 2« dafaix) , we can also understand the rate of convergence of finite-time 
estimates to the asymptotic escape rate. For an open system the fraction of trapped 
trajectories decays as 



f M dx[£'p] (x) f M dx<p a (x) 

T M (t) - — r y e a a a — 

J M dxp(x) V J M dxp(x) 

e s °' ((const.) + O(e (si - So)t j) . 



- „so 



(17.37) 



section 17.3 



The constant depends on the initial density p(x) and the geometry of state space 
cutoff region Ai, but the escape rate y = -so is an intrinsic property of the re- 
pelling set. We see, at least heuristically, that the leading eigenvalue of Jl domi- 
nates T y vi(?) and yields the escape rate, a measurable property of a given repeller. 

The non-wandering set can be very difficult to describe; but for any finite 
time we can construct a normalized measure from the finite-time covering volume 
(17.36), by redefining the space average (17.10) as 



dx 

m 



e/M'M ~ _L f dx( fi-AKx)+yt_ 
\M\J M 



(17.38) 



in order to compensate for the exponential decrease of the number of surviving 
trajectories in an open system with the exponentially growing factor S" '. What 
does this mean? Once we have computed y we can replenish the density lost to 
escaping trajectories, by pumping in e yt of new trajectories in such a way that the 
overall measure is correctly normalized at all times, (1) = 1. 



Example 17.4 Escape rate for a piecewise-linear repeller: (continuation of exam- 
ple 16.1) What is gained by reformulating the dynamics in terms of 'operators'? We 
start by considering a simple example in which the operator is a [2x2] matrix. Assume 
the expanding l-dimensional map f(x) of figure 17.3, a piecewise-linear 2-branch re- 
peller (16.11). Assume a piecewise constant density (16.12). There is no need to 
define p(x) in the gap between Mo andM\, as any point that lands in the gap escapes. 
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The physical motivation for studying this kind of mapping is the pinball game: f 
is the simplest model for the pinball escape, figure 1.8, with f 0 and f\ modelling its two 
strips of survivors. 

As can be easily checked using (16.9), the Perron-Frobenius operator acts on 
this piecewise constant function as a [2x2] 'transfer' matrix ( 16. 13) 



po )^£p = ( ¥ ¥ )( po 

Pi/ \ WT\ /\Pi 



exercise 16.1 
exercise 16.5 



stretching both p 0 and p\ over the whole unit interval A, and decreasing the density at 
every iteration. In this example the density is constant after one iteration, so £ has only 
one non-zero eigenvalue e s ° = 1/|A 0 | + l/|Ai| < 1, with constant density eigenvector 
Po = p\. The quantities 1/|A 0 |, l/|Ai| are, respectively, the sizes of the \Mo\, \Mi\ 
intervals, so the exact escape rate (1.3) - the log of the fraction of survivors at each 
iteration for this linear repeller - is given by the sole eigenvalue of £: 

y = -s 0 = -ln(l/|Ao| + l/|Aj|). (17.39) 

Voila! Here is the rationale for introducing operators - in one time step we have solved 
the problem of evaluating escape rates at infinite time, (continued in example 23.5) 



17.4 Evolution operator evaluation of Lyapunov exponents 



A solution to these problems was offered in sect. 17.2: replace time averaging 
along a single orbit by action of a multiplicative evolution operator on the entire 
state space, and extract the state space average of the Lyapunov exponent from its 
leading eigenvalue, computed from finite length cycles. The main idea - what is 
the Lyapunov 'observable' - can be illustrated by the dynamics of a 1-dimensional 
map. 



Example 17.5 Lyapunov exponent, discrete time 1-dimensional dynamics. Due 

to the chain rule (4.22) for the derivative of an iterated map, the stability of a 1 -dimensional 
mapping is multiplicative along the flow, so the integral (17.1) of the observable a(x) = 
lnl/'O)!, the local trajectory divergence rate, evaluated along the trajectory of x 0 , is 
additive: 

n-\ 

A"(x 0 ) = In \f" \xo)\ = J] In \f'(x k )\ . (17.40) 

(t=0 

For a 1-dimensional iterative mapping, the Lyapunov exponent is then the expectation 
value (17.9) given by a spatial integral (17.8) weighted by the natural measure 

A=(ln\f'(x)\)= f dxp {) (x)\n\f'(x)\. (17.41) 
Jm 

The associated (discrete time) evolution operator (17.25) is 

£(y, x) = 6(y- /(*)) e^ ln|/ ' (jc)l . (17.42) 
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Here we have restricted our considerations to l-d maps, as for higher-dimensional 
flows only the Jacobian matrices are multiplicative, not the individual eigenvalues. 
Construction of the evolution operator for evaluation of the Lyapunov spectra for 
a J-dimensional flow requires more skill than warranted at this stage in the narra- 
tive: an extension of the evolution equations to a flow in the tangent space. 

If the chaotic motion fills the whole state space, we are indeed computing the 
asymptotic Lyapunov exponent. If the chaotic motion is transient, leading even- 
tually to some long attractive cycle, our Lyapunov exponent, computed on a non- 
wandering set, will characterize the chaotic transient; this is actually what any 
experiment would measure, as even a very small amount of external noise suffices 
to destabilize a long stable cycle with a minute immediate basin of attraction. 

All that remains is to determine the value of the Lyapunov exponent 

dsiJS) 



A = (\n\f\x)\) 



= s'(0) (17.43) 

/?=o 



from (17.12), the derivative of the leading eigenvalue sq(P) of the evolution oper- 
ator (17.42). 

The only question is: How? (By chapter 20 you will know.) 

Resume 

The expectation value (a) of an observable a(x) integrated, A(x) = J dTa(x(r)), 
and time averaged, A' It, over the trajectory x — > x{t) is given by the derivative 



(a) = — 

N 1 dp 



example 20.2 



of the leading eigenvalue of the evolution operator £. 

By computing the leading eigenfunction of the Perron-Frobenius operator 
(16.10), one obtains the expectation value (16.20) of any observable a(x). Thus 
we can construct a specific, hand-tailored evolution operator £, for each and every 
observable. The good news is that, by the time we arrive at chapter 20, the scaf- 
folding will be removed, both X's and their eigenfunctions will be gone, and only 
the explicit and exact periodic orbit formulas for expectation values of observables 
will remain. 

The next question is: How do we evaluate the eigenvalues of X? In exam- 
ple 17.4, we saw a piecewise-linear example where these operators reduce to fi- 
nite matrices L, but for generic smooth flows, they are infinite-dimensional linear 
operators, and finding smart ways of computing their eigenvalues requires some 



chapter 20 
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thought. In chapter 1 1 we undertook the first step, and replaced the ad hoc parti- 
tioning (16.14) by the intrinsic, topologically invariant partitioning. In chapterl5 
we applied this information to our first application of the evolution operator for- 
malism, evaluation of the topological entropy, and the growth rate of the number 
of topologically distinct orbits. In chapters 18 and 19, this small victory will 
be refashioned into a systematic method for computing eigenvalues of evolution 
operators in terms of periodic orbits. 



Remark 17.1 'Pressure'. The quantity (exp(/? ■ A')) is called a 'partition function' by 
Ruelle [ ]. Mathematicians decorate it with considerably more Greek and Gothic letters 
than is done in this treatise. Ruelle [1] and Bowen [2] had given name 'pressure' or 
'topological pressure' P(a) to s(p) (where a is the observable introduced in sect. 17.1.1), 
defined by the 'large system' limit (17.11). As we shall also apply the theory to com- 
putating the physical gas pressure exerted on the walls of a container by a bouncing 
particle , we refer to s{J3) as simply the leading eigenvalue of the evolution operator in- 
troduced in sect. 16.5. The 'convexity' properties such as P(a) < P(\a\) will be pretty 
obvious consequences of the definition ( 17.1 1). In the case that X is the Perron-Frobenius 
operator (16.10), the eigenvalues {sq(J3), si(J3), • • •} are called the Ruelle-Pollicott reso- 
nances [3, 4, 5], with the leading one, sifi) — sq(J3) being the one of main physical interest. 

In order to aid the reader in digesting the mathematics literature, we shall try to point out 
the notational correspondences whenever appropriate. The rigorous formalism is replete 
with lims, sups, infs, f2-sets which are not really essential to understanding of the theory, 
and are avoided in this book. 

Remark 17.2 Microcanonical ensemble. In statistical mechanics the space average 
(17.7) performed over the Hamiltonian system constant energy surface invariant measure 
p(x)dx = dqdp 6(H(q, p) - E) of volume a>(E) = J M dqdp 6(H(q, p) - E) 



is called the microcanonical ensemble average. 

Remark 17.3 State space discretization. Ref. [10] discusses numerical discretiza- 
tons of state space, and construction of Perron-Frobenius operators as stochastic matrices, 
or directed weighted graphs, as coarse-grained models of the global dynamics, with trans- 
port rates between state space partitions computed using this matrix of transition proba- 
bilities; a rigorous discussion of some of the former features is included in ref. [ 1 1]. 



average - 20mar2013 ChaosBook.org versionl4.4.1, Apr 21 2013 



Commentary 




(17.44) 



EXERCISES 



364 



Exercises 



17.1. Expectation value of a vector observable. 

Check and extend the expectation value formulas 
(17.12) by evaluating the derivatives of s(J3) up to 4-th 
order for the space average (exp(J3 ■ A')) with a, a vector 
quantity: 



(a) 



(b) 



ds_ 



(8=0 



d 2 s 



= limi(A?) = (a t ) , (17.45) 



>'=() 



Note that the formalism is smart: it automatically 
yields the variance from the mean, rather than 
simply the 2nd moment (a 2 )- 
(c) compute the third derivative of s(J3). 



(d) compute the fourth derivative assuming that the 
mean in (17.45) vanishes, (a,) = 0. The 4-th order 
moment formula 



K(t) 



(17.46) 



that you have derived is known as kurtosis ( 17.21): 
it measures a deviation from what the 4-th order 
moment would be were the distribution a pure 
Gaussian (see (25.22) for a concrete example). 
If the observable is a vector, the kurtosis K(f) is 
given by 

Z u [{AjAjAjAj) + 2 ((AiAj) (AjA,) - (A-,A t ) (A? 
(XiiAiAt)) 2 

Pinball escape rate from numerical simulation*. 

Estimate the escape rate for R : a = 6 3-disk pinball 
by shooting 100,000 randomly initiated pinballs into the 
3-disk system and plotting the logarithm of the number 
of trapped orbits as function of time. For comparison, 
a numerical simulation of ref. [3] yields y = .410 
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Chapter 18 



Trace formulas 



The trace formula is not a formula, it is an idea. 
— Martin Gutzwiller 

Dynamics is posed in terms of local equations, but the ergodic averages re- 
quire global information. How can we use a local description of a flow 
to learn something about the global behavior? In chapter 17 we have re- 
lated global averages to the eigenvalues of appropriate evolution operators. Here 
we show that the traces of evolution operators can be evaluated as integrals over 
Dirac delta functions, and in this way the spectra of evolution operators become 
related to periodic orbits. If there is one idea that one should learn about chaotic 
dynamics, it happens in this chapter, and it is this: there is a fundamental local <-> 
global duality which says that 

the spectrum of eigenvalues is dual to the spectrum of periodic orbits 

For dynamics on the circle, this is called Fourier analysis; for dynamics on well- 
tiled manifolds, Selberg traces and zetas; and for generic nonlinear dynamical 
systems the duality is embodied in the trace formulas that we will now derive. 
These objects are to dynamics what partition functions are to statistical mechanics. 

The above phrasing is a bit too highfalutin, so it perhaps pays to go again 
through the quick sketch of sects. 1.5 and 1.6. We have a state space that we 
would like to tessellate by periodic orbits, one short orbit per neighborhood, as in 
figure 18.1 (a). How big is the neighborhood of a given cycle? 

Along stable directions neighbors of the periodic orbit get closer with time, 
so we only have to keep track of those who are moving away along the unsta- 
ble directions. The fraction of those who remain in the neighborhood for one 
recurrence time T p is given by the overlap ratio along the initial sphere and the 
returning ellipsoid, figure 18.1(b), and along the expanding eigen-direction 4'^ 
of J p (x) this is given by the expanding Floquet multiplier l/jAp^l. A bit more 
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Figure 18.1: (a) Smooth dynamics tesselated by 
the skeleton of periodic points, together with their 
linearized neighborhoods, (b) Jacobian matrix J p 
maps spherical neighborhood of — > ellipsoidal 
neighborhood time T p later, with the overlap ratio 
along the expanding eigdirection e (,) of J p (x) given 
by the expanding eigenvalue 1/]A JM |. 



(a) 




thinking leads to the conclusion that one also cares about how long it takes to re- 
turn (the long returns contributing less to the time averages), so the weight tp 
of the /^-neighborhood t p = e~ sTp /\A p \ decreases exponentially both with the 
shortest recurrence period and the product (5.9) of expanding Floquet multipli- 
ers A p = Yl e ■ With emphasis on expanding - the flow could be a 60,000- 
dimensional dissipative flow, and still the neighborhood is defined by the handful 
of expanding eigen-directions. Now the long-time average of a physical observ- 
able -let us say power D dissipated by viscous friction of a fluid flowing through a 
pipe- can be estimated by its mean value (17.6) D p /T p computed on each neigh- 
borhood, and weighted by the above estimate 



Wrong in detail, this estimate is the crux of many a Phys. Rev. Letter, and in its 
essence the key result of this chapter, the 'trace formula.' Here we redo the argu- 
ment in a bit greater depth, and derive the correct formula (20.23) for a long time 
average (D) as a weighted sum over periodic orbits. It will take three chapters, 
but it is worth it - the reward is an exact (i.e., not heuristic) and highly convergent 
and controllable formula for computing averages over chaotic flows. 



18.1 A trace formula for maps 

Our extraction of the spectrum of X commences with the evaluation of the trace. 
As the case of discrete time mappings is somewhat simpler, we first derive the 
trace formula for maps, and then, in sect. 18.2, for flows. The final formula (18.23) 
covers both cases. 

To compute an expectation value using (17.23) we have to integrate over all 
the values of the kernel J2\x,y). Were £!' a matrix sum over its matrix elements 
would be dominated by the leading eigenvalue as n —> oo (we went through the 
argument in some detail in sect. 15. 1). As the trace of X" is also dominated by the 
leading eigenvalue as n — > oo, we might just as well look at the trace for which we 
have a very explicit formula 




exercise 15.3 
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On the other hand, by its matrix motivated definition, the trace is the sum over 
eigenvalues (17.29), 



ff=0 



(18.2) 



We find it convenient to write the eigenvalues as exponents i a rather than as 
multipliers A a , and we assume that spectrum of £, is discrete, $q, s\, S2, • • ; ordered 
so that Re s a > Re s a +\. 

For the time being we choose not to worry about convergence of such sums, 
ignore the question of what function space the eigenfunctions belong to, and com- 
pute the eigenvalue spectrum without constructing any explicit eigenfunctions. 
We shall revisit these issues in more depth in chapter 23, and discuss how lack of 
hyperbolicity leads to continuous spectra in chapter 24. 



18.1.1 Hyperbolicity assumption 



We have learned in sect. 16.2 how to evaluate the delta-function integral (18.1). 

According to (16.8) the trace (18.1) picks up a contribution whenever x - 
f n (x) = 0, i.e., whenever x belongs to a periodic orbit. For reasons which we 
will explain in sect. 18.2, it is wisest to start by focusing on discrete time systems. 
The contribution of an isolated prime cycle p of period }y for a map / can be 
evaluated by restricting the integration to an infinitesimal open neighborhood Aj, 
around the cycle, 



tr p £ n " = f dx5{x- f n "{x)) 
JM P 

n v 1 — r 1 

det(l-M p ) UH-Ap.il 



section 16.2 



For the time being we set here and in (16.9) the observable $ A <> = 1. Periodic orbit 
Jacobian matrix M p is also known as the monodromy matrix, and its eigenvalues 

A„ i , A„ 2, ■ ■ ■, An d as the Floquet multipliers. 

p ' p ' Pl M r section 5.2.2 

We sort the eigenvalues A p j, A p 2, ■ ■ h~p,d of the ^-cycle [dxd] monodromy 
matrix M p into expanding, marginal and contracting sets {e, m, c), as in (5.8). As 
the integral (18.3) can be evaluated only if M p has no eigenvalue of unit magni- 
tude, we assume that no eigenvalue is marginal (we shall show in sect. 18.2 that 
the longitudinal A p £ / + i = 1 eigenvalue for flows can be eliminated by restricting 
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the consideration to the transverse monodromy matrix M p ), and factorize the trace 
(18.3) into a product over the expanding and the contracting eigenvalues 

Idetfl - Mj 1 = — ]~[ I! — — . (18.4) 

where A p = Yie^p,e is the product of expanding eigenvalues. Both Ap ;C and 
1/A p e are smaller than 1 in absolute value, and as they are either real or come in 
complex conjugate pairs we are allowed to drop the absolute value brackets | • • • | 
in the above products. 

The hyperbolicity assumption requires that the stabilities of all cycles included 
in the trace sums be exponentially bounded away from unity: 

|A p , e | > e AeTp any p, any expanding \A p>e \ > 1 

|Ap, f | < e~ AcTp any p, any contracting |A p>c | < 1 , (18.5) 

where A e , A c > 0 are strictly positive bounds on the expanding, contracting cycle 
Lyapunov exponents. If a dynamical system satisfies the hyperbolicity assump- 
tion (for example, the well separated 3-disk system clearly does), the £, spectrum 
will be relatively easy to control. If the expansion/contraction is slower than ex- 
ponential, let us say \A p j\ ~ T p 2 , the system may exhibit "phase transitions," and 
the analysis is much harder - we shall discuss this in chapter 24. 

Example 18.1 Elliptic stability. Elliptic stability, i.e., a pair of purely imaginary 
exponents A m = e ±,e is excluded by the hyperbolicity assumption. While the contribution 
of a single repeat of a cycle 



(1 -e ie )(l -e- ie ) 2(1 - cos 6) v ' ' 

does not make (16.9) diverge, if A m = e' 27Tp/r is rth root of unity, 1/ |det (l - M£| di- 
verges. For a generic 9 repeats cos(r#) behave badly and by ergodicity 1 - cos(rB) is 
arbitrarily small, 1 - cos(r#) < e, infinitely often. This goes by the name of "small divisor 
problem," and requires a separate treatment. 

It follows from (18.4) that for long times, t = rT p — > oo, only the product of 
expanding eigenvalues matters, |det(l - M^l — > \A p \ r . We shall use this fact to 
motivate the construction of dynamical zeta functions in sect. 19.3. However, for 
evaluation of the full spectrum the exact cycle weight (18.3) has to be kept. 

18.1.2 A classical trace formula for maps 

If the evolution is given by a discrete time mapping, and all periodic points have 
Floquet multipliers |A p ,| + 1 strictly bounded away from unity, the trace is 
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given by the sum over all periodic points i of period n: 



tr£ n = J dx£ n (x,x) = Yj 



|det (1 - M"(xi))\ 



(18.7) 



Here Fix f" - {x : f n (x) = x] is the set of all periodic points of period n, and 
Ai is the observable (17.5) evaluated over n discrete time steps along the cycle to 
which the periodic point Xj belongs. The weight follows from the properties of 
the Dirac delta function (16.8) by taking the determinant of b\(Xj - f n (x)j). If a 
trajectory retraces itself r times, its monodromy matrix is Af , where M p is the 
[dxd] monodromy matrix (4.5) evaluated along a single traversal of the prime 
cycle p. As we saw in (17.5), the integrated observable A n is additive along the 
cycle: If a prime cycle p trajectory retraces itself r times, n = rnp, we obtain A p 
repeated r times, A; = A n (xj) = rA p , x, € Ai p . 

A prime cycle is a single traversal of the orbit, and its label is a non-repeating 
symbol string. There is only one prime cycle for each cyclic permutation class. 
For example, the four periodic points 0011 = 1001 = 1100 = 0110 belong to the 
same prime cycle p = 0011 of length 4. As both the stability of a cycle and the 
weight A p are the same everywhere along the orbit, each prime cycle of length Up 
contributes n p terms to the sum, one for each periodic point. Hence (18.7) can be 
rewritten as a sum over all prime cycles and their repeats 

trX" - V n p V : -6 n , npr , (18.8) 

y ^|det(l-M£)| 

with the Kronecker delta 8 n , n r projecting out the periodic contributions of total 
period n. This constraint is awkward, and will be more awkward still for the 
continuous time flows, where it would yield a series of Dirac delta spikes. In both 
cases a Laplace transform rids us of the time periodicity constraint. 

In the sum over all cycle periods, 

^ z «tr C = tr -*±- = ^ n p £ *± , (18.9) 

l-z£ y ^|det(l-M;)| 

the constraint 6 nA r is replaced by weight z". Such discrete time Laplace trans- 
form of tr X" is usually referred to as a "generating function." Why this transform? 
We are actually not interested in evaluating the sum (18.8) for any particular fixed 
period n; what we are interested in is the long time n — > oo behavior. The trans- 
form trades in the large time n behavior for the small z behavior. Expressing the 
trace as in (18.2), in terms of the sum of the eigenvalues of £, we obtain the trace 
formula for maps: 

Zze Sa v~i z' lpr e r P' A r> 
_i — = y n y-t . (i8.io) 



chapter 1 1 



zeSa p ^det(l-M^) 
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This is our second example of the duality between the spectrum of eigenvalues 
and the spectrum of periodic orbits, announced in the introduction to this chapter. 
(The first example was the topological trace formula (15.10).) 




fast track: 

sect. 18.2, p. 372 



Example 18.2 A trace formula for transfer operators: For a piecewise-linear map 
(16.11), we can explicitly evaluate the trace formula. By the piecewise linearity and the 
chain rule A p = A"°A" [ , where the cycle p contains n 0 symbols 0 and «i symbols 1, the 
trace (18.7) reduces to 

trr = f ( n ) 1 = y f_J_ + _J_T , (18 . n) 



with eigenvalues 



\Ao\A k Q |Ai|A* 



(18.12) 



As the simplest example of spectrum for such dynamical system, consider the symmet- 
ric piecewise-linear 2-branch repeller (16.11) for which A = Ai = -A 0 . In this case all 

odd eigenvalues vanish, and the even eigenvalues are given by e Sl = 2/A k+l , k even. . 

©xsrciss l o. / 

Asymptotically the spectrum (18. 12) is dominated by the lesser of the two fixed 
point slopes A = A 0 (if\A Q \ < |Ai|, otherwise A = A\), and the eigenvalues e Sk fall off 
exponentially as l/A k , dominated by the single less unstable fixed-point. 

©Xcimpi© to.i 

For k = 0 this is in agreement with the explicit transfer matrix ( 16. 13) eigenval- 
ues ( 1 7.39). The alert reader should experience anxiety at this point. Is it not true that 
we have already written down explicitly the transfer operator in (16.13), and that it is 
clear by inspection that it has only one eigenvalue e s ° = 1 /|A 0 | + l/|Ai| ? The example 
at hand is one of the simplest illustrations of necessity of defining the space that the 
operator acts on in order to define the spectrum. The transfer operator (16.13) is 
the correct operator on the space of functions piecewise constant on the state space 
partition {Mo, Mi}; on this space the operator indeed has only the eigenvalue e So . As 
we shall see in example 23. 1, the full spectrum (18.12) corresponds to the action of the 
transfer operator on the space of real analytic functions. 

The Perron-Frobenius operator trace formula for the piecewise-linear map (16.11) 
follows from (18.9) 

tr U zto + E^T) (18 . 13) 
verifying the trace formula (18.10). 
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18.2 A trace formula for flows 



Amazing! I did not understand a single word. 
— Fritz Haake 



(R. Artuso and P. Cvitanovic) 



Our extraction of the spectrum of J2 commences with the evaluation of the trace 



We are not interested in any particular time t, but into the long-time behavior 
as t —> oo, so we need to transform the trace from the "time domain" into the 
"frequency domain." A generic flow is a semi-flow defined forward in time, so 
the appropriate transform is a Laplace rather than Fourier. 

For a continuous time flow, the Laplace transform of an evolution operator 
yields the resolvent (17.33). This is a delicate step, since the evolution operator 
becomes the identity in the t —> 0 + limit. In order to make sense of the trace we 
regularize the Laplace transform by a lower cutoff e smaller than the period of any 
periodic orbit, and write 



where J{ is the generator of the semigroup of dynamical evolution, see sect. 16.5. 
Our task is to evaluate tr £! from its explicit state space representation. 

18.2.1 Integration along the flow 

As any pair of nearby points on a cycle returns to itself exactly at each cycle 
period, the eigenvalue of the Jacobian matrix corresponding to the eigenvector 
along the flow necessarily equals unity for all periodic orbits. Hence for flows the 
trace integral tr £} requires a separate treatment for the longitudinal direction. To 
evaluate the contribution of an isolated prime cycle p of period T p , restrict the in- 
tegration to an infinitesimally thin tube M p enveloping the cycle (see figure 1.13), 
and consider a local coordinate system with a longitudinal coordinate d$ along 
the direction of the flow, and d- 1 transverse coordinates x ± , 




(18.14) 




(18.15) 




(18.16) 
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(we set /? = 0 in the exp(/3 • A') weight for the time being). Pick a point on the 
prime cycle p, and let 



V(*||) 



( d ) 1 ' 2 

2 



i=l 



(18.17) 



be the magnitude of the tangential velocity at any point x — (-Hfl , 0, • • • , 0) on the 
cycle p. The velocity v(x) must be strictly positive, as otherwise the orbit would 
stagnate for infinite time at v(x) = 0 points, and that would get us nowhere. 

As 0 < t < T p , the trajectory x\\(r) = f T (x p ) sweeps out the entire cycle, and 
for larger times x\\ is a cyclic variable of periodicity T p , 

x,|(t) =x\ [ {r + rT p ) r = l,2,... (18.18) 

We parametrize both the longitudinal coordinate X|(r) and the velocity v(r) = 
v(jc||(t)) by the flight time t, and rewrite the integral along the periodic orbit as 

£ dx\\ 6( X \l - f\x\l)) = £ dTv(T)6(x {l (T) - X||(T + t)) . (18.19) 
J p J p 

By the periodicity condition (18.18) the Dirac 5 function picks up contributions 
for t - rT p , so the Laplace transform can be split as 



/-»oo 00 

dt e- st 8{x\\{t) - x,|(t + t)) = V e~ sT " r I r 
Jo 7=1 

I r = J dt e~ st s(x\\(t) - X||(t + rT p + ij) . 



Taylor expanding and applying the periodicity condition (18.18), we have jc||(t + 
rTp + t) - X||(t) + v(r)t + ..., 



lr = f dte' st d(x,|(T) - x,|(t + rTp + /)) = -L , 



so the remaining integral (18.19) over r is simply the cycle period j> p dr = T p . 
The contribution of the longitudinal integral to the Laplace transform is thus 

dte~ st £ dx\\ <5(x|| - /(x,|)) - T p J] . (18.20) 
Jo Jp r=l 

This integration is a prototype of what needs to be done for each marginal direc- 
tion, whenever existence of a conserved quantity (energy in Hamiltonian flows, 
angular momentum, translational invariance, etc.) implies existence of a smooth 
manifold of equivalent (equivariant) solutions of dynamical equations. 
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18.2.2 Stability in the transverse directions 

Think of the r = 0 point in above integrals along the cycle p as a choice of a 
particular Poincare section. As we have shown in sect. 5.5, the transverse Flo- 
quet multipliers do not depend on the choice of a Poincare section, so ignoring 
the dependence on x\\(j) in evaluating the transverse integral in (18.16) is justi- 
fied. For the transverse integration variables the Jacobian matrix is defined in a 
reduced Poincare surface of section P of fixed x\. Linearization of the periodic 
flow transverse to the orbit yields 

f dx x 6(x ± - f r ± T "(x)) = ■ -i , (18.21) 

Jr det(l-iHj) 

where M p is the /?-cycle [d-lxd- 1] transverse monodromy matrix. As in (18.5) 
we have to assume hyperbolicity, i.e., that the magnitudes of all transverse eigen- 
values are bounded away from unity. 

Substitution (18.20), (18.21) in (18.16) leads to an expression for trX' as a 
sum over all prime cycles p and their repetitions 



I 



dte~ st tr £ { = V T p V , . (18.22) 

y fdm(i-M r p )\ 



The e — > 0 limit of the two expressions for the resolvent, (18. 15) and (18.22), now 
yields the classical trace formula for flows 

> = > T„) -. r . (18.23) 

Zj ^- V ^det(l-M;) 



exercise 18.1 



(If you are worried about the convergence of the resolvent sum, keep the e regu- 
larization.) 

This formula is still another example of the duality between the (local) cycles 
and (global) eigenvalues. If T p takes only integer values, we can replace e~ s — > z 
throughout, so the trace formula for maps (18.10) is a special case of the trace 
formula for flows. The relation between the continuous and discrete time cases 
can be summarized as follows: 



T p <-> Up 

e- s ~ z 

e m <-> £ n . (18.24) 



The beauty of trace formulas is that they are coordinate independent: the 
|det(l - M p )| = |det(l - M T "(x))\ and e BA " - e^ ATp(x) contributions to the cy- 
cle weight t p are both independent of the starting periodic point x € Mp. For 
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the Jacobian matrix M p this follows from the chain rule for derivatives, and for 
e PA P from the fact that the integral over e° A w is evaluated along a closed loop. In 
addition, as we have shown in sect. 5.3, |det(l - M p )| is invariant under smooth 
coordinate transformations. 

We could now proceed to estimate the location of the leading singularity of 
tr (s - 3K)~ X by extrapolating finite cycle length truncations of (18.23) by methods 
such as Pade approximants. However, it pays to first perform a simple resumma- 
tion which converts this divergence of a trace into a zero of a spectral determinant. 
We shall do this in sect. 19.2, but first a brief refresher of how all this relates to 
the formula for escape rate (1.8) offered in the introduction might help digest the 
material. 



fast track: 
sect. 1 9, p. 379 



18.3 An asymptotic trace formula 



In order to illuminate the manipulations of sect. 18.1 and relate them to 
something we already possess intuition about, we now rederive the heuristic sum 
of sect. 1.5.1 from the exact trace formula (18.10). The Laplace transforms (18.10) 
or (18.23) are designed to capture the time — > oo asymptotic behavior of the trace 
sums. By the hyperbolicity assumption (18.5), for t = T p r large the cycle weight 
approaches 

|det(l-M;)|^|A/, (18.25) 

where A p is the product of the expanding eigenvalues of M p . Denote the corre- 
sponding approximation to the nth trace (18.7) by 



and denote the approximate trace formula obtained by replacing the cycle weights 
|det(l - Mp)| by |A p | r in (18.10) by T(z). Equivalently, think of this as a replace- 
ment of the evolution operator (17.25) by a transfer operator (as in example 18.2). 
For concreteness consider a dynamical system whose symbolic dynamics is com- 
plete binary, for example the 3-disk system figure 1 .6. In this case distinct periodic 
points that contribute to the nth periodic points sum (18.8) are labeled by all ad- 
missible itineraries composed of sequences of letters $ e {0, 1}: 

n=l n=l XjeFixf" 
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' e P-A 0 ^ Ai\ ( e WAo gP-Aoi ^-A.o e 2p-At 

= Z<T-- + ---\+Z 2 <— -T + T—- + T— ~ + 



lAol IAiIJ llAol 2 IA01I IA10I |Ail 2 

( e 3J3-A 0 e /3-Aooi e /3-A 0l0 ^S-A m \ 

+z + (18 - 27) 

Both the cycle averages A,- and the stabilities A,- are the same for all points Xj e M p 
in a cycle p. Summing over repeats of all prime cycles we obtain 

T(z) = V , t p = z"^- A »/\A p \ . (18.28) 

L ty, 



This is precisely our initial heuristic estimate (1.9). Note that we could not per- 
form such sum over r in the exact trace formula (18.10) as |det(l - # 

|det(l - M p )| ; the correct way to resum the exact trace formulas is to first ex- 
pand the factors 1/|1 - A Pi /|, as we shall do in (19.9). 

If the weights e^ A "W are multiplicative along the flow, and the flow is hyper- 
bolic, for given ft the magnitude of each l^ '^'VA,! term is bounded by some 
constant M". The total number of cycles grows as 2" (or as e 1 '", h = topological 
entropy, in general), and the sum is convergent for z sufficiently small, \z\ < 1/2M. 
For large n the nth level sum (18.7) tends to the leading X" eigenvalue e" s °. Sum- 
ming this asymptotic estimate level by level 

(ze s <>T = -i— - (18.29) 
1 - ze i0 

n=l 

we see that we should be able to determine sq by determining the smallest value 
of z = e~ s ° for which the cycle expansion (18.28) diverges. 

If one is interested only in the leading eigenvalue of it suffices to consider 
the approximate trace T{z). We will use this fact in sect. 19.3 to motivate the 
introduction of dynamical zeta functions (19.14), and in sect. 19.5 we shall give 
the exact relation between the exact and the approximate trace formulas. 



section 19.2 



Resume 



The description of a chaotic dynamical system in terms of cycles can be visual- 
ized as a tessellation of the dynamical system, figure 18.1, with a smooth flow 
approximated by its periodic orbit skeleton, each region A\ centered on a peri- 
odic point x, of the topological length n, and the size of the region determined 
by the linearization of the flow around the periodic point. The integral over such 
topologically partitioned state space yields the classical trace formula 

1 V yi e r(P-A p -sT p ) 



a=Q S S " 



det(l -M;) 
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Now that we have a trace formula, one might ask: what is it good for? As it 
stands, it is a scary divergent formula which relates the unspeakable infinity of 
global eigenvalues to the unthinkable infinity of local unstable cycles. However, 
it is a good stepping stone on the way to construction of spectral determinants (to 
which we turn next), and a first hint that when the going is good, the theory might 
turn out to be convergent beyond our wildest dreams (chapter 23). In order to 
implement such formulas, we will have to determine "all" prime cycles. The first 
step is topological: enumeration of all admissible cycles undertaken in chapterl2. 
The more onerous enterprize of actually computing the cycles we first approach 
traditionally, as a numerical task in chapter 13, and then more boldly as a part and 
parcel of variational foundations of classical and quantum dynamics in chapter29. 



Remark 18.1 Who's dunne it? Continuous time flow traces weighted by cycle periods 
were inttoduced by Bowen [ I ] who treated them as Poincare section suspensions weighted 
by the "time ceiling" function (3.5). They were used by Parry and Pollicott [2]. 

Remark 18.2 Flat and sharp traces. In the above formal derivation of trace for- 
mulas we cared very little whether our sums were well posed. In the Fredholm theory 
traces like (18.14) require compact operators with continuous function kernels. This is 
not the case for our Dirac delta evolution operators: nevertheless, there is a large class 
of dynamical systems for which our results may be shown to be perfectly legal. In the 
mathematical literature expressions like (18.7) are called flat traces (see the review [4] 
and chapter 23). Other names for traces appear as well: for instance, in the context of 1- 
dimensional mappings, sharp traces refer to generalizations of ( 18.7) where conttibutions 
of periodic points are weighted by the Lefschetz sign ±1, reflecting whether the periodic 
point sits on a branch of nth iterate of the map which crosses the diagonal starting from 
below or starting from above [ 1 0]. Such traces are connected to the theory of kneading in- 
variants (see ref. [4] and references therein). Traces weighted by +1 sign of the derivative 
of the fixed point have been used to study the period doubling repeller, leading to high 
precision estimates of the Feigenbaum constant 6, refs. [5, 6, 6]. 



Commentary 



Exercises 



18.1. t — > 0+ regularization of eigenvalue sums**. In tak- 
ing the Laplace transform (18.23) we have ignored the 
t — > 0+ divergence, as we do not know how to regularize 
the delta function kernel in this limit. In the quantum 
(or heat kernel) case this limit gives rise to the Weyl 



or Thomas-Fermi mean eigenvalue spacing. Regularize 
the divergent sum in (18.23) and assign to such volume 
term some interesting role in the theory of classical res- 
onance spectra. E-mail the solution to the authors. 
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18.2. General weights, (easy) Let /' be a flow and £} the 
operator 

£'g(x) = J" dy5(x-f(y)) W (t,y)g(y) 

where w is a weight function. In this problem we will 
try and determine some of the properties w must satisfy. 

(a) Compute £. s £!g(x) to show that 

w(s, f'(x))w(t, x) = w(t + s, x) . 



(b) Restrict t and s to be integers and show that the 
most general form of w is 

w(n, x) = g(x)g(f(x))g(f 2 (x)) ■ ■ ■ g{f-\x)) , 

for some g that can be multiplied. Could g be a 
function from R" 1 i-> R" 2 ? (m e N.) 
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Chapter 19 



Spectral determinants 



"It seems very pretty," she said when she had finished it, 
"but it's rather hard to understand!" (You see she didn't 
like to confess, even to herself, that she couldn't make it 
out at all.) "Somehow it seems to fill my head with ideas 
— only I don't exactly know what they are!" 



— Lewis Carroll, Through the Looking Glass 




he problem with the trace formulas (18.10), (18.23) and (18.28) is that they 



diverge at z - e s °, respectively s - sq, i.e., precisely where one would 



like to use them. While this does not prevent numerical estimation of 
some "thermodynamic" averages for iterated mappings, in the case of the Gutz- 
willer trace formula this leads to a perplexing observation that crude estimates 
of the radius of convergence seem to put the entire physical spectrum out of 
reach. We shall now cure this problem by thinking, at no extra computational 
cost; while traces and determinants are formally equivalent, determinants are the 
tool of choice when it comes to computing spectra. Determinants tend to have 
larger analyticity domains because if tr-C/(l - Z-Q = —4- lndet (1 - z-Q diverges 
at a particular value of z, then det(l - z-Q might have an isolated zero there, and 
a zero of a function is easier to determine numerically than its poles. 



19.1 Spectral determinants for maps 

The eigenvalues Zk of a linear operator are given by the zeros of the determinant 



For finite matrices this is the characteristic determinant; for operators this is the 
Hadamard representation of the spectral determinant (sparing the reader from 




(19.1) 



k 
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pondering possible regularization factors). Consider first the case of maps, for 
which the evolution operator advances the densities by integer steps in time. In 
this case we can use the formal matrix identity 



exercise 4.1 



lndet 



oo 

(1 - M) = tr ln(l - M) = - Y -tr M" , 



(19.2) 



n=\ 



to relate the spectral determinant of an evolution operator for a map to its traces 
(18.8), and hence to periodic orbits: 



det(l -z£) 



exp 



exp 



^— ' n 

n 

oo 



p r=l 



det(l-M;)| 



(19.3) 



Going the other way, the trace formula (18.10) can be recovered from the 
spectral determinant by taking a derivative 



tr = -z4- In det (1 - z£) . 

1 - z-L dz 



(19.4) 



fast track: 

sect. 19.2, p. 381 



Example 19.1 Spectral determinants of transfer operators: 



For a piecewise-linear map (16.11) with a finite Markov partition, an explicit 
formula for the spectral determinant follows by substituting the trace formula (18.11) 
into (19.3): 



(19.5) 



where t s = z/\A s \. The eigenvalues are necessarily the same as in (18.12), which we 
already determined from the trace formula (18.10). 

The exponential spacing of eigenvalues guarantees that the spectral determin- 
ant (19.5) is an entire function. It is this property that generalizes to piecewise smooth 
flows with finite Markov partitions, and singles out spectral determinants rather than 
the trace formulas or dynamical zeta functions as the tool of choice for evaluation of 
spectra. 
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19.2 Spectral determinant for flows 



... an analogue of the [Artin-Mazur] zeta function for dif- 
feomorphisms seems quite remote for flows. However we 
will mention a wild idea in this direction. [■ • •] define l(y) 
to be the minimal period of y [■ • •] then define formally 
(another zeta function!) Z(s) to be the infinite product 

OO 

Z(s) = \ "] (l-[e X p /(y)]-*-*). 

yer k=0 

— Stephen Smale, Differentiable Dynamical Systems 

We write the formula for the spectral determinant for flows by analogy to 
(19.3) 



det (s - St) = exp 



2 2 r 

p r=l 



1 e r(J3-A p -sT p ) 



det(l-M£)| 



(19.6) 



and then check that the trace formula (18.23) is the logarithmic derivative of the 
spectral determinant 



1 d 
tr = — In det (s - Jl) . 

s - 3K ds 



(19.7) 



With z set to z = e~ s as in (18.24), the spectral determinant (19.6) has the same 
form for both maps and flows. We refer to (19.6) as spectral determinant, as the 
spectrum of the operator Si is given by the zeros of 



det(s-JV) = 0. 



(19.8) 



We now note that the r sum in (19.6) is close in form to the expansion of a 
logarithm. This observation enables us to recast the spectral determinant into an 
infinite product over periodic orbits as follows: 

Let M p be the />cycle [dxd] transverse Jacobian matrix, with eigenvalues 
A p j, A Pt 2, ■ ■ ., A pf /. Expanding the expanding eigenvalue factors 1/(1 - 1/Ap >e ) 
and the contracting eigenvalue factors 1/(1 - Ap >c ) in (18.4) as geometric series, 
substituting back into (19.6), and resumming the logarithms, we find that the spec- 
tral determinant is formally given by the infinite product 



det (s - JK) 



oo oo 1 

n-n^ 



• n 



A ,] ..A' 1 



p,e+l p,e+2 



•A 



p,d 



p,\ p,2 P' c 

1 



t p = t p (z,s,J3) = —e' 3 - A "~ sT "z n " 

|An| 



(19.9) 
(19.10) 
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In such formulas t p is a weight associated with the p cycle (letter t refers to the 
"local trace" evaluated along the p cycle trajectory), and the index p runs through 
all distinct prime cycles. Why the factor f? ? It is associated with the trace for- 
mula (18.10) for maps, whereas the factor e~ sT ? is specific to the continuous time 
trace formuls (18.23); according to (18.24) we should use either one or the other. 
But we have learned in sect. 3.1 that flows can be represented either by their 
continuous-time trajectories, or by their topological time Poincare section return 
maps. In cases when we have good control over the topology of the flow, it is 
often convenient to insert the t p factor into cycle weights, as a formal parame- 
ter which keeps track of the topological cycle lengths. These factors will assist 
us in expanding zeta functions and determinants, eventually we shall set z = 1- 
The subscripts e, c indicate that there are e expanding eigenvalues, and c contract- 
ing eigenvalues. The observable whose average we wish to compute contributes 
through the A\x) term in the p cycle multiplicative weight S' Ap . By its definition 
(17.1), the weight for maps is a product along the periodic points 



e A » 



n 

7=0 



and the weight for flows is an exponential of the integral (17.5) along the cycle 




This formula is correct for scalar weighting functions; more general matrix valued 
weights require a time-ordering prescription as in the Jacobian matrix of sect.4. 1 . 



Example 19.2 Expanding 1 -dimensional map: For expanding 1 -dimensional 

mappings the spectral determinant (19.9) takes the form 



det(l-z£) = [ ~](l-t p /A k p ), t p = e —z"». (19.11) 
p k=o 1 P l 



Example 19.3 Two-degree of freedom Hamiltonian flows: For a 2-degree of free- 
dom Hamiltonian flows the energy conservation eliminates on phase-space variable, 
and restriction to a Poincare section eliminates the marginal longitudinal eigenvalue 
A - I, so a periodic orbit of 2-degree of freedom hyperbolic Hamiltonian flow has one 
expanding transverse eigenvalue A, |A| > 1, and one contracting transverse eigenvalue 
1 /A. The weight in (18.4) is expanded as follows: 

1 1 1 ^ k + l 

TV ) — r- ■ (19.12) 



|det(l-M;)| IM'd-l/Ap 2 w r U A*'' 
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The spectral determinant exponent can be resummed, 

r det(l-M;) fej |A,|A*J 



and fhe spectral determinant for a 2-dimensional hyperbolic Hamiltonian flow rewritten 
as an infinite product over prime cycles 



det(s-St) = l 1( 1_ ^ /A *) + ■ (19 - 13) 

p k=0 

exercise 23.4 



Now we are finally poised to deal with the problem posed at the beginning of 
chapter 18; how do we actually evaluate the averages introduced in sect. 17.1? The 
eigenvalues of the dynamical averaging evolution operator are given by the values 
of s for which the spectral determinant (19.6) of the evolution operator (17.25) 
vanishes. If we can compute the leading eigenvalue ^(/3) and its derivatives, we 
are done. Unfortunately, the infinite product formula (19.9) is no more than a 
shorthand notation for the periodic orbit weights contributing to the spectral det- 
erminant; more work will be needed to bring such formulas into a tractable form. 
This shall be accomplished in chapter 20, but here it is natural to introduce still 
another variant of a determinant, the dynamical zeta function. 



19.3 Dynamical zeta functions 



It follows from sect. 18.1.1 that if one is interested only in the leading eigenvalue 
of £,' the size of the p cycle neighborhood can be approximated by l/|Ap| r , the 
dominant term in the rT p = t — > oo limit, where A p = T\ e A p ^ e is the product of 
the expanding eigenvalues of the Jacobian matrix M p . With this replacement the 
spectral determinant (19.6) is replaced by the dynamical zeta function 



l/£ = exp 



p r=\ 



-C 



(19.14) 



that we have already derived heuristically in sect. 1.5.2. Resumming the loga- 
rithms using Y>r t'p/ r - ~ m (l _ tp) we obtain the Euler product representation of 
the dynamical zeta function: 



iK = \\(i-t p ). 



(19.15) 



In order to simplify the notation, we usually omit the explicit dependence of 1 /£, 
t p on z, s, fi whenever the dependence is clear from the context. 
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The approximate trace formula (18.28) plays the same role vis-a-vis the dyn- 
amical zeta function (19.7) 

r ( ,) = -f Inr^y-r^, (19-16) 
as ' 1 - 1„ 



as the exact trace formula (18.23) plays vis-a-vis the spectral determinant (19.6). 
The heuristically derived dynamical zeta function of sect. 1.5.2 now re-emerges 
as the 1 /£o-o(z) P art of the exact spectral determinant; other factors in the infinite 
product (19.9) affect the non-leading eigenvalues of X- 

In summary, the dynamical zeta function (19.15) associated with the flow f(x) 
is defined as the product over all prime cycles p. The quantities, T p , n p and 
A p , denote the period, topological length and product of the expanding Floquet 
multipliers of prime cycle p, A p is the integrated observable a(x) evaluated on a 
single traversal of cycle p (see (17.5)), s is a variable dual to the time t, z is dual 
to the discrete "topological" time n, and t p (z, s,fi) denotes the local trace over the 
cycle p. We have included the factor f'> in the definition of the cycle weight in 
order to keep track of the number of times a cycle traverses the surface of section. 
The dynamical zeta function is useful because the term 

l/£Cr) = 0 (19.17) 

when s = sq, Here so is the leading eigenvalue of £ = e m , which is often all 
that is necessary for application of this equation. The above argument completes 
our derivation of the trace and determinant formulas for classical chaotic flows. 
In chapters that follow we shall make these formulas tangible by working out a 
series of simple examples. 

The remainder of this chapter offers examples of zeta functions. 



fast track: 
chapter 20, p. 395 



19.3.1 A contour integral formulation 



The following observation is sometimes useful, in particular for zeta func- 
tions with richer analytic structure than just zeros and poles, as in the case of 
intermittency (chapter 24): r„, the trace sum (18.26), can be expressed in terms 
of the dynamical zeta function (19.15) 



'"^ni'-jja) ■ <19 - 18) 
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Figure 19.1: The survival probability r„ can be split 
into contributions from poles (x) and zeros (o) between 
the small and the large circle and a contribution from 
the large circle. 




as a contour integral 



r„ = <£ z- n I log r\z))dz , (19.19) 
2m J y - \dz J 

where a small contour y~ encircles the origin in negative (clockwise) direction. 
If the contour is small enough, i.e., it lies inside the unit circle \z\ = 1, we may 
write the logarithmic derivative of £~ l {z) as a convergent sum over all periodic 
orbits. Integrals and sums can be interchanged, the integrals can be solved term 
by term, and the trace formula (18.26) is recovered. For hyperbolic maps, cycle 
expansions or other techniques provide an analytical continuation of the dynam- 
ical zeta function beyond the leading zero; we may therefore deform the original 
contour into a larger circle with radius R which encircles both poles and zeros of 
£ _1 (z)> as depicted in figure 19.1. Residue calculus turns this into a sum over the 
zeros z a and poles zp of the dynamical zeta function, that is 



exercise 19.7 



poles 

\z a \<R \zp\<R P ""^V* 



r ' = 2.2-.?.H + 5£*'*Z^ <19 - 20) 



chapter 20 



where the last term gives a contribution from a large circle y R . It would be a 
miracle if you still remember this, but in sect. 1.4.3 we interpreted T n as fraction 
of survivors after n bounces, and defined the escape rate y as the rate of the find 
exponential decay of T n . We now see that this exponential decay is dominated by 
the leading zero or pole of g~ l (z). 



19.3.2 Dynamical zeta functions for transfer operators 



Ruelle's original dynamical zeta function was a generalization of the topo- 
logical zeta function (15.27) to a function that assigns different weights to different 

chapter 15 

cycles: 



n-l 



az) = exp ^ Z - tr ft SifKxd) 



a,eFix/" ;'=0 
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exercise 18.2 

Here we sum over all periodic points xt of period n, and g(x) is any (matrix val- 
ued) weighting function, where the weight evaluated multiplicatively along the 
trajectory of Xj. 

By the chain rule (4.42) the stability of any «-cycle of a 1 -dimensional map 
is given by A p = I~[" = i f'( x i)> so the 1-dimensional map cycle stability is the 
simplest example of a multiplicative cycle weight g(x-) = l/l/'fe)!, and indeed - 
via the Perron-Frobenius evolution operator (16.9) - the historical motivation for 
Ruelle's more abstract construction. 

In particular, for a piecewise-linear map with a finite Markov partition such as 
the map of example 16.1, the dynamical zeta function is given by a finite polyno- 
mial, a straightforward generalization of the topological transition matrix deter- 
minant (14.1). As explained in sect. 15.3, for a finite [Ax A] dimensional matrix 
the determinant is given by 



N 

p n=\ 



where c n is given by the sum over all non-self-intersecting closed paths of length 
n together with products of all non-intersecting closed paths of total length n. 



Example 19.4 A piecewise linear repeller: Due to piecewise linearity, the stability 
of any n-cycle of the piecewise linear repeller (16. 11) factorizes as A S| = A™ A"~ m , 
where m is the total number of times the letter Sj = 0 appears in the p symbol sequence, 
so the traces in the sum (18.28) take the particularly simple form 



JAol I A, 

The dynamical zeta function (19.14) evaluated by resumming the traces, 

0x©rcis© l y.o 

l/f(z)= 1 - z/|Aq! - z/lAa) , (19.21) 

is indeed the determinant det (1 - zT) of the transfer operator ( 16. 13), which is almost 
as simple as the topological zeta function ( 15.34). 



chapter 15 



More generally, piecewise-linear approximations to dynamical systems yield 
polynomial or rational polynomial cycle expansions, provided that the symbolic 
dynamics is a subshift of finite type. 

We see that the exponential proliferation of cycles so dreaded by quantum 
chaologians is a bogus anxiety; we are dealing with exponentially many cycles of 
increasing length and instability, but all that really matters in this example are the 
stabilities of the two fixed points. Clearly the information earned by the infinity 
of longer cycles is highly redundant; we shall learn in chapter 20 how to exploit 
this redundancy systematically. 
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19.4 False zeros 



Compare (19.21) with the Euler product (19.15). For simplicity consider two 
equal scales, |Ao| = |Ai| = e A . Our task is to determine the leading zero z = e 7 
of the Euler product. It is a novice error to assume that the infinite Euler product 
(19. 15) vanishes whenever one of its factors vanishes. If that were true, each factor 
(1 -z""/|A P l) would yield 

0 = 1 - e n " (19.22) 

so the escape rate y would equal the Floquet exponent of a repulsive cycle, one 
eigenvalue y - y p for each prime cycle p. This is false! The exponentially 
growing number of cycles with growing period conspires to shift the zeros of the 
infinite product. The correct formula follows from (19.21) 

0=l-e y ' A+h , h = In 2. (19.23) 

This particular formula for the escape rate is a special case of a general relation 
between escape rates, Lyapunov exponents and entropies that is not yet included 
into this book. Physically this means that the escape induced by the repulsion 
by each unstable fixed point is diminished by the rate of backscatter from other 
repelling regions, i.e., the entropy h; the positive entropy of orbits shifts the "false 
zeros" z = e Ap of the Euler product (19.15) to the true zero z = e A ~ h . 



19.5 Spectral determinants vs. dynamical zeta functions 

In sect. 19.3 we derived the dynamical zeta function as an approximation to the 
spectral determinant. Here we relate dynamical zeta functions to spectral deter- 
minants exactly, by showing that a dynamical zeta function can be expressed as a 
ratio of products of spectral determinants. 

The elementary identity for J-dimensional matrices 
1 ^ 

1 = V (-l)*tr (a*m) , (19.24) 

det(l-M)^ v ' V /' v 

inserted into the exponential representation (19.14) of the dynamical zeta func- 
tion, relates the dynamical zeta function to weighted spectral determinants. 



Example 19.5 Dynamical zeta function in terms of determinants, 1 -dimensional 
maps: For 1 -dimensional maps the identity 

1 1 1 1 



(1-1/A) AQ-l/A) 
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Figure 19.2: A game of pinball consisting of two disks 
of equal size in a plane, with its only periodic orbit (A. 
Wirzba). 




<_ R _> 



substituted into (19.14) yields an expression for the dynamical zeta function for 1- 
dimensional maps as a ratio of two spectral determinants 

V^ ^Tf, (19.25) 
det(l -z£ w ) 

where the cycle weight in X. w is given by replacement t p — > t p /A p . As we shall see 
in chapter 23, this establishes that for nice hyperbolic flows 1 is meromorphic, with 
poles given by the zeros of det (1 - zJUy>)- The dynamical zeta function and the spectral 
determinant have the same zeros, although in exceptional circumstances some zeros 
of det (1 -z£(\)) might be cancelled by coincident zeros of det (1 - zZ(\)). Hence even 
though we have derived the dynamical zeta function in sect. 19.3 as an "approximation" 
to the spectral determinant, the two contain the same spectral information. 



Example 19.6 Dynamical zeta function in terms of determinants, 2-dimensional 
Hamiltonian maps: For 2-dimensional Hamiltonian flows the above identity yields 

-(1-2/A+l/A 2 ), 



so 



|A| |A|(1 - 1/A) 2 



det(l -z£)det(l -z£ (2) ) 

m= . tn r : — • (19.26) 

det(l -z£(i)) 

This establishes that for nice 2-dimensional hyperbolic flows the dynamical zeta func- 
tion is meromorphic. 



Example 19.7 Dynamical zeta functions for 2-dimensional Hamiltonian flows: The 

relation (19.26) is not particularly useful for our purposes. Instead we insert the identity 

1 2 1 11 
1 = ~ r— ~ - -r~. r— r + 



(1-1/A) 2 A(l-1/A) 2 A 2 (l-1/A) 2 

into the exponential representation ( 19. 14) of 1/&, and obtain 

det(l - zX«))det(l - zLtk^vi) 
det(l-z£( k+ i))- 

Even though we have no guarantee that det (1 - z£(k)) are entire, we do know that the 
upper bound on the leading zeros of det (1 -z£(k+\)) lies strictly below the leading zeros 
of det (1 - z£(k)), and therefore we expect that for 2-dimensional Hamiltonian flows the 
dynamical zeta function 1/& generically has a double leading pole coinciding with the 
leading zero of the det (1 - z£(k+ij) spectral determinant. This might fail if the poles and 
leading eigenvalues come in wrong order, but we have not encountered such situations 
in our numerical investigations. This result can also be stated as follows: the theorem 
establishes that the spectral determinant (19.13) is entire, and also implies that the 
poles in 1/& must have the right multiplicities to cancel in the det(l - z-Q = H 
product. 
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Figure 19.3: The classical resonances a 
(19.28) for a 2-disk game of pinball. 
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19.6 All too many eigenvalues? 

What does the 2-dimensional hyperbolic Hamiltonian flow spectral determinant 
(19.13) tell us? Consider one of the simplest conceivable hyperbolic flows: the 
game of pinball of figure 19.2 consisting of two disks of equal size in a plane. 
There is only one periodic orbit, with the period T and expanding eigenvalue 
A given by elementary considerations (see exercise 13.7), and the resonances 
det (s a - Jl) - 0, a = {k, n) plotted in figure 19.3: 

2m 

s a = -(k + I) A + n— , n € Z , k € Z + , multiplicity k+l, (19.28) 




can be read off the spectral determinant (19.13) for a single unstable cycle: 



det(j-^) = Y\(l - e' sT /\A\A k f +l . (19.29) 

k=0 

In the above A = In \A\/T is the cycle Lyapunov exponent. For an open system, 
the real part of the eigenvalue s a gives the decay rate of ath eigenstate, and the 
imaginary part gives the "node number" of the eigenstate. The negative real part 
of s a indicates that the resonance is unstable, and the decay rate in this simple 
case (zero entropy) equals the cycle Lyapunov exponent. 

Rapidly decaying eigenstates with large negative Re are not a problem, but 
as there are eigenvalues arbitrarily far in the imaginary direction, this might seem 
like all too many eigenvalues. However, they are necessary - we can check this by 
explicit computation of the right hand side of (18.23), the trace formula for flows: 



oo oo 

-(k+\)At+i2nnt/T 



a=0 k=0 n=-oo 

yt/T oo 

Jlimt/T 



2> sv = Z2> +1 > 

k=0 n=-oo 

OO / x[/ i OO 

k=0 x ' n=-o- 

oo j 1 oo 

ylll y 6{r - t/T) 

Zj \A\r\kr Zj 

i'——nn 



k=0 

T f 6{t ~ rT) - , (19.30) 
^ IM'Yl - 1/AO 2 



det - 19apr2005 



ChaosBook.org version!4.4.1, Apr 21 2013 



CHAPTER 19. SPECTRAL DETERMINANTS 



390 



Hence, the two sides of the trace formula (18.23) are verified. The formula is fine 
for t > 0; for t —> 0 + , however, sides are divergent and need regularization. 

The reason why such sums do not occur for maps is that for discrete time we 
work with the variable z = e? , so an infinite strip along Im s maps into an annulus 
in the complex z plane, and the Dirac delta sum in the above is replaced by the 
Kronecker delta sum in (18.8). In the case at hand there is only one time scale 
T, and we could just as well replace s by the variable z = e~ sT . In general, a 
continuous time flow has an infinity of irrationally related cycle periods, and the 
resonance arrays are more irregular, cf. figure 20. 1 . 

Resume 

The eigenvalues of evolution operators are given by the zeros of corresponding 
determinants, and one way to evaluate determinants is to expand them in terms 
of traces, using the matrix identity log det = tr log. Traces of evolution operators 
can be evaluated as integrals over Dirac delta functions, and in this way the spectra 
of evolution operators are related to periodic orbits. The spectral problem is now 
recast into a problem of determining zeros of either the spectral determinant 



The spectral determinant is the tool of choice in actual calculations, as it has 
superior convergence properties (this will be discussed in chapter 23 and is il- 
lustrated, for example, by table 20.2). In practice both spectral determinants and 
dynamical zeta functions are preferable to trace formulas because they yield the 
eigenvalues more readily; the main difference is that while a trace diverges at an 
eigenvalue and requires extrapolation methods, determinants vanish at s corre- 
sponding to an eigenvalue s a , and are analytic in s in an open neighborhood of 



The critical step in the derivation of the periodic orbit formulas for spectral 
determinants and dynamical zeta functions is the hyperbolicity assumption (18.5) 
that no cycle stability eigenvalue is marginal, lAp,,-! ± 1. By dropping the prefac- 
tors in (1.5), we have given up on any possibility of recovering the precise distri- 
bution of the initial x (return to the past is rendered moot by the chaotic mixing 
and the exponential growth of errors), but in exchange we gain an effective de- 
scription of the asymptotic behavior of the system. The pleasant surprise (to be 
demonstrated in chapter 20) is that the infinite time behavior of an unstable system 
turns out to be as easy to determine as its short time behavior. 




or the leading zeros of the dynamical zeta function 




p 
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Commentary 

Remark 19.1 Piecewise monotone maps. A partial list of cases for which the 
transfer operator is well defined: the expanding Holder case, weighted subshifts of finite 
type, expanding differentiable case, see Bowen [28]: expanding holomorphic case, see 
Ruelle [9]; piecewise monotone maps of the interval, see Hofbauer and Keller [ 14] and 
Baladi and Keller [17]. 



Remark 19.2 Smale's wild idea. Smale's wild idea quoted on page 381 was tech- 
nically wrong because 1) the Selberg zeta function yields the spectrum of a quantum 
mechanical Laplacian rather than the classical resonances, 2) the spectral determinant 
weights are different from what Smale conjectured, as the individual cycle weights also 
depend on the stability of the cycle, 3) the formula is not dimensionally correct, as k is 
an integer and s represents inverse time. Only for spaces of constant negative curvature 
do all cycles have the same Lyapunov exponent A = \n\K p \IT p . In this case, one can 
normalize time so that A = 1, and the factors e~ sT f lh p in (19.9) simplify to s^ s+k)T ', as 
intuited in Smale's quote on page 381 (where l(y) is the cycle period denoted here by T p ). 
Nevertheless, Smale's intuition was remarkably on the target. 

Remark 19.3 Is this a generalization of the Fourier analysis? Fourier analysis 
is a theory of the space <-» eigenfunction duality for dynamics on a circle. The way in 
which periodic orbit theory generalizes Fourier analysis to nonlinear flows is discussed in 
ref. [3], a very readable introduction to the Selberg Zeta function. 

Remark 19.4 Zeta functions, antecedents. For a function to be deserving of 
the appellation "zeta function," one expects it to have an Euler product representation 
(19.15), and perhaps also satisfy a functional equation. Various kinds of zeta functions 
are reviewed in refs. [6, 7, 8]. Historical antecedents of the dynamical zeta function are 
the fixed-point counting functions introduced by Weil [9], Lefschetz [10] and Artin and 
Mazur [11], and the determinants of transfer operators of statistical mechanics [29]. 

In his review article Smale [27] already intuited, by analogy to the Selberg Zeta func- 
tion, that the spectral determinant is the right generalization for continuous time flows. 
In dynamical systems theory, dynamical zeta functions arise naturally only for piecewise 
linear mappings; for smooth flows the natural object for the study of classical and quantal 
spectra are the spectral determinants. Ruelle derived the relation ( 19.3) between spectral 
determinants and dynamical zeta functions, but since he was motivated by the Artin- 
Mazur zeta function (15.27) and the statistical mechanics analogy, he did not consider the 
spectral determinant to be a more natural object than the dynamical zeta function. This 
has been put right in papers on "flat traces" [18, 23]. 

The nomenclature has not settled down yet; what we call evolution operators here is 
elsewhere called transfer operators [32], Perron-Frobenius operators [4] and/or Ruelle- 
Araki operators. 

Here we refer to kernels such as (17.25) as evolution operators. We follow Ruelle in 
usage of the term "dynamical zeta function," but elsewhere in the literature the function 
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(19.15) is often called the Ruelle zeta function. Ruelle [33] points out that the corre- 
sponding transfer operator T was never considered by either Perron or Frobenius; a more 
appropriate designation would be the Ruelle-Araki operator. Determinants similar to 
or identical with our spectral determinants are sometimes called Selberg Zetas, Selberg- 
Smale zetas [8] (Gaspard credits Smale [13] - but why, dynamical spectral determinant 
was not derived by either? and we cannot find any other zeta for the flows but the "wild" 
one quoted on page 381), functional determinants, Fredholm determinants, or even - to 
maximize confusion - dynamical zeta functions [12], A Fredholm determinant is a notion 
that applies only to trace class operators - as we consider here a somewhat wider class of 
operators, we prefer to refer to their determinants loosely as "spectral determinants." 



Exercises 



19.1. Escape rate for a 1 -dimensional repeller, numerically. 



Consider the quadratic map 
f{x)=Ax{\-x) 



(19.31) 



on the unit interval. The trajectory of a point starting 
in the unit interval either stays in the interval forever or 
after some iterate leaves the interval and diverges to mi- 
nus infinity. Estimate numerically the escape rate (22.8), 
the rate of exponential decay of the measure of points 
remaining in the unit interval, for either A = 9/2 or 
A — 6. Remember to compare your numerical estimate 
with the solution of the continuation of this exercise, ex- 
ercise 20.2. 

19.2. Spectrum of the "golden mean" pruned map. 

(medium - exercise 15.7 continued) 

(a) Determine an expression for trX", the trace of 
powers of the Perron-Frobenius operator (16.10) 
acting on the space of real analytic functions for 
the tent map of exercise 15.7. 

(b) Show that the spectral determinant for the Perron- 
Frobenius operator is 



det(l-zX) = 



(19.32) 



k even 
X 



n( 1 + y^ + A5s) 

k odd ' ' 



(a) Evaluate in closed form the dynamical zeta func- 
tion 



for the piecewise-linear map ( 16.1 1) with the left 
branch slope Aq, the right branch slope A i . 



f(x) 





f(x) 



Ai 






/ s oo 




s io\ 



19.3. Dynamical zeta functions, (easy) 



(b) What if there are four different slopes \oo. Soi. SiO; 
and in instead of just two, with the preimages 
of the gap adjusted so that junctions of branches 
soo> *oi an d siii -Sio ma P m the gap in one iteration? 
What would the dynamical zeta function be? 

19.4. Dynamical zeta functions from transition graphs. 

Extend sect. 15.3 to evaluation of dynamical zeta func- 
tions for piecewise linear maps with finite transition 
graphs. This generalizes the results of exercise 19.3. 

19.5. Zeros of infinite products. Determination of the 
quantities of interest by periodic orbits involves work- 
ing with infinite product formulas. 
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(a) Consider the infinite product 



F(z) = Y\(l+fk(z)) 



for a 2-dimensional Hamiltonian map. Using the equal- 
ity 



19.10. 



fc=0 

where the functions /* are "sufficiently nice." This 
infinite product can be converted into an infinite 
sum by the use of a logarithm. Use the properties 
of infinite sums to develop a sensible definition of 
infinite products. 

(b) If z* is a root of the function F, show that the infi- 
nite product diverges when evaluated at z*. 

(c) How does one compute a root of a function repre- 
sented as an infinite product? 

(d) Let p be all prime cycles of the binary alphabet 
{0, 1). Apply your definition of F(z) to the infinite 
product 

n z"" 
p 

(e) Are the roots of the factors in the above product 
the zeros of F(z)? 

(Per Rosenqvist) 

19.6. Dynamical zeta functions as ratios of spectral determinants. 

(medium) Show that the zeta function 



1 



1 



r (l-2/A+l/A 2 ) : 



(1-1/A) 2 
show that 

HC = det(l - X)det(l - X (2 ,)/det(l - £ (1) ) 2 . 

In this expression det(l— Z-C(k)) is the expansion one gets 
by replacing t p — > t p IA k p in the spectral determinant. 

Riemann f function. The Riemann f function is 
defined as the sum 



oo 



s € 



(a) Use factorization into primes to derive the Euler 
product representation 



l/£(z) = exp 
can be written as the ratio 



7T 1 -— 



l/az) = det(l-zX (0 ))/det(l -z£ m ), 

where det (1 - z-C (s) ) = U P YltoV ~ Z n ? /\A p \Af*). 

19.7. Contour integral for survival probability. Perform 
explicitly the contour integral appearing in ( 19.19). 

19.8. Dynamical zeta function for maps. In this prob- 
lem we will compare the dynamical zeta function and 
the spectral determinant. Compute the exact dynamical 
zeta function for the skew full tent map ( 16.46) 



The dynamical zeta function exercise 19.15 is 
called a "zeta" function because it shares the form 
of the Euler product representation with the Rie- 
mann zeta function. 

(b) (Not trivial:) For which complex values of s is the 
Riemann zeta sum convergent? 

(c) Are the zeros of the terms in the product, s = 
- In p, also the zeros of the Riemann f function? 
If not, why not? 



19.11. Finite truncations. (easy) Suppose we have a 1- 
dimensional system with complete binary dynamics, 
where the stability of each orbit is given by a simple 
multiplicative rule: 



A, 



A»p.O A «p,l 

yv 0 /v l 



n p,o — #0 in p , n Pi i = #1 in p , 



so that, for example, A 



00101 



A 3 A 2 



i/«z)=nf i 



peP 



Z"» 

|A^j 



What are its roots? Do they agree with those computed 
in exercise 16.7? 

19.9. Dynamical zeta functions for Hamiltonian maps. 

Starting from 



(a) Compute the dynamical zeta function for this sys- 
tem; perhaps by creating a transfer matrix analo- 
gous to (16.13), with the right weights. 

(b) Compute the finite p truncations of the cycle ex- 
pansion, i.e. take the product only over the p up to 
given length with n p < N, and expand as a series 
in z 

z"" 
\A P \ 



= exp 



oo 



p r=l 



Do they agree? If not, how does the disagreement 
depend on the truncation length A? 
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Chapter 20 



Cycle expansions 



Recycle... It's the Law! 

— Poster, New York City Department of Sanitation 




he Euler product representations of spectral determinants (19.9) and dyn- 



amical zeta functions (19.15) are really only a shorthand notation - the ze- 



ros of the individual factors are not the zeros of the zeta function, and the 
convergence of these objects is far from obvious. Now we shall give meaning to 
dynamical zeta functions and spectral determinants by expanding them as cycle 
expansions, which are series representations ordered by increasing topological cy- 
cle length, with products in (19.9), (19.15) expanded as sums over pseudo-cycles, 
products of weights t p of contributing cycles. The zeros of correctly truncated 
cycle expansions yield the desired leading eigenvalues of evolution operators, and 
the expectation values of observables are given by the cycle averaging formulas 
obtained from the partial derivatives of dynamical zeta functions (or spectral det- 
erminants). 

For reasons of pedagogy in what follows everything is first explained in terms 
of dynamical zeta functions: they aid us in developing 'shadowing' intuition about 
the geometrical meaning of cycle expansions. For actual calculations, we recom- 
mend the spectral determinant cycle expansions of sects. 20.2.2 and 20.4.2. While 
the shadowing is less transparent, and the weights calculation is an iterative nu- 
merical algorithm, these expansions use full analytic information about the flow, 
and can have better convergence properties than the dynamical zeta functions. For 
example, as we shall show in chapter 23, even when a spectral determinant (19.6) 
is entire and calculations are super-exponentially convergent, cycle expansion of 
the corresponding dynamical zeta function (19.25) has a finite radius of conver- 
gence and captures only the leading eigenvalue, at exponentially convergent rate. 
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20.1 Pseudocycles and shadowing 



How are periodic orbit formulas such as (19. 15) evaluated? We start by computing 
the lengths and Floquet multipliers of the shortest cycles. This always requires 
numerical work, such as searches for periodic solutions via Newton's method; 
we shall assume for the purpose of this discussion that the numerics is under 
control, and that all short cycles up to a given (topological) length have been 
found. Examples of the data required for application of periodic orbit formulas 
are the lists of cycles given in exercise 13.14 and table 29.3. Sadly, it is not 
enough to set a computer to blindly troll for invariant solutions, and blithely feed 
those into the formulas that will be given here. The reason that this chapter is 
numbered 20 and not 6, is that understanding the geometry of the non-wandering 
set is a prerequisite to good estimation of dynamical averages: one has to identify 
cycles that belong to a given ergodic component (whose symbolic dynamics and 
shadowing is organized by its transition graph), and discard the isolated cycles 
and equilibria that do not take part in the asymptotic dynamics. It is important 
not to miss any short cycles, as the calculation is as accurate as the shortest cycle 
dropped - including cycles longer than the shortest omitted does not improve the 
accuracy (more precisely, the calculation improves, but so little as not to be worth 
while). 

Given a set of periodic orbits, we can compute their weights t p and expand the 
dynamical zeta function (19.15) as a formal power series, 

i/t=Y\(i-t p ) = i- y,' {-v> k+ \M---hk (2o.i) 

P \PiP2-Pk) 

where the prime on the sum indicates that the sum is over all distinct non-repeating 
combinations of prime cycles. As we shall frequently use such sums, let us denote 
by t n = {—Y) t Pl t P2 ■ ■ - hk an element of the set of all distinct products of the 
prime cycle weights t p . The formal power series (20.1) is now compactly written 

as 

l/f=l-£V (20.2) 



For k > 1, the signed products ^ are weights of pseudo-cycles; they are sequences 
of shorter cycles that shadow a cycle with the symbol sequence piP2 - ■ ■ Pk along 
the segments p\, p2, . . ., Pk, as in figure 1.12. The symbol YI denotes the re- 
stricted sum, for which any given prime cycle p contributes at most once to a 
given pseudo-cycle weight t^. 

The pseudo-cycle weight, i.e., the product of weights (19.10) of prime cycles 
comprising the pseudo-cycle, 

t„ = (- 1 )*+ 1 — (fiA n -sT n z n n (20 3) 

I Awl 
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depends on the pseudo-cycle integrated observable \, the period T„, the stability 



A^ — Ap 2 • • • Ap^ , T n — T pi + . . . + Tp k 

A n = A pi + . . . + A Pk , n n = n pi + ... + n Pk , (20.4) 



remark 5.1 



and, when available, the topological length r^. 



20.1.1 Curvature expansions 

The simplest example is the pseudo-cycle sum for a system described by a com- 
plete binary symbolic dynamics. In this case the Euler product (19.15) is given 
by 

= (l-*o)(l-*i)(l-'oi)(l-*ooi)(l-*oii) (20.5) 
x (1 - foooi)(l _ f 001l)(l _ ?01 1 1 )(1 - fooooi)(l _ fooon) 
x (1 - ?ooioi)(l - fooinXl - foion)(l - foini) ■ • ■ 

(see table 15.1), and the first few terms of the expansion (20.2) ordered by increas- 
ing total pseudo-cycle length are: 

1/<T = 1 - /o - /i - kn - tooi - ton - ?oooi - toon - torn - • • • 
+ toh + totoi + toit\ + totooi + toton + tooiti + tont\ 
-totoih-... (20.6) 

We refer to such series representation of a dynamical zeta function or a spectral 
determinant, expanded as a sum over pseudo-cycles, and ordered by increasing 
cycle length and instability, as a cycle expansion. 

The next step is the key step: regroup the terms into the dominant fundamental 
contributions tf and the decreasing curvature corrections c„, each c n split into 
prime cycles p of length n p -n grouped together with pseudo-cycles whose full 
itineraries build up the itinerary of p. For the binary case this regrouping is given 
by 

l/£ = 1 - 1 0 - h - i(toi - hto)] - [(tooi - toito) + (ton - toih)] 
-[(?oooi - totooi) + (?om _ tont\) 

+(?oon - tooiti - toton + totoiti)] - ■■■ 

= i-Z^-Z^- (20 - 7) 
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All terms in this expansion up to length n p - 6 are given in table 20. 1 . We refer to 
such regrouped series as curvature expansions, because the shadowed combina- 
tions [• • •] vanish identically for piecewise-linear maps with nice partitions, such 
as the 'full tent map' of figure 16.3. 

This separation into 'fundamental' and 'curvature' parts of cycle expansions 
is possible only for dynamical systems whose symbolic dynamics has finite gram- 
mar. The fundamental cycles to, t\ have no shorter approximations; they are the 
"building blocks" of the dynamics in the sense that all longer orbits can be approx- 
imately pieced together from them. The fundamental part of a cycle expansion is 
given by the sum of the products of all non-intersecting loops of the associated 
transition graph, discussed in chapter 14. The terms grouped in brackets [• • •] are sectjon 15 3 
the curvature corrections; the terms grouped in parentheses (• • •) are combinations sect j on 20 5 
of longer cycles and corresponding sequences of "shadowing" pseudo-cycles, as 
in figure 1.12. If all orbits are weighted equally (t p = z" p ), such combinations 
cancel exactly, and the dynamical zeta function reduces to the topological poly- 
nomial (15.27). If the flow is continuous and smooth, orbits of similar symbolic 
dynamics will traverse the same neighborhoods and will have similar weights, and 
the weights in such combinations will almost cancel. The utility of cycle expan- 
sions of dynamical zeta functions and spectral determinants, in contrast to naive 
averages over periodic orbits such as the trace formulas discussed in sect. 22.4, 
lies precisely in this organization into nearly canceling combinations: cycle ex- 
pansions are dominated by short cycles, with longer cycles giving exponentially 
decaying corrections. 

More often than not, good symbolic dynamics for a given flow is either not 
available, or its grammar is not finite, or the convergence of cycle expansions 
is affected by non-hyperbolic regions of state space. In those cases truncations 
such as the stability cutoff of sect. 20.6 and sect. 24.3 .4 might be helpful. The 
idea is to truncate the cycle expansion by including only the pseudo-cycles such 
that \A pi ■ ■ ■ A PA | < A max , with the cutoff A max equal to or greater than the most 
unstable A p in the data set. 

In what follows, we shall introduce two cycle averaging formulas, one based 
on dynamical zeta functions and the other on spectral determinants. (Frequently 
used, but inferior 'level sums' shall be discussed in sect. 22.4.) 



20.2 Construction of cycle expansions 

Due to the lack of factorization of the determinant in the denominator of the full 
pseudo-cycle weight in (18.23), 

det(l - M PlP2 ) t det(l - M Pl )det(l - M P2 ) , 

the cycle expansions for the spectral determinant (19.9) are somewhat less trans- 
parent than is the case for the dynamical zeta functions, so we postpone their 
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Table 20.1: The binary curvature expansion (20.7) up to length 6, listed in such a way that 
the sum of terms along the pth horizontal line is the curvature c p associated with a prime 
cycle p, or a combination of prime cycles such as the 1 100101 + '100110 pair. 
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evaluation to sect. 20.2.2. Sect. 20.2.1 is a pedagogical warmup. In actual calcu- 
lations, implementing the spectral determinant cycle expansions of sect.20.2.2 is 
recommended. Correct objects are spectral determinants, and as using the correct 
object costs exactly the same as using the approximations, why settle for less? 



20.2.1 Evaluation of dynamical zeta functions 

Cycle expansions of dynamical zeta functions are evaluated numerically by first 
computing the weights t p = t p (J3, s) of all prime cycles p of topological length 
n p < N, for given fixed f5 and s. Denote by the subscript (/) the zth prime cycle 
computed, ordered by the topological length < n^+iy The dynamical zeta 
function 1/^v truncated to n p < N cycles is computed recursively, by multiplying 

l/^-) = l/^-i)[l-^ ( °], (20.8) 

and truncating the expansion at each step to a finite polynomial in g, n < N. The 
result is the Mh order polynomial approximation 

N 

1/^ = 1-2^- (20 - 9) 
n=l 

In other words, a cycle expansion is a Taylor expansion in the dummy variable z, 
where each term in the sum is raised to the topological cycle length. If both the 
number of cycles and their individual weights grow not faster than exponentially 
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with the cycle length, and we multiply the weight of each cycle p by a factor ft , 
the cycle expansion converges for sufficiently small \z\. If the symbolic dynamics 
grammar is finite, the truncation cuttof N has to be larger than the length of longest 
cycle in the transition graph (15. 15), for the salubrious effect of shadowing cance- 
lations to kick in. If that is the case, further increases in N yield the exponentially 
decreasing corrections c n in (20.7). 

If the dynamics is given by an iterated mapping, the leading zero of (20.9) 
as a function of z yields the leading eigenvalue of the appropriate evolution oper- 
ator. For continuous time flows, z is a dummy variable that we set to z - 1, and 
the leading eigenvalue of the evolution operator is given by the leading zero of 
l/£(s,/3(s)) as function of s. 



20.2.2 Evaluation of traces and spectral determinants 

We commence the cycle expansion evaluation of a spectral determinant by com- 
puting the trace formula (18.10) or (18.23). The weight of prime cycle p repeated 
r times is 



t p (z,P,r) = 
t p (s,j3,r) = 



e rfi-A p z rn p 

|det(l-M;)| 

e r(p-A p -sT p ) 

|det(l-M;)| 



(discrete time) 
(continuous time) . 



(20.10) 
(20.11) 



For discrete time, the trace formula (18.10) truncated to all prime cycles p and 
their repeats r such that n p r < N, 



tr- 



z£ 



l-z£ 



Yc„z n , C n =ti£ n , 



n=l 



(20.12) 



is computed as a polynomial in z by adding a cycle at the time: 



tr- 



z£ 



l-z£ 



tr- 



z£ 



l-z£ 



n {i yr<N 



0-1) 



+ «(o ^ t (i) (z,/3, r) . 



For continuous time, we assume that the method of Poincare sections assigns each 
cycle a topological length n p . Than the trace formula (18.23) is also organized as 
a polynomial 



tr- 



N 

V CnZ H 



N 



n=\ 



(20.13) 
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computed as: 

ntnr<N 



tr- 



1 



= tr 

(0 s - 31 



+ T (i) £ t & (s,fi,r)?P r 



r=l 



The periodic orbit data set (20.4) consists of the list of the cycle periods T p , the 
cycle Floquet multipliers A p ^,A p ^, ■ ■ ■ ,A p j, and the cycle averages of the ob- 
servable A p for all prime cycles p such that n p < N. The coefficient of z"i' r is then 
evaluated numerically for the given parameter values (fi, s). Always compute the 
leading eigenvalue of the evolution operator first, i.e., the escape rate y = in 
order to use it in calculation of averages of sect. 20.4 as a weig hte rT u in (20.12). 
Now that we have an expansion for the trace formula (18.9) as a power series, we 
compute the Nth order approximation to the spectral determinant (19.3), 



N 



det (1 - z£)\ N = 1 - 2j Q nZ " ' Q" = nth cumulant . ( 20 - 14 ) 

n=\ 

as follows. The logarithmic derivative relation (19.4) yields 
(tr y^^J det (1 - z£) = -z^ det C 1 - z£) 



(Ciz + C 2 z 2 + ■ ■ 0(1 - Qiz - Qiz 2 -•••) = Qiz + 2Q 2 z 2 + 3Q 3 z 3 



so the «th order term of the spectral determinant cycle (or in this case, the cumu- 
lant) expansion is given recursively by the convolution trace formula expansion 
coefficients 

Q n = -(C„-C„_iGi -•••Cig„-i) , <2i=Ci. (20.15) 
n 

Given the trace formula (20.12) truncated to z^, we now also have the spectral 
determinant truncated to z N . 

The same program can also be reused to compute the dynamical zeta function 
cycle expansion (20.9), by replacing n(l ^ n (20.12) by the product of sectjon 193 

expanding eigenvalues A(,) = rieA-OV- 

A few points concerning different cycle averaging formulas: 

• The dynamical zeta functions is an approximation to spectral determinant 
that yields only the leading eigenvalue of the evolution operator. The cycle 
weights depend only on the product of expanding |A| Floquet multipliers, 
so signs do no matter. For hyperbolic flows they converge exponentially 
with increasing cycle lengths. 
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Table 20.2: The 3-disk repeller escape rates computed from cycle expansions of the spec- 
tral determinant (19.6) and the dynamical zeta function (19.15), as functions of the max- 
imal cycle length N. The disk-disk center separation to disk radius ratio is R.a, and the 
det(s - is an estimate of the classical escape rate computed from the spectral det- 
erminant cycle expansion in the fundamental domain. For larger disk-disk separations, 
the dynamics is more uniform, as illustrated by the faster convergence. Convergence of 
spectral determinant det(s - ift) is super-exponential, see chapter 23. For comparison, the 
1 /£(s) column lists estimates from the fundamental domain dynamical zeta function cycle 
expansion (20.7), and the l/^Xs^-disk c °l umn nsts estimates from the full 3-disk cycle 
expansion (20.43). The convergence of the fundamental domain dynamical zeta function 
is significantly slower than the convergence of the corresponding spectral determinant, 
and the full (unfactorized) 3-disk dynamical zeta function has still poorer convergence. 
(RE. Rosenqvist.) 
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• spectral determinants weights in (19.3) contain 1/|1 - A,| factors, so for 
them signs of Floquet multipliers A, do matter. With finite grammar the 
leading eigenvalue converges super-exponentially in cycle length. 

Note that while the dynamical zeta functions weights use only the expand- 
ing Floquet multipliers |Ag|, for spectral determinants the weights are of 
form |1 - A^|, both expanding and contracting directions contribute, and the 
signs of multipliers do matter. That's why ChaosBook everywhere tracks 
multipliers Ay, rather than Floquet exponents Aj. A's belong to equilibria, 
periodic orbits require multipliers. That's the way cookie crumbles. For 
very high-dimensional flows (such as unstable periodic solutions of Navier- 
Stokes equations), usually only a subset of the most unstable / least con- 
tracting Floquet multipliers is known. As long as the contracting Floquet 
multipliers omitted from the weights in (20. 12) are sufficiently strongly con- 
tracting, the errors introduced by replacement |1 - K\ — > 1 for such eigen- 
values should be negligible. 

• The least enlightened are the 'level sum' cycle averaging formulas. There 
is no point in using them, except that they have to be mentioned (here in 
sect. 22.4), as there is voluminous literature that uses them. 

• Other formulas published in physics literature are likely to be wrong. 

If the set of computed periodic orbits is incomplete, and their Floquet mul- 
tipliers inaccurate, distinctions between different cycle averaging formulas are 
academic, as there are not sufficiently many cycles to start worrying about what 
expansion converges faster. 



20.3 Periodic orbit averaging 

The first cycle expansion calculation should always be the determination of the 
leading eigenvalue of the evolution operator, calculated as follows. After the 
prime cycles and the pseudo-cycles have been grouped into subsets of equal topo- 
logical length, the dummy variable can be set equal to z = 1. With z - 1, the 
expansion (20.14) constitutes the cycle expansion (19.6) for the spectral deter- 
minant det(s - tR) . We vary s in cycle weights, and determine ath eigenvalue 
s a (17.29) by finding s = s a for which (20.14) vanishes. As an example, the 
convergence of a leading eigenvalue for a nice hyperbolic system is illustrated in 
table 20.2 by the list of pinball escape rates y = —so estimates computed from 
truncations of (20.7) and (20.14) to different maximal cycle lengths. 

Cn3pt6r 23 

The pleasant surprise, to be explained in chapter 23, is that one can prove 
that the coefficients in these cycle expansions decay exponentially or even faster, 
because of the analyticity of det (s - jft) or l/£(s), for s values well beyond those 
for which the corresponding trace formula (18.23) diverges. 
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Figure 20.1: Example scans in the complex s 
plane: contour plots of the logarithm of the ab- 
solute values of (left) 1 /((s), (right) spectral deter- 
minant det (s—tt) for the 3-disk system, separation 
R : a = 6. The A { subspace is evaluated numeri- 
cally. The eigenvalues of the evolution operator £, 
are given by the centers of elliptic neighborhoods 
of the rapidly narrowing rings. While the dynam- 
ical zeta function is analytic on the Im s > — 1 half- 
plane, the spectral determinant is entire and reveals 
further families of zeros. (P.E. Rosenqvist) 
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20.3. 1 Newton algorithm for determining the evolution operator eigen- 
values 



Cycle expansions of spectral determinants can be used to compute a set 
of leading eigenvalues of the evolution operator. A convenient way to search for 
these is by plotting either the absolute magnitude In |det (s - ^R)\ or the phase of 
spectral determinants and dynamical zeta functions as functions of the complex 
variable s. The eye is guided to the zeros of spectral determinants and dynamical 
zeta functions by means of complex s plane contour plots, with different intervals 
of the absolute value of the function under investigation assigned different col- 
ors; zeros emerge as centers of elliptic neighborhoods of rapidly changing colors. 
Detailed scans of the whole area of the complex s plane under investigation and 
searches for the zeros of spectral determinants, figure 20. 1 , reveal complicated 
patterns of resonances even for something as simple as the 3-disk game of pinball. 
With a good starting guess (such as the location of a zero suggested by the com- 
plex s scan of figure 20.1), a zero l/£(s) = 0 can now be determined by standard 
numerical methods, such as the iterative Newton algorithm (13.4), with the rath 
Newton estimate given by 



= S„ 



(20.16) 



The denominator (T)^ is required for Newton iteration and is given by cycle ex- 
pansion (20.25). We need to evaluate it anyhow, as (T)^ is needed for the cycle 
averaging formulas. 

Our next task will be to compute long-time averages of observables. Three 
situations arise, two of them equal in practice: 



(i) The system is bounded, and we have all cycles up to some cutoff: always 
start by testing the cycle expansion sum rules of sect. 20.3.2. 

(ii) The system is unbounded, and averages have to be computed over a repeller 
whose natural measure is obtained by balancing local instability with the 
global escape rate y = -sq, as in sect. 17.3. 
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(iii) The system is bounded, but we only have a repelling set consisting of a sub- 
set of unstable cycles embedded into the bounded strange attractor. Best one 
can do is to treat this as an open system, case (iii). That assigns a stationary 
natural measure to neighborhoods of the solutions used, the local instabili- 
ties balanced by a weight that includes escape rate exp(yT p ). Whether use 
of this measure improves averages as one increases the stability cutoff de- 
pends on whether the longer cycles explore qualitatively different regions 
of state space not visited by the shorter (fundamental) cycles, or only revisit 
already known regions (curvature corrections). 

20.3.2 Flow conservation sum rules 

If a dynamical system is bounded, so that all trajectories remain confined for all 
times, the escape rate (22.8) vanishes y = -sq = 0, and the leading eigenvalue of 
the Perron-Frobenius operator (16.10) (evolution operator with yS = 0) is simply 
exp(-fy) = 1. Conservation of material flow thus implies that for bounded flows 
cycle expansions of dynamical zeta functions and spectral determinants satisfy 
exact flow conservation sum rules: 

i/ao,0) = i + V ' =0 

CO 

F(0,0) - l-2](2„(0,0) = 0 (20.17) 

n=\ 

obtained by setting s = 0 in (20.18), (20.19) with cycle weights t p = e~ sT f /\A P \ -> 
1 /\A p \ . These sum rules depend neither on the cycle periods T p nor on the observ- 
able a(x) under investigation, but only on the cycle stabilities A^i, A p> 2, • • •, A^. 
Their significance is purely geometric; they are a measure of how well periodic 
orbits tessellate state space, as in figure 1.11. Conservation of material flow pro- 
vides a first and very useful test of the quality of finite cycle length truncations and 
is something that you should always check when constructing a cycle expansion 
for a bounded flow. 



20.4 Cycle formulas for dynamical averages 

Want to learn some useful safety maneuvers? Or perhaps 
you'd like to become a more able mechanic? Or have bike 
safety questions answered? Or eat pizza? Then sign up for 
Enlightened Cycling ! 

— Bike GT: Cycling around Georgia Tech 

The eigenvalue conditions for the dynamical zeta function (20.2) and the spectral 
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Pi 



F(p,s((3))=0 curve 



Figure 20.2: The eigenvalue condition is satisfied on 
the curve F = 0 on the (fS, s) plane. The expectation 
value of the observable (17.12) is given by the slope of 
the curve. 




determinant (20.14), 



0 = t n = t„(J3,s(J3)) 

7T 

OO 

0 = l-YjQn> Qn = Qn(J3,m), 



(20.18) 

(20.19) 



are implicit equations for an eigenvalue s = s(J3) of the form 0 = F(J3, s(fi)). The 
eigenvalue s = s{fl) as a function of yS is sketched in figure 20.2; this condition 
is satisfied on the curve F = 0. The cycle averaging formulas for the slope and 
curvature of s(J5) are obtained as in (17.12) by taking derivatives of the eigenvalue 
condition. Evaluated along F - 0, by the chain rule the first derivative yields 



dF ds dF 
lip + djilh 



s=s(® 



ds dF I dF 
d/3 ~ ~~d/3i ^7' 



(20.20) 



and the second derivative of F(j3, s(fi)) = 0 yields 



d 2 s 
d/P 



ds 



d 2 F 2 ds d 2 F 



2 d 2 F 



ds 2 



idF 



(20.21) 



Denoting 



(A) F 
(CA - (A)) 2 ) p 



_ dF_ 

d 2 F 



OS 



P,s=s% 



30 1 



(20.22) 



respectively, and the mean cycle expectation value of A, the mean cycle period, 
and the second derivative of F computed for F(fi, s(fi)) = 0, we obtain the cycle 
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averaging formulas for the expectation and variance of the observable (17.12): 



(a) = (20.23) 
((a -(a)) 2 ) = A-((A-(A» 2 ) F . (20.24) 



These formulas are the central result of periodic orbit theory. We now show that 
for each choice of the function F(J3, s) in (20.2), (20.14), and (22.15), the above 
quantities have explicit cycle expansions. 



20.4.1 Dynamical zeta function cycle averaging formulas 

For the dynamical zeta function condition (20.18), the cycle averaging formulas 
(20.20), (20.24) require one to evaluate derivatives of dynamical zeta functions at 
a given eigenvalue. Substituting the cycle expansion (20.2) for the dynamical zeta 
function we obtain 

w< ■= = Y1 A ^ (20 - 25) 



d 1 v-i> d 1 v-i' 

( }i := d~s( = ^ ( " >f := ~ Z ~dz~t = 2j ***** 



where the subscript in (•••)/ stands for the dynamical zeta function average over 
prime cycles, A„, T n , and n n given by (20.3) are evaluated on pseudo-cycles (20.4), 
and pseudo-cycle weights t n — t n (j^,fi, s(/J)) are evaluated at the eigenvalue s(fi). 
In most applications fi = 0, and s(J3) of interest is typically the leading eigenvalue 
so = *o(0) of the evolution generator 3K. 

For bounded flows the leading eigenvalue (the escape rate) vanishes, s(0) = 0, 
the exponent f5A n - sT n in (20.3) vanishes, so the cycle expansions take a simple 
form 

{Ak = y ( _ l)k+1 A s + A P2 ..^ + A Pk ^ (2Q26) 
where analogous formulas hold for (T)g, (n)^. 



Example 20.1 Cycle expansion for the mean cycle period: For example, for the 
complete binary symbolic dynamics the mean cycle period {T) ( is given by sec tj on 1 5 4 

To Ti I Toi T 0 + T, 



Too\ To\ + To\ I Tqu _ Tqi + T\ 



IA001I IA01A0I / \|Aon| |AoiA 
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Note that the cycle expansions for averages are grouped into the same shad- 
owing combinations as the dynamical zeta function cycle expansion (20.7), with 
nearby pseudo-cycles nearly canceling each other. 

The cycle averaging formulas for the expectation of observable (a) follow by 
substitution into (20.24). Assuming zero mean drift (a) = 0, the cycle expansion 
(20.14) for the variance ((A - (A)) 2 ) is given by 




(20.28) 



20.4.2 Spectral determinant cycle expansions 



The dynamical zeta function cycle expansions have a particularly simple structure, 
with the shadowing apparent already by a term-by-term inspection of table 20.2. 
For "nice" hyperbolic systems, shadowing ensures exponential convergence of the 
dynamical zeta function cycle expansions. This, however, is not the best achiev- 
able convergence. As will be explained in chapter 23, for nice hyperbolic systems 
the spectral determinant constructed from the same cycle database is entire, and 
its cycle expansion converges faster than exponentially. The fastest convergence 
is attained by the spectral determinant cycle expansion (20.19) and its deriva- 
tives. In this case the d/ds, d/dfi derivatives are computed recursively, by taking 
derivatives of the spectral determinant cycle expansion contributions (20.12) and 
(20.15). 

The cycle averaging formulas are exact, and highly convergent for nice hy- 
perbolic dynamical systems. An example of their utility is the cycle expansion 
formula for the Lyapunov exponent of example 20.2. Further applications of cy- 
cle expansions will be discussed in chapter 22. 



20.4.3 Continuous vs. discrete mean return time 



Sometimes it is convenient to compute an expectation value along a flow in con- 
tinuous time, and sometimes it might be easier to compute it in discrete time, from 
a Poincare return map. Return times (3.1) might vary wildly, and it is not at all 
clear that the continuous and discrete time averages are related in any simple way. 
As we shall now show, the relationship turns out to be both elegantly simple, and 
totally general. 

The mean cycle period (T) F fixes the normalization of the unit of time; it can 
be interpreted as the average near recurrence or the average first return time. For 
example, if we have evaluated a billiard expectation value (a) = (A) F I{T) F in 
terms of continuous time, and would like to also have the corresponding average 



exercise 20.13 
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(tf)dscr - (A) F /(n) F measured in discrete time, given by the number of reflections 
off billiard walls, the two averages are related by 



where (n) F the average of the number of bounces n p along the cycle p is given by 
is (20.25). 



Example 20.2 Cycle expansion formula for Lyapunov exponents: In sect. 17.4 
we defined the Lyapunov exponent for a I -dimensional map, relating it to the leading 
eigenvalue of an evolution operator, and promised to evaluate it. Now we are finally in 
position to deliver on our promise. 

The cycle averaging formula (20.26) yields an exact explict expression for the 
Lyapunov exponent in terms of prime cycles: 



For a repeller, the l/\A p \ weights are replaced by (22.10), the normalized measure 
weights exp(yn p )/\A p \, where y is the escape rate. 

For 2-dimensional Hamiltonian flows such as our game of pinball (see exam- 
ple 19.3), there is only one expanding eigenvalue and (20.30) applies as written. 
However, in dimensions higher than one, a correct calculation of Lyapunov expo- 
nents requires a bit of sophistication. 



20.5 Cycle expansions for finite alphabets 

A finite transition graph like the one given in figure 14.6 (d) is a compact encod- 
ing of the transition matrix for a given subshift. It is a sparse matrix, and the 
associated determinant (15.20) can be written by inspection: it is the sum of all 
possible partitions of the graph into products of non-intersecting loops, with each 
loop carrying a minus sign: 

det(l - T) = 1 - to - toon - foooi _ fooon + ^oon + ^oonfoooi (20.31) 

The simplest application of this determinant is the evaluation of the topological 
entropy; if we set t p = z" p , where n p is the length of the /?-cycle, the determinant 
reduces to the topological polynomial (15.21). 

The determinant (20.31) is exact for the finite graph figure 14.6(e), as well 
as for the associated finite-dimensional transfer operator of example 17.4. For 



(«>dscr = (o)(T) F I (n) F , 



(20.29) 




(20.30) 
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the associated (infinite dimensional) evolution operator, it is the beginning of the 
cycle expansion of the corresponding dynamical zeta function: 

l/£ = 1 - to - toon - foooi + ^oooifoon 

-Oooon - t 0 tooi\ + • ■ • curvatures) . . . (20.32) 



The cycles 0, 0001 and 001 1 are the fundamental cycles introduced in (20.7); they 
are not shadowed by any combinations of shorter cycles. All other cycles appear 
together with their shadows (for example, the fooon - ^o^oon combination, see 
figure 1.12) and yield exponentially small corrections for hyperbolic systems. For 
cycle counting purposes, both t a t and the pseudo-cycle combination t a+ t, = t a % in 
(20.2) have the same weight f a+ " b , so all curvature combinations t a \, - t a tb vanish 
exactly, and the topological polynomial (15.27) offers a quick way of checking 
the fundamental part of a cycle expansion. 

The splitting of cycles into the fundamental cycles and the curvature correc- 
tions depends on balancing long cycles tab against their pseudo-trajectory shadows 
t a tb. If the ab cycle or either of the shadows a, b do not to exist, such curvature 
cancelation is unbalanced. 

The most important lesson of the pruning of the cycle expansions is that pro- 
hibition of a finite subsequence imbalances the head of a cycle expansion and 
increases the number of the fundamental cycles in (20.7). Hence the pruned ex- 
pansions are expected to start converging only after all fundamental cycles have 
been incorporated - in the last example, the cycles 1, 10, 10100, 1011100. With- 
out cycle expansions, no such crisp and clear cut definition of the fundamental set 
of scales is available. 

Because topological zeta functions reduce to polynomials for finite grammars, 
only a few fundamental cycles exist and long cycles can be grouped into curvature 
combinations. For example, the fundamental cycles in exercise 9.6 are the three 
2-cycles that bounce back and forth between two disks and the two 3-cycles that 
visit every disk. Of all cycles, the 2-cycles have the smallest Floquet exponent, 
and the 3-cycles the largest. It is only after these fundamental cycles have been 
included that a cycle expansion is expected to start converging smoothly, i.e., only 
for n larger than the lengths of the fundamental cycles are the curvatures q (in 
expansion (20.7)), a measure of the deviations between long orbits and their short 
cycle approximations, expected to fall off rapidly with n. 



20.6 Stability ordering of cycle expansions 

There is never a second chance. Most often there is not 
even the first chance. 
— John Wilkins 

(CP. Dettmann and P. Cvitanovic) 
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We have judiciously deployed the 3-disk pinball, with its simple grammar, to mo- 
tivate the periodic orbit theory Most dynamical systems of interest, however, have 
infinite grammar, so at any order in z a cycle expansion may contain unmatched 
terms that do not fit neatly into the almost canceling curvature corrections. Sim- 
ilarly, for the intermittent systems that we shall discuss in sect. 24.3.4, curvature 
corrections are not small in general, so again the cycle expansions may converge 
slowly. For such systems, schemes that collect the pseudocycle terms according 
to some criterion other than the topology of the flow may converge faster than 
expansions based on the topological length. 

All chaotic systems exhibit some degree of shadowing, and a good truncation 
criterion should do its best to respect the shadowing as much as possible. If a 
long cycle is shadowed by two or more shorter cycles and the flow is smooth, the 
periods and the Floquet exponents will be additive in sense that the period of the 
longer cycle is approximately the sum of the shorter cycle periods. Similarly, as 
stability is multiplicative, shadowing is approximately preserved by including all 
terms with pseudo-cycle stability 

\A pi ---A Pk \ < A max (20.33) 

and ignoring any pseudo-cycles that are less stable. 

Two such schemes for ordering cycle expansions that approximately respect 
shadowing are truncations by the pseudocycle period and the stability ordering 
that we shall discuss here. In these schemes, a dynamical zeta function or a spec- 
tral determinant is expanded. One keeps all terms for which the period, action or 
stability for a combination of cycles (pseudo-cycles) is less than a given cutoff. 

Settings in which stability ordering may be preferable to ordering by topo- 
logical cycle length are the cases of bad grammar, of intermittency, and of partial 
cycle data sets. 

20.6.1 Stability ordering for bad grammars 

For generic flows it is often not clear what partition of state space generates the 
"optimal" symbolic dynamics. Stability ordering does not require understanding 
dynamics in such detail: if you can find the cycles, you can use stability-ordered 
cycle expansions. Stability truncation is thus easier to implement for a generic 
dynamical system than the curvature expansions (20.7) that rely on finite subshift 
approximations to a given flow. 

Cycles can be detected numerically by searching a long trajectory for near 
recurrences. The long trajectory method for detecting cycles preferentially finds 
the least unstable cycles, regardless of their topological length. Another practical 
advantage of the method (in contrast to blind Newton method searches) is that it 
preferentially finds cycles in a given connected ergodic component of state space, 
ignoring isolated cycles or other ergodic regions elsewhere in state space. 
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Why should stability-ordered cycle expansions of a dynamical zeta function 
converge better than the crude trace formula (22.9), to be discussed in sect. 22.2? 
The argument has essentially already been laid out in sect. 15.6: in truncations 
that respect shadowing, most of the pseudo-cycles appear in shadowing combi- 
nations and nearly cancel, while only the relatively small subset affected by the 
increasingly long pruning rules is not shadowed. The error is typically of the order 
of 1 /A, which is smaller by a factor d* 7 than the trace formula (22.9) error, where 
h is the entropy and T is the typical cycle length for cycles of stability A. 

20.6.2 Smoothing 



^■r If most, but not all long cycles in a stability truncation are shadowed by 
shorter cycles, we say that the shadowing is partial. The breaking of exact shad- 
owing cancellations deserves further comment. Any partial shadowing that may 
be present can be (partially) restored by smoothing the stability-ordered cycle ex- 
pansions by replacing the 1 /A weight for each term with the pseudo-cycle stability 
A = Apj • • • A Pk by /(A)/ A. Here, /(A) decreases monotonically from f(0) = 1 
to /(A max ) = 0. The lack of smoothing means we have a step function. 

A typical "shadowing error" induced by the cutoff is due to two pseudo-cycles 
of stability A separated by AA; the contributions of these pseudo-cycles are of 
opposite sign. Ignoring possible weighting factors, the magnitude of the resulting 
term is of order 1/A - 1/(A + AA) « AA/A 2 . With smoothing, one obtains an 
extra term of the form /'(A) AA/A, which we want to minimize. A reasonable 
guess might be to keep f(A)/A constant and as small as possible, so that 



The results of a stability-ordered expansion (20.33) should always be tested 
for robustness by varying the cutoff Amax. If this introduces significant variations, 
smoothing is probably necessary. 



Resume 

A cycle expansion is a series representation of a dynamical zeta function, trace 
formula or a spectral determinant, with products in (19.15) expanded as sums 
over pseudo-cycles, which are products of the prime cycle weights tp. 

If a flow is hyperbolic and has the topology of the Smale horseshoe (a sub- 
shift of finite type), dynamical zeta functions are holomorphic (have only poles 
in the complex s plane), the spectral determinants are entire, and the spectrum of 
the evolution operator is discrete. The situation is considerably more reassuring 
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than what practitioners of quantum chaos fear; there is no 'abscissa of absolute 
convergence' and no 'entropy barrier', the exponential proliferation of cycles is 
no problem, spectral determinants are entire and converge everywhere, and the 
topology dictates the choice of cycles to be used in cycle expansion truncations. 

In this case, the basic observation is that the motion in low-dimensional dy- 
namical systems is organized around a few fundamental cycles, with the cycle 
expansion of the Euler product 

i/<r=i-2>-2>' 

/ 

regrouped into dominant fundamental contributions tf and decreasing curvature 
corrections c n . The fundamental cycles tf have no shorter approximations; they 
are the 'building blocks' of the dynamics in the sense that all longer orbits can be 
approximately pieced together from them. A typical curvature contribution to q, 
is the difference of a long cycle {ab} and its shadowing approximation by shorter 
cycles {a} and {b}, as in figure 1.12: 

tab ~ t a tb = t a b(J ~ t a tb/tab) 

Orbits that follow the same symbolic dynamics, such as {ab} and a 'pseudo-cycle' 
{a}{Z?}, lie close to each other, have similar weights, and for increasingly long 
orbits the curvature corrections fall off rapidly. Indeed, for systems that satisfy the 
'axiom A' requirements, such as the 3-disk billiard, curvature expansions converge 
very well. 

Once a set of the shortest cycles has been found, and the cycle periods, stabili- 
ties, and integrated observable have been computed, the cycle averaging formulas 
such as (20.25) for the dynamical zeta function 

(a) = {A)r[{T)r, where for the zeta function expansions: 
d 1 d 1 v~" 

< A V - - W( =£ AA ' ^ = Tsl = Z>'* 

yield the expectation value of the observable a(x), i.e., the long time average over 
the chaotic non-wandering set). 



Commentary 

Remark 20.1 Alternative Periodic Orbit Theories. 

Extraordinary how potent cheap music is. 
— Noel Coward 
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There are no 'alternative periodic orbit theories'. There is only one ergodic theory, and 
periodic orbits are one aspect of it, just like there is only one quantum mechanics, and 
WKB is one way to gain insight into it. While the eigenfunctions of quantum evolution 
operators are smooth Hilbert space states, the eigenfunctions of deterministic evolution 
operators are highly singular, nowhere differentiable functions with support on fractal sets. 
The deterministic eigenstates of high-dimensional ergodic flows thus cannot be computed 
using the methods developed for quantum eigenstates, at least not without much further 
thought. The ergodic, singular 'natural measure' is harder (and in high-dimensional state 
space impossible) to construct numerically than its smooth quantum cousin, the 'ground 
state', and periodic orbits seem to be the way to do it. Were ergodic theory easy, Chaos- 
Book.org and Gaspard monograph [ 8] would have been a much breezier reads. 

In the vast and vastly uneven periodic orbit literature (should erroneous papers be 
cited?) one sometimes encounters the 'escape-time weighting', 



2 P T p a 



p 



(a) = -^r^- , t p = — — , (20.34) 

hp T p hj A p,j 



where the p sum goes over all known unstable periodic orbits, and sometimes also over 
judiciously chosen subsets of unstable equilibria. Here 



1 

V A pJ = — In A p , A p = | A p ,i Ap, 2 ■ ■ ■ A„J , (20.35) 



T p is the period of of the prime periodic orbit p, A p j is the y'th Floquet multiplier (5.5), 
A p is the product of the expanding multipliers \A p j\ > 1 , j = 1, . . . , e, and A p j's are the 
strictly positive Floquet exponents. 

Yf Ap,i is the local escape rate from single repelling cycle p, of dimension l/[time], 
so one may interpret its inverse t p as "an estimate of the mean time spent by the system 
in vicinity of periodic orbit" p. t p is the Lyapunov time of cycle p, that is, the mean time 
it takes for the density of neighboring trajectories in an arbitrarily small ball centered 
around a point on the trajectory to decrease by factor 1 je. 

The 'escape-time weighting' was introduced in a rapid communication thusly: "Less 
unstable orbits must be weighted more heavily, so the attractor dimension is approximated 
by 'escape-time weighting' (20.34)." That's it: the 'derivation' in its entirety. Formula 
(20.34) is then asserted to approximate the time average (a) of observable a{x) over the 
chaotic attractor in terms of 



ip -Tpi 



dTa[f(x 0 )], x 0 ep, (20.36) 



averaged over each and every prime periodic orbit p found in any computer exploration 
of a dynamical system's state space. 

The enchantment with the escape-time weighting approach lies its charming simplic- 
ity. If one has a dynamical problem, and if one has a computer one has programmed 
to search for periodic orbits, and if the computer brings back a set of unstable periodic 
orbits, all one has to do is to put A p j and a p into the formula (20.34), and it returns a 
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number - let's say D — 9.0 + 0.1, where the error one estimates somehow - which one then 
publishes. 

The only drawback is that the 'formula' is wrong. (1) It comes from nowhere. (2) 
t p has dimension of l/[time], but a 'weight' should be a dimensionless number, the like- 
lihood that an ergodic trajectory enters the neighborhood of the periodic orbit p. (3) Yip 
is the sum over all unstable prime periodic orbits, regardless whether they belong to the 
ergodic component under investigation or dwell isolated in the Moon orbit. (4) The guess 
for the weight t p is clearly wrong, as any periodic orbit, no matter how long and unstable, 
has the comparable weight Y" A p j, as l° n g as its Lyapunovs (instability rate per unit time) 
are comparable; the Lyapunov time has nothing to do with the period of the particular 
cycle. For that there is a fix in the literature, with the Lyapunov time in (20.34) replaced 
by 



Tp/^Apj. (20.37) 



The fix is explained as follows: "it is reasonable also to suppose that orbits with longer 
periods must be weighted more heavily as they are longer and should provide a greater 
contribution to the total sum." That's it: the 'derivation' in its entirety. No less wrong. 

The exact weight of the unstable prime periodic orbit p (for level sum ( 18.7)) was 
given by Kadanoff and Tang [ 1 0] in 1984. For the classical trace formula for flows ( 18.23) 
it is 



T p e 



■T.s 



det(l-Mj 



Ap = T p a p , (20.38) 



where s is the evolution operator eigenvalue, and /3 is an auxiliary variable. It is de- 
termined by the dimensionless Floquet multipliers (eigenvalues of the periodic orbit's 
monodromy matrix M p ) which grow/shrink exponentially with cycle period, not the Os- 
eledec Lyapunov exponents or periodic orbit Floquet exponents which measure average 
expansion/contraction rate per unit time. 

The exact cycle averaging formulas for the expectation value of the observable a, 
derived in chapters 16 to 19, have form 



(a) = (A) F /(T) F , (20.39) 



where the form of the periodic orbit sum (■ ■ -) F depends on whether it is computed from 
the trace formula (22.15), the dynamical zeta function (20.2), or the spectral determinant 
(20.14). This sum is never of the form (20.34). 

Often one cares only about the leading long-time behavior, and for long periodic or- 
bits approximates the denominator of (20.38) by the product of the expanding multipliers 
A p of the monodromy matrix M p , 



(20.40) 

det(l-M p ) 



A,, 
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This weight seems to have been first used in the 1987 Auerbach et al. [ 17], who computed 
(for discrete time n given by iterates of map /) an nth order estimate s („) of the leading 
evolution operator eigenvalue s from the sum of all periodic points j of period n 

1= X tj = ^. (20.41) 

;eFix/» ; 

Even as its was written, the heuristics of this paper was superseded by the exact cycle 
expansions, first published in 1987 Cvitanovic letter [10]. 

Then there is in literature an 'Alternative Periodic Orbit Theory' so bold that one can 
only call it The Heresy: the conjecture is that if one looks carefully enough, there exists 
a single periodic orbit that captures all dynamical averages of a turbulent flow. This is so 
wrong that one is at loss what to say: there is NO such single periodic orbit. Instead, there 
is the well established theory that says how periodic orbits are to be used, and how many 
are needed to capture the hyperbolic parts of the non-wandering set to a desired accuracy. 
It is as elegant and systematic as Statistical Mechanics and Quantum Field Theory. Read 
ChaosBook.org. But who reads books nowadays? (Of course, if one takes a random 
very long periodic orbit, one will get estimates as good as from an ergodic trajectory of 
comparable length, but then why make life hard by insisting on exact recurrence?) 

The strangest thing about 'Alternative Periodic Orbit Theories' is that since intro- 
duction of zeta functions of Smale (1967), Ruelle (1976), Gutzwiller (1969) and their 
cycle expansions (1987) there is no need for them whatsoever. Why would on need to 
guess an approximate periodic orbit weight when the exact weight is already known? It 
costs exactly the same to compute the exact spectral determinant as it costs to compute 
a wrong formula, both require the same periodic orbits, Floquet multipliers, periods, and 
cycle-averaged observables a p . Go figure... 

Remark 20.2 Pseudocycle expansions. Bowen's introduction of shadowing e- 
pseudo-orbits [28] was a significant contribution to Smale's theory. The expression 'pseudo- 
orbits' seems to have been introduced in Parry and Pollicott's 1983 paper [ 19]. Following 
them, M. Berry [9] used the expression 'pseudo-orbits' in his 1986 paper on Riemann 
zeta and quantum chaos. Cycle and curvature expansions of dynamical zeta functions and 
spectral determinants in terms of pseudocycles were introduced in refs. [10, 2]. Some 
literature [12] refers to pseudo-orbits as 'composite orbits', and to cycle expansions as 
'Dirichlet series' (see also appendix 1.5 and remark J.l). 

Remark 20.3 Cumulant expansion. To statistical mechanicians, curvature ex- 

section 17.1.3 

pansions are very reminiscent of cumulant expansions. Indeed, (20.15) is the standard 
Plemelj -Smithies cumulant formula for the Fredholm determinant. A new aspect, not rem- 
iniscent of statistical mechanics, is that in cycle expansions each Q n coefficient is ex- 
pressed as a sum over exponentially many cycles. 

Remark 20.4 Exponential growth of the number of cycles. Going from N n ~ N" 
periodic points of length n to M„ prime cycles reduces the number of computations from 
N n to M n « N"~ l jn. The use of discrete symmetries (chapter 21) reduces the number 
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of nth level terms by another factor. While reformulating theory from trace ( 18.28) to 
cycle expansion (20.7) does not eliminate exponential growth in the number of cycles, in 
practice only the shortest cycles are used, and the reduction in computational labor for 
these cycles can be significant. 

Remark 20.5 Shadowing cycle-by-cycle. A glance at the low order curvatures in 
table 20.1 leads to the temptation to associate curvatures to individual cycles, such as 
coooi = f oooi - fofooi- Such combinations tend to be numerically small (see, for example, 
ref. [3], table 1). However, splitting c„ into individual cycle curvatures is not possible in 
general [12]; the first example of such ambiguity in the binary cycle expansion is given by 
the t iooioi > ^iooi 10 0 «-> 1 symmetric pair of 6-cycles; the counterterm fooi f on in table 20.1 
is shared by these two cycles. 

Remark 20.6 Escape rates. A lucid introduction to escape from repellers is given by 
Kadanoff and Tang [ 1 0], For a review of transient chaos see refs. [11, 13]. The ^-function 
formulation is given by Ruelle [ 18] and W. Parry and M. Pollicott [ 14] and discussed in 
ref. [15]. Altmann and Tel [16] give a detailed study of escape rates, with citations to 
more recent literature. 

Remark 20.7 Stability ordering. The stability ordering was introduced by Dahlqvist 
and Russberg [13, 14] in a study of chaotic dynamics for the (x 2 y 2 ) l ^ a potential. The 
presentation here runs along the lines of Dettmann and Morriss [ 15] for the Lorentz gas, 
which is hyperbolic but with highly pruned symbolic dynamics, and Dettmann and Cvi- 
tanovic [ 1 6] for a family of intermittent maps. In the applications discussed in the above 
papers, stability ordering yields a considerable improvement over topological length or- 
dering. In quantum chaos applications, cycle expansion cancelations are affected by the 
phases of pseudo-cycles (their actions), hence period or action ordering rather than sta- 
bility is frequently employed. 

Remark 20.8 Desymmetrized cycle expansions. The 3-disk cycle expansions 
(20.43) might be useful for cross-checking purposes, but, as we shall see in chapter 21, 
they are not recommended for actual computations, as the factorized zeta functions yield 
much better convergence. 
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Exercises 



20. 1 . Cycle expansions. Write programs that implement 
binary symbolic dynamics cycle expansions for (a) dyn- 
amical zeta functions, (b) spectral determinants. Com- 
bined with the cycles computed for a 2-branch repeller 
or a 3-disk system they will be useful in the problems 
below. 

20.2. Escape rate for a 1 -dimensional repeller. (continua- 



tion of exercise 19.1 - easy, but long) 
the quadratic map (19.31) 

f(x)=Ax(l-x) 



Consider again 



on the unit interval. In order to be definitive, take ei- 
ther A = 9/2 or A = 6. Describing the itinerary of any 
trajectory by the binary alphabet {0, 1} ('0' if the iterate 
is in the first half of the interval and ' 1 ' if it is in the 
second half), we have a repeller with a complete binary 
symbolic dynamics. 

(a) Sketch the graph of / and determine its two fixed 
points 0 and 1, along with their stabilities. 

(b) Sketch the two branches of Determine all 
the prime cycles up to topological length 4 using 
your calculator and backwards iteration of / (see 
sect. 13.2.1). 

(c) Determine the leading zero of the zeta function 
(19.15) using the weights t p = z" 1 ' /\A. p \, where A p 
is the stability of the /?-cycle. 

(d) Show that for A — 9/2 the escape rate of the 
repeller is 0.361509 . . . using the spectral deter- 
minantwith the same cycle weight. If you have 
taken A = 6, show instead that the escape rate 
is in 0.83149298..., as shown in solution 20.2. 
Compare the coefficients of the spectral determin- 
ant and the zeta function cycle expansions. Which 
expansion converges faster? 

(Per Rosenqvist) 

20.3. Escape rate for the Ulam map. (Medium; repeat of 
exercise 13.1) We will try to compute the escape rate for 
the Ulam map ( 1 1 .5) 

f(x)=4x(l-x), 

using the method of cycle expansions. The answer 
should be zero, as nothing escapes. 

(a) Compute a few of the stabilities for this map. 
Show that A 0 = 4, Ai = -2, A 0 i = -4, A 00 i = -8 
and Aon = 8. 



(b) Show that 

Ae,...e„ = ±2" 
and determine a rule for the sign. 

(c) (hard) Compute the dynamical zeta function for 
this system 

r 1 = 1 - t 0 - h ~ (foi - toh) 

Note that the convergence as a function of the 
truncation cycle length is slow. Try to fix that by 
treating the Ao = 4 cycle separately, (continued 
as exercise 20.12) 

20.4. Pinball escape rate, semi-analytical. Estimate the 
3-disk pinball escape rate for R : a = 6 by substituting 
analytical cycle stabilities and periods (see exercise 13.7 
and exercise 13.8) into the appropriate binary cycle ex- 
pansion. Compare your result with the numerical esti- 
mate exercise 17.2. 

20.5. Pinball escape rate, from numerical cycles. Com- 
pute the escape rate for the 3-disk pinball with R : a = 6 
by substituting the list of numerically computed cycle 
stabilities of exercise 13.5 into the binary cycle expan- 
sion. 

20.6. Pinball resonances in the complex plane. Plot the 
logarithm of the absolute value of the dynamical zeta 
function and/or the spectral determinant cycle expansion 
(20.5) as contour plots in the complex s plane. Do you 
find zeros other than the one corresponding to the com- 
plex one? Do you see evidence for a finite radius of 
convergence for either cycle expansion? 

20.7. Counting the 3-disk psudocycles. (continuation of 
exercise 15.12) Show that the number of terms in the 
3-disk pinball curvature expansion (20.42) is given by 



n (!+»,) - 



1 - 3z 4 - 2z 6 
1 - 3z 2 - 2z 3 



= 1 + 3z 2 + 2z 3 + 6z 4 + 12z 5 

+20z 6 + 48z 7 + 84z 8 + 184z 9 + . . . 

This means that, for example, has a total of 20 terms, 
in agreement with the explicit 3-disk cycle expansion 
(20.43). 
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20.8. 3-disk unfactorized zeta cycle expansions. Check 
that the curvature expansion (20.2) for the 3-disk pin- 
ball, assuming no symmetries between disks, is given 
by 

1/f = (1-Z 2 fl2)(l-Z 2 fl3)(l-Z 2 f 23 ) 

(1 -z 3 ?m)(l -z 3 ?i32)(l - A213) 

(1 - Z 4 ?1232)(l - Z 4 fl323)(l ~ 2^12123) ' ' ' 



■z 2 t n - 



Z 2 ?23 



■ z 2 ?3i 



r(/i23 + ?m) 

-Z 4 [(?1213 - ? 12^13) + (?1232 - ?12*23) 

+ (?1323 - tl3fe»)] (20.42) 

-Z 5 [(t 12123 - ^ 12^123) + •••]-••■ 



20.10. 



Show that the symmetrically arranged 3-disk pinball cy- 
cle expansion of the Euler product (20.2) (see table 15.5 
and figure 9. 1) is given by: 

1/f = (1 - Z 2 fl 2 ) 3 (l - Z 3 fl23) 2 (l - Z 4 fl213) 3 
(1-Z 5 fl2123) 6 (l-Z 6 fl21213) 6 20.11. 
(1-Z 6 fl21323) 3 ... (20.43) 

= 1 - 3z 2 t n - 2z 3 fi23 - 3z 4 (f 1213 - ? 2 2 ) 
-6z (?12123 - tntm) 

-Z 6 (6 1 121213 + 3 1 121323 + ? \ 2 - 9 ?12?1213 _ ^123) 
-6Z (?1212123 + ?1212313 + ?1213123 + ?12?123 
-3?i2fl2123 - fl23?1213) 
-3z 8 (2 1 12121213 + ? 12121313 + 2 f 12121323 
+2*12123123 + 2 f 12123213 + ? 12132123 

+ 3 ? 2 2 ?i2i3 + f 12? 123 ~ 6 f 12? 121213 

- 3 f 12? 121323 - 4 f 123? 12123 ~ ?1213) _ 



20.12. 



20.9. 4-disk unfactorized dynamical zeta function cycle 
expansions. For the symmetrically arranged 4-disk 
pinball, the symmetry group is C4,,, which is of order 8. 
The degenerate cycles can have multiplicities 2, 4 or 8 
(see table 15.3). Show that: 

1/f = (l-Z 2 ?12) 4 (l-r?13) 2 (l-Z 3 ?123) 8 

(1-Z 4 ?1213) 8 (1-Z 4 ?1214) 4 (1-Z 4 ?1234) 2 
(1 -Z 4 ?1243) 4 (1 -Z 5 ?12123) 8 (1 -Z 5 ?12124) 8 



20.13. 



(1-Z 5 ?12134) 8 (1-Z 5 ?12143) 8 
(1-Z 5 ?12313) 8 (1-Z 5 ?12413) 8 ' 



Show that the cycle expansion is given by 



(20.44) 



1/f = 1 - z 2 (4f 12 +2fi 3 )-8z 3 fi23 

-Z 4 (8 ?1213 + 4 1 1214 + 2 1 1234 + 4 1 1243 
-6f 2 2 - ?i 3 - 8 ?12?13) 

-8z (?12123 + ?12124 + ?12134 + ?12143 + ? 123 13 
+?12413 _ 4 1 12? 123 _ 2 f 13? 123) 



-4z 6 (2S 8 + S A + t\ 2 + 3 1\ 2 t n + t n tj 3 

-8/12/1213 -4fi2?1214 
-2/12/1234 - 4fi2?l243 
-4?13?1213 - 2 ?13?1214 - ?13?1234 
-2 ?13?1243 - 7 f 2 23 ) - ' ' ' 

where in the coefficient of z 6 ,the abbreviations 5 8 and 
S 4 stand for the sums over the weights of the 12 orbits 
with multiplicity 8 and the 5 orbits with multiplicity 4, 
respectively; the orbits are listed in table 15.5. 

Escape rate for the Rossler flow. (continuation of 
exercise 13.10) Try to compute the escape rate for the 
Rossler flow (2.18) using the method of cycle expan- 
sions. The answer should be zero, as nothing escapes. 
Ideally you should already have computed the cycles 
and have an approximate grammar, but failing that you 
can cheat a bit and peak into exercise 13.10. 

State space volume contraction, recycled. (contin- 
uation of exercise 4.3) The plot of instantaneous state 
space volume contraction as a function of time in exer- 
cise 4.3 (d) illustrates one problem of time-averaging in 
chaotic flows - the observable might vary wildly across 
each recurrence to a given Poincare section. Evaluated 
on a given short cycle, the average is crisp and arbi- 
trarily accurate. Recompute (d ■ v) by means of cycle 
expansion, study its convergence, l/t convergence of 
mindless time-averaging is now replaced by exponential 
convergence in the cycle length. 

Ulam map is conjugate to the tent map. (con- 
tinuation of exercise 20.3, repeat of exercise B.4 and 
exercise 13.2; requires real smarts, unless you look it 
up) Explain the magically simple form of cycle stabil- 
ities of exercise 20.3 by constructing an explicit smooth 
conjugacy (2.20) 

g t (yo) = hofoh- 1 (y 0 ) 



that conjug; 
(11.4). 



ates the Ulam map (11.5) into the tent map 



Continuous vs. discrete mean return time. Show 
that the expectation value (a) time-averaged over con- 
tinuous time flow is related to the corresponding average 
(fl) ( j scr measured in discrete time (e.g. , Poincare section 
returns) by (20.29): 



(20.45) 



(Hint: consider the form of their cycle expansions.) The 
mean discrete period (n) ( averaged over cycles, and the 
mean continuous time period (T^ need to be evalu- 
ated only once, thereafter one can compute either (a) 
or (a)dscr' whichever is more convenient. 



exerRecyc - 19nov2012 



ChaosBook.org versionl4.4.1, Apr21 2013 



REFERENCES 



420 



References 

[20.1] P. Cvitanovic, Phys. Rev. Lett. 61, 2729 (1988). 

[20.2] R. Artuso, E. Aurell and P. Cvitanovic, "Recycling of strange sets I: Cycle 
expansions," Nonlinearity 3, 325 (1990). 

[20.3] R. Artuso, E. Aurell and P. Cvitanovic, "Recycling of strange sets II: Ap- 
plications," Nonlinearity 3, 361 (1990). 

[20.4] S. Grossmann and S. Thomae, Z. Naturforsch. 32 a, 1353 (1977); reprinted 
in ref. [5]. 

[20.5] Universality in Chaos, P. Cvitanovic, ed., (Adam Hilger, Bristol 1989). 

[20.6] F. Christiansen, P. Cvitanovic and H.H. Rugh, J. Phys A 23, L713 (1990). 

[20.7] J. Plemelj, "Zur Theorie der Fredholmschen Funktionalgleichung," Monat. 
Math. Phys. 15, 93 (1909). 

[20.8] F. Smithies, "The Fredholm theory of integral equations," Duke Math. 8, 
107 (1941). 

[20.9] M.V. Berry, in Quantum Chaos and Statistical Nuclear Physics, ed. T.H. 
Seligman and H. Nishioka, Lecture Notes in Physics 263, 1 (Springer, Berlin, 
1986). 

[20.10] P. Cvitanovic, "Invariant measurements of strange sets in terms of cy- 
cles," Phys. Rev. Lett. 61, 2729 (1988). 

[20.11] B. Eckhardt and G. Russberg, Phys. Rev. E 47, 1578 (1993). 

[20.12] E. Aurell, 'Convergence of Dynamical Zeta Functions," /. Stat. Phys. 58, 
967 (1990). 

[20.13] P. Dahlqvist and G. Russberg, "Periodic orbit quantization of bound 
chaotic systems," /. Phys. A 24, 4763 (1991). 

[20.14] P. Dahlqvist, "Determination of resonance spectra for bound chaotic sys- 
tems," J. Phys. A 27, 763 (1994). 

[20.15] C. P. Dettmann and G. P. Morriss, Phys. Rev. Lett. 78, 4201 (1997). 

[20.16] C. P. Dettmann and P. Cvitanovic, "Cycle expansions for intermittent dif- 
fusion," Phys. Rev. E 56, 6687 (1997); arXiv : chao-dyn/9708011. 

[20.17] D. Auerbach, P. Cvitanovic, J.-P. Eckmann, G.H. Gunaratne and I. Pro- 
caccia, Phys. Rev. Lett. 58, 2387 (1987). 

[20.18] D. Ruelle, "Zeta-Functions for expanding maps and Anosov flows," Inv. 
Math. 34, 231-242 (1976). 

[20.19] W. Parry and M. Pollicott, Ann. Math. 118, 573 (1983). 



refsRecycle - 17aug99 



ChaosBook.org versionI4.4.1, Apr 21 2013 



Chapter 21 



Discrete factorization 



No endeavor that is worthwhile is simple in prospect; if it 
is right, it will be simple in retrospect. 

— Edward Teller 




he utility of discrete symmetries in reducing spectrum calculations is fa- 



miliar from quantum mechanics. Here we show that the classical spectral 



determinants factor in essentially the same way as the quantum ones. In the 
process we 1 .) learn that the classical dynamics, once recast into the language of 
evolution operators, is much closer to quantum mechanics than is apparent in the 
Newtonian, ODE formulation (linear evolution operators/PDEs, group-theoretical 
spectral decompositions, . . .), 2.) that once the symmetry group is quotiented out, 
the dynamics simplifies, and 3.) it's a triple home run: simpler symbolic dynam- 
ics, fewer cycles needed, much better convergence of cycle expansions. Once you 
master this, going back is unthinkable. 

The main result of this chapter can be stated as follows: 

If the dynamics possesses a discrete symmetry, the contribution of a cycle p 
of multiplicity m p to a dynamical zeta function factorizes into a product over the 
d a -dimensional irreducible representations D a of the symmetry group, 



where tp is the cycle weight evaluated on the relative periodic orbit p, g - \G\ is the 
order of the group, hp is the group element relating the fundamental domain cycle 
p to a segment of the full space cycle p, and m p is the multiplicity of the p cycle. 
As dynamical zeta functions have particularly simple cycle expansions, a geomet- 
rical shadowing interpretation of their convergence, and suffice for determination 
of leading eigenvalues, we shall use them to explain the group-theoretic factoriza- 
tions; the full spectral determinants can be factorized using the same techniques. 
/?-cycle into a cycle weight t p . 



(l-t p ) m " = Y\det(l-D a (h p )t p ) 



f _ t Sl' n P 

p ~ l p 



a 
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This chapter is meant to serve as a detailed guide to the computation of dynam- 
ical zeta functions and spectral determinants for systems with discrete symmetries. 
Familiarity with basic group-theoretic notions is assumed, with the definitions 
relegated to appendix 1. 1 . We develop here the cycle expansions for factorized 
determinants, and exemplify them by working two cases of physical interest: Ci = 
Di, C3 V = D3 symmetries. Ci v = D2XD2 and C^ v = D4 symmetries are discussed 
in appendix I. 



21.1 Preview 

As we saw in chapter 9, discrete symmetries relate classes of periodic orbits and 
reduce dynamics to a fundamental domain. Such symmetries simplify and im- 
prove the cycle expansions in a rather beautiful way; in classical dynamics, just 
as in quantum mechanics, the symmetrized subspaces can be probed by linear op- 
erators of different symmetries. If a linear operator commutes with the symmetry, 
it can be block-diagonalized, and, as we shall now show, the associated spectral 
determinants and dynamical zeta functions factorize. 

21.1.1 Reflection symmetric 1-d maps 

Consider /, a map on the interval with reflection symmetry f(-x) = -f(x). A 
simple example is the piecewise-linear sawtooth map of figure 9.4. Denote the 
reflection operation by Rx = -x. The symmetry of the map implies that if {x„} is a 
trajectory, than also {Rx n } is a trajectory because Rx n+ \ = Rf(x n ) = f{Rx n ) . The 
dynamics can be restricted to a fundamental domain, in this case to one half of 
the original interval; every time a trajectory leaves this interval, it can be mapped 
back using R. Furthermore, the evolution operator commutes with R, £(y, x) - 
£{Ry,Rx). R satisfies R 2 = e and can be used to decompose the state space 
into mutually orthogonal symmetric and antisymmetric subspaces by means of 
projection operators 

P At = ^(e+R), P A2 = h^-R), 
£ Al (y, x) = P Al £(y, x) = - (£(y, x) + £{-y, x)) , 
£ Al (y, x) = P A2 £(y, x) = X - (£(y, x) - £(-y, x)) . (21.1) 

To compute the traces of the symmetrization and antisymmetrization projec- 
tion operators (21. 1), we have to distinguish three kinds of cycles: asymmetric cy- 
cles a, symmetric cycles s built by repeats of irreducible segments s, and boundary 
cycles b. Now we show that the spectral determinant can be written as the prod- 
uct over the three kinds of cycles: det (1 -£) = det(l -£\,det (1 -£) s det (1 -£)b- 



symm - 13jun2008 



ChaosBook.org versionl4.4.1, Apr 21 2013 



CHAPTER 21. DISCRETE FACTORIZATION 



423 



Asymmetric cycles: A periodic orbits is not symmetric if {xj\ n {Rx a } - 0, where 
{x a } is the set of periodic points belonging to the cycle a. Thus R generates a 
second orbit with the same number of points and the same stability properties. 
Both orbits give the same contribution to the first term and no contribution to the 
second term in (21.1); as they are degenerate, the prefactor 1/2 cancels. Resum- 
ing as in the derivation of (19.15) we find that asymmetric orbits yield the same 
contribution to the symmetric and the antisymmetric subspaces: 



Symmetric cycles: A cycle s is reflection symmetric if operating with R on the 
set of periodic points reproduces the set. The period of a symmetric cycle is 
always even (n s = 2n s ) and the mirror image of the x s periodic point is reached by 
traversing the irreducible segment s of length r^, f" s (x s ) = Rx s . 5{x - f'\x)) picks 
up Iris contributions for every even traversal, n - rn s , r even, and 6(x + f"(x)) for 
every odd traversal, n = rn~ s , r odd. Absorb the group-theoretic prefactor in the 
Floquet multiplier by denning the stability computed for a segment of length iy, 



Restricting the integration to the infinitesimal neighborhood At of the s cycle, 
we obtain the contribution to tr £1 : 



Substituting all symmetric cycles s into det (1 - Xt) and resuming we obtain: 



Boundary cycles: In the example at hand there is only one cycle which is neither 
symmetric nor antisymmetric, but lies on the boundary of the fundamental do- 
main, the fixed point at the origin. Such cycle contributes simultaneously to both 




A, = - 



dx 
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5{x - f'\x)) and 8{x + f n {x)): 



z n tr£'l 



f dx z" \ (6(x - f{x)) ± Six + f n (x))) 



z n tr£'l 



— t — At' 

r=l ' b r=l o 



Boundary orbit contributions to the factorized spectral determinants follow by 
resummation: 



det 



(1 - £ + ) b = f 



k=0 



i-A. 



det (!-£_)* = [ 



1 - 



A 



2/t+l 
ft ) 



Only the even derivatives contribute to the symmetric subspace, and only the odd 
ones to the antisymmetric subspace, because the orbit lies on the boundary. 

Finally, the symmetry reduced spectral determinants follow by collecting the 
above results: 

00/ \ CO / \ DO / \ 

™-nnMnn»-£n>-& 



OO I v CO/ \ CO / 

^>=nn 4nn Mn > 



t b 



A 2k + l 

b 



(21.2) 



We shall work out the symbolic dynamics of such reflection symmetric systems in 
some detail in sect. 21.5. As reflection symmetry is essentially the only discrete 
symmetry that a map of the interval can have, this example completes the group- 
theoretic factorization of determinants and zeta functions for 1 -dimensional maps. 
We now turn to discussion of the general case. 



exercise 21.1 



21.2 Discrete symmetries 



A dynamical system is invariant under a symmetry group G = {e,gi,... ,g\c\ \ if 
the equations of motion are invariant under all symmetries g e G. For a map 
x n+ i = f(x n ) and the evolution operator £(y, x) defined by (17.25) this means 



fix) = g-'figx) 
£(y, x) = £(gy, gx) 



(21.3) 
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Bold face letters for group elements indicate a suitable representation on state 
space. For example, if a 2-dimensional map has the symmetry x\ — > —x\, X2 — > 
-0C2, the symmetry group G consists of the identity and C, a rotation by n around 
the origin. The map / must then commute with rotations by n, f(Rx) = Cf(x), 
with R given by the [2 X 2] matrix 



0 -1 



R satisfies R 2 = e and can be used to decompose the state space into mutually or- 
thogonal symmetric and antisymmetric subspaces by means of projection opera- 
tors (21.1). More generally the projection operator onto the a irreducible subspace 
of dimension d a is given by P a - (d a /\G\) J^XaQOh , where Xa{h) - tr D a (h) 
are the group characters, and the transfer operator £, splits into a sum of inequiv- 
alent irreducible subspace contributions tr £ a , 

■Ca(y,x) = TF\ y^WX(h-V,x) . (21.5) 



The prefactor d a in the above reflects the fact that a -dimensional representation 
occurs d n times. 



21.2.1 Cycle degeneracies 

Taking into account these degeneracies, the Euler product (19.15) takes the form 

~]Q-t P ) = Y\(i-t P r>. (2i.6) 



The Euler product (19.15) for the D3 symmetric 3-disk problem is given in 
(20.43). 



21.3 Dynamics in the fundamental domain 



If the dynamics is invariant under a discrete symmetry, the state space M can be 
completely tiled by the fundamental domain M and its images aM, bM, . . . under 
the action of the symmetry group G = {e, a,b,. . .}, 

aeG aeG 



symm - 13jun2008 



ChaosBook.org versionl4.4.1, Apr 21 2013 



CHAPTER 21. DISCRETE FACTORIZATION 



426 



In the above example (21.4) with symmetry group G = {e, C), the state space 
M - \x\-%2 plane} can be tiled by a fundamental domainM - {half-plane x\ > 0}, 
and CM = {half-plane x\ < 0), its image under rotation by n. 

If the space M is decomposed into g tiles, a function <p(x) over M splits into 
a g-dimensional vector <p a (x) defined by (f> a (x) = (p(x) if x e M a , cp a {x) - 0 
otherwise. Let h = ab~ l conflicts with be the symmetry operation that maps the 
endpoint domain Mb into the starting point domain M a , and let D(h)b a , the left 
regular representation, be the [g x g] matrix whose b, a-th entry equals unity if 
a = hb and zero otherwise; D{h\ a = 5bh,a- Since the symmetries act on state 
space as well, the operation h enters in two guises: as a [g x g] matrix D(h) which 
simply permutes the domain labels, and as a [d x d] matrix representation h of a 
discrete symmetry operation on the d state space coordinates. For instance, in the 
above example (21.4) h e C2 and D(h) can be either the identity or the interchange 
of the two domain labels, 

D(e) = (j J), 0(0 = ^ I). (21.7) 

Note that D(h) is a permutation matrix, mapping a tile M a into a different tile 
Mha + M a if h t e. Consequently only D(e) has diagonal elements, and trD(/i) - 
g5h, e - However, the state space transformation h + e leaves invariant sets of 
boundary points; for example, under reflection cr across a symmetry axis, the 
axis itself remains invariant. The boundary periodic orbits that belong to such 
pointwise invariant sets will require special care in tr X evaluations. 

One can associate to the evolution operator (17.25) a [g x g] matrix evolution 
operator defined by 

£ba(y, x) = D(h) ba £(y, x) , 

if x € M a and y € Mb, and zero otherwise. Now we can use the invariance 
condition (21.3) to move the starting point x into the fundamental domain x = ax, 
£,(y,x) = X.{sC x y,x), and then use the relation a~ l b = h~ l to also relate the 
endpoint y to its image in the fundamental domain, £(y, x) :- £(h~ l y, x). With 
this operator which is restricted to the fundamental domain, the global dynamics 
reduces to 

£ba(y, x) = D(h) ba £(y, x) . 

While the global trajectory runs over the full space M, the restricted trajectory is 
brought back into the fundamental domain M any time it crosses into adjoining 
tiles; the two trajectories are related by the symmetry operation h which maps the 
global endpoint into its fundamental domain image. 
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Now the traces (19.3) required for the evaluation of the eigenvalues of the 
transfer operator can be evaluated on the fundamental domain alone 

tr£= f dx£(x,x)= f dx Y tr D(h) £(h~ l x, x) (21.8) 
Jm Jm h 

The fundamental domain integral J dx £(h~ l x, x) picks up a contribution from 
every global cycle (for which h = e), but it also picks up contributions from 
shorter segments of global cycles. The permutation matrix D(h) guarantees by the 
identity tr D(h) - 0, h + e, that only those repeats of the fundamental domain 
cycles p that correspond to complete global cycles p contribute. Compare, for 
example, the contributions of the 12 and 0 cycles of figure 12.12. irD{h)£ does 
not get a contribution from the 0 cycle, as the symmetry operation that maps the 
first half of the 12 into the fundamental domain is a reflection, and tr D(cr) = 0. In 
contrast, o 2 - e, tr D(cr 2 ) = 6 insures that the repeat of the fundamental domain 
fixed point ti(D(h)£) 2 = 6t 2 , gives the correct contribution to the global trace 
tr£ 2 = 3-2? 12 . 

Let p be the full orbit, p the orbit in the fundamental domain and hp an ele- 
ment of 'Hp, the symmetry group of p. Restricting the volume integrations to the 
infinitesimal neighborhoods of the cycles p and p, respectively, and performing 
the standard resummations, we obtain the identity 

(l-tpT" = det (l - D(hp)t p ) , (21.9) 

valid cycle by cycle in the Euler products (19. 15) for det (1 - £). Here "det" refers 
to the [g x g] matrix representation D{hp); as we shall see, this determinant can be 
evaluated in terms of standard characters, and no explicit representation of DQp) 
is needed. Finally, if a cycle p is invariant under the symmetry subgroup 'Hp £ G 
of order h p , its weight can be written as a repetition of a fundamental domain 
cycle 

t p = t h f (21.10) 

computed on the irreducible segment that corresponds to a fundamental domain 
cycle. For example, in figure 12. 12 we see by inspection that tu = and ?i23 = . 

21.3.1 Boundary orbits 

Before we can turn to a presentation of the factorizations of dynamical zeta func- 
tions for the different symmetries we have to discuss an effect that arises for orbits 
that run on a symmetry line that borders a fundamental domain. In our 3-disk 
example, no such orbits are possible, but they exist in other systems, such as in 
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the bounded region of the Henon-Heiles potential and in 1-d maps. For the sym- 
metrical 4-disk billiard, there are in principle two kinds of such orbits, one kind 
bouncing back and forth between two diagonally opposed disks and the other kind 
moving along the other axis of reflection symmetry; the latter exists for bounded 
systems only. While there are typically very few boundary orbits, they tend to be 
among the shortest orbits, and their neglect can seriously degrade the convergence 
of cycle expansions, as those are dominated by the shortest cycles. 

While such orbits are invariant under some symmetry operations, their neigh- 
borhoods are not. This affects the Jacobian matrix M p of the linearization perpen- 
dicular to the orbit and thus the eigenvalues. Typically, e.g. if the symmetry is 
a reflection, some eigenvalues of M p change sign. This means that instead of a 
weight 1 /det (1 - M p ) as for a regular orbit, boundary cycles also pick up contri- 
butions of form 1 /det (1 - hM p ), where h is a symmetry operation that leaves the 
orbit pointwise invariant; see for example sect. 21.1.1. 

Consequences for the dynamical zeta function factorizations are that some- 
times a boundary orbit does not contribute. A derivation of a dynamical zeta 
function (19.15) from a determinant like (19.9) usually starts with an expansion 
of the determinants of the Jacobian. The leading order terms just contain the prod- 
uct of the expanding eigenvalues and lead to the dynamical zeta function (19.15). 
Next to leading order terms contain products of expanding and contracting eigen- 
values and are sensitive to their signs. Clearly, the weights t p in the dynamical 
zeta function will then be affected by reflections in the Poincare surface of section 
perpendicular to the orbit. In all our applications it was possible to implement 
these effects by the following simple prescription. 

If an orbit is invariant under a little group ^H p - \e,bi,..., bh), then the cor- 
responding group element in (21.9) will be replaced by a projector. If the weights 
are insensitive to the signs of the eigenvalues, then this projector is 



In the cases that we have considered, the change of sign may be taken into account 
by defining a sign function e p (g) = ±1, with the "-" sign if the symmetry element 
g flips the neighborhood. Then (21.11) is replaced by 



We have illustrated the above in sect. 21.1.1 by working out the full factorization 
for the 1-dimensional reflection symmetric maps. 




(21.11) 



1=1 




(21.12) 
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21.4 Factorizations of dynamical zeta functions 

In chapter 9 we have shown that a discrete symmetry induces degeneracies among 
periodic orbits and decomposes periodic orbits into repetitions of irreducible seg- 
ments; this reduction to a fundamental domain furthermore leads to a convenient 
symbolic dynamics compatible with the symmetry, and, most importantly, to a 
factorization of dynamical zeta functions. This we now develop, first in a general 
setting and then for specific examples. 

21.4.1 Factorizations of dynamical dynamical zeta functions 

According to (21.9) and (21.10), the contribution of a degenerate class of global 
cycles (cycle p with multiplicity m p = g/h p ) to a dynamical zeta function is given 
by the corresponding fundamental domain cycle p: 



Let D(h) = (J) a d a D a (h) be the decomposition of the matrix representation D(h) 
into the d a dimensional irreducible representations a of a finite group G. Such 
decompositions are block-diagonal, so the corresponding contribution to the Euler 
product (19.9) factorizes as 



where now the product extends over all distinct ^-dimensional irreducible rep- 
resentations, each contributing da times. For the cycle expansion purposes, it 
has been convenient to emphasize that the group-theoretic factorization can be ef- 
fected cycle by cycle, as in (21.13); but from the transfer operator point of view, 
the key observation is that the symmetry reduces the transfer operator to a block 
diagonal form; this block diagonalization implies that the dynamical zeta func- 
tions (19.15) factorize as 



Determinants of cf-dimensional irreducible representations can be evaluated 
using the expansion of determinants in terms of traces, 




(21.13) 




(21.14) 



(i 




(21.15) 



det(l +M) 




(21.16) 
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and each factor in (21.14) can be evaluated by looking up the characters Xa{h) - 
tr D a (h) in standard tables [18]. In terms of characters, we have for the 1-dimensional 
representations 

det(l -D a (h)t) = 1 -Xa(fr)t , 

for the 2-dimensional representations 

det (1 - D a {h)t) = 1 - Xa {h)t + l - ( Xa (h) 2 - Xa(h 2 )) t 1 , 
and so forth. 

In the fully symmetric subspace tr Da,(/j) = 1 for all orbits; hence a straight- 
forward fundamental domain computation (with no group theory weights) always 
yields a part of the full spectrum. In practice this is the most interesting subspec- 
trum, as it contains the leading eigenvalue of the transfer operator. 



tr L a = d a V Xa(h) I dx £(h 1 x, x). 



This leads by standard manipulations to the factorization of (19.9) into 



F a (z) = exp 



,rh r m (i-^)i 



(21.17) 



where Mp = \ipMp is the fundamental domain Jacobian. Boundary orbits re- 
quire special treatment, discussed in sect. 21.3.1, with examples given in the next 
section as well as in the specific factorizations discussed below. 



The factorizations (21.15), (21.17) are the central formulas of this chapter. 
We now work out the group theory factorizations of cycle expansions of dynam- 
ical zeta functions for the cases of C2 and D3 symmetries. The cases of the D2, 
D4 symmetries are worked out in appendix I below. 



exercise 21 .2 



21.4.2 Factorizations of spectral determinants 

Factorization of the full spectral determinant (19.3) proceeds in essentially the 
same manner as the factorization of dynamical zeta functions outlined above. By 
(21.5) and (21.8) the trace of the transfer operator X splits into the sum of inequiv- 
alent irreducible subspace contributions Y>a ti'X ff , with 
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21.5 C2 factorization 

As the simplest example of implementing the above scheme consider the Ci sym- 
metry. For our purposes, all that we need to know here is that each orbit or configu- 
ration is uniquely labeled by an infinite string {$}, Sj = +,- and that the dynamics 
is invariant under the + «-> - interchange, i.e., it is C2 symmetric. The C2 sym- 
metry cycles separate into two classes, the self-dual configurations -1 — , + -1 , 

+ + H , -I 1 h H — with multiplicity m p = 1, and the asymmetric 

configurations +,-, + + -, h, • • •, with multiplicity m p = 2. For example, 

as there is no absolute distinction between the "up" and the "down" spins, or the 

"left" or the "right" lobe, t+ — t-, t++- = t+ , and so on. 

b exercise 21.5 

The symmetry reduced labeling p,- e {0, 1} is related to the standard Si e {+, -} 
Ising spin labeling by 

If Si = Si-i then p, = 1 

If Si * Si-\ then Pi = 0 (21.18) 

For example, + = ••• + + + -1 maps into • • • 111 • • • = 1 (and so does -), 

~ -..._ + _ H maps into • • • 000 • • • = 0, - + +- = • • ■ — + + h H 

maps into • • • 0101 • • • = 01, and so forth. A list of such reductions is given in 
table 12.1. 

Depending on the maximal symmetry group 1-i p that leaves an orbit p invariant 
(see sects. 21.2 and 21.3 as well as sect. 21.1.1), the contributions to the dynamical 
zeta function factor as 

Ai A 2 

<H p = {e}: (l-tp) 2 - (1-tpXl-tp) 

n p = {e,<T\: a-tp) - a-tp)a+tp), (21.19) 

For example: 

(l-f ++ -) 2 - (l-?ooi)(l-?ooi) 
= [e, or}: (1 - f+_) - (1 - to) (1 + to), f+_ = tl 

This yields two binary cycle expansions. The A\ subspace dynamical zeta function 
is given by the standard binary expansion (20.7). The antisymmetric A2 subspace 
dynamical zeta function ^ 2 differs from only by a minus sign for cycles with 
an odd number of 0's: 

\IU 2 = (1 + to)(l - h)(l + t 10 )(l - ?ioo)(l + fioi)(l + '1000) 
(1 - ?iooi )(1 + ?ioii)(l - ?ioooo)(l + ^10001) 
(1 + ?iooio)(l - ?ioon)(l _ ?ioioi)(l + hom) ■ ■ ■ 
= 1 + to - h + (?io - hto) - (?ioo - hoto) + (hoi - hoh) 

-(?iooi - hkm - ho\to + hotoh) - (21.20) 
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Note that the group theory factors do not destroy the curvature corrections (the 
cycles and pseudo cycles are still arranged into shadowing combinations). 

If the system under consideration has a boundary orbit (cf. sect.21.3.1) with 
group-theoretic factor h p = (e + cr)/2, the boundary orbit does not contribute to 
the antisymmetric subspace 



This is the 1/^part of the boundary orbit factorization of sect. 2 1.1.1. 



21.6 D 3 factorization: 3-disk game of pinball 

The next example, the D3 symmetry, can be worked out by a glance at figure 1 2. 1 2 (a). 
For the symmetric 3-disk game of pinball the fundamental domain is bounded by 
a disk segment and the two adjacent sections of the symmetry axes that act as 
mirrors (see figure 12.12(b)). The three symmetry axes divide the space into six 
copies of the fundamental domain. Any trajectory on the full space can be pieced 
together from bounces in the fundamental domain, with symmetry axes replaced 
by flat mirror reflections. The binary {0,1} reduction of the ternary three disk 
{1, 2, 3} labels has a simple geometric interpretation: a collision of type 0 reflects 
the projectile to the disk it comes from (back-scatter), whereas after a collision 
of type 1 projectile continues to the third disk. For example, 23 = • • • 232323 • • • 
maps into • • • 000 • • • = 0 (and so do 12 and 13), 123 = • • ■ 12312 • • • maps into 
• • • 111 • • • = 1 (and so does 132), and so forth. A list of such reductions for short 
cycles is given in table 12.2. 

D3 has two 1 -dimensional irreducible representations, symmetric and anti- 
symmetric under reflections, denoted A\ and A%, and a pair of degenerate 2- 
dimensional representations of mixed symmetry, denoted E. The contribution 
of an orbit with symmetry g to the l/g Euler product (21.14) factorizes according 
to 



det(l-D(A)0 - (1 -XM(h)t)(l ~XA 2 (h)t)(l - X eW + XA 2 (h)t 2 ) (21.22) 



with the three factors contributing to the D$ irreducible representations A\, A2 
and E, respectively, and the 3-disk dynamical zeta function factorizes into f = 
%A\£a 2 £%- Substituting the D3 characters [18] 



boundary: (1 - t p ) 



Ay A 2 
(1 - t p ){\ - 0t p ) 



(21.21) 



O3 



Ai A 2 E 



C, C 2 



e 



1 1 2 
1 1 -1 
1 -1 0 
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into (21.22), we obtain for the three classes of possible orbit symmetries (indicated 
in the first column) 



bp 






A x A 2 E 


■ (1 


-tpf -- 


~- (1 


-tp)(l-t p )(l-2t p + tp 2 


: (1 


p> 


= (1 


- t p )(l - t p )(l + t p + tjf 


: (1 


-t 2 ) 3 -- 
p> 


= (1 


-tp){\+tp){\+Q)tp-tpf 



c, c z 



where tr v stands for any one of the three reflections. 



(21.23) 



The Euler product (19.15) on each irreducible subspace follows from the fac- 
torization (21.23). On the symmetric A\ subspace the &i is given by the standard 
binary curvature expansion (20.7). The antisymmetric Ai subspace £4? differs 
from only by a minus sign for cycles with an odd number of 0's, and is given 
in (21.20). For the mixed-symmetry subspace E the curvature expansion is given 
by 

1/& = (1 + zh + z 2 t 2 ){\ - z 2 t 2 0 )(l + z 3 t m + z 6 t 2 m )(l - z 4 t 2 0 ) 
(1 +z 4 fiooi +z t\ mi )(l +z 5 fioooo +z 10f ioooo) 
(1 +z fioioi + z w t 2 wm ){\ - z 5 fioon) 2 ■ • ■ 
- 1 + zti + z 2 (t 2 - t 2 0 ) + z 3 (?ooi - ht 2 a ) 
+z 4 [?oon + ( f ooi - ht 2 ))t\ - 

+z 5 [fooooi + ^oioii - 2?ooin + (?oon - tli)h + (t 2 - ?Q)?io^|fe24) 

We have reinserted the powers of z in order to group together cycles and pseudo- 
cycles of the same length. Note that the factorized cycle expansions retain the 
curvature form; long cycles are still shadowed by (somewhat less obvious) com- 
binations of pseudo-cycles. 

Referring back to the topological polynomial (15.40) obtained by setting t p = 
1, we see that its factorization is a consequence of the Eh, factorization of the £ 
function: 

MUy = 1 - 2z , l/Ui = 1 . 1/& = 1 + z , (21.25) 

as obtained from (20.7), (21.20) and (21.24) for t p = 1. 

Their symmetry is K = {e, o~}, so according to (21.11), they pick up the group- 
theoretic factor hp = (e + a) 12. If there is no sign change in t p , then evaluation of 
det(l - $±ftp) yields 



boundary: (1 - t p f = (1 - t p )(l - 0t p )(l - tp) 2 , t p = t p . (21.26) 
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However, if the cycle weight changes sign under reflection, ta-p - -tp, the bound- 
ary orbit does not contribute to the subspace symmetric under reflection across the 
orbit; 

At A 2 E 

boundary: (1 - t p f = (1 - Qt p ){\ - tp){\ - tpf , t p = tp . (21.27) 

Resume 

If a dynamical system has a discrete symmetry, the symmetry should be exploited; 
much is gained, both in understanding of the spectra and ease of their evaluation. 
Once this is appreciated, it is hard to conceive of a calculation without factor- 
ization; it would correspond to quantum mechanical calculations without wave- 
function symmetrizations. 

While the reformulation of the chaotic spectroscopy from the trace sums to 
the cycle expansions does not reduce the exponential growth in number of cycles 
with the cycle length, in practice only the short orbits are used, and for them the 
labor saving is dramatic. For example, for the 3-disk game of pinball there are 
256 periodic points of length 8, but reduction to the fundamental domain non- 
degenerate prime cycles reduces the number of the distinct cycles of length 8 to 
30. 

In addition, cycle expansions of the symmetry reduced dynamical zeta func- 
tions converge dramatically faster than the unfactorized dynamical zeta functions. 
One reason is that the unfactorized dynamical zeta function has many closely 
spaced zeros and zeros of multiplicity higher than one; since the cycle expansion 
is a polynomial expansion in topological cycle length, accommodating such be- 
havior requires many terms. The dynamical zeta functions on separate subspaces 
have more evenly and widely spaced zeros, are smoother, do not have symmetry- 
induced multiple zeros, and fewer cycle expansion terms (short cycle truncations) 
suffice to determine them. Furthermore, the cycles in the fundamental domain 
sample state space more densely than in the full space. For example, for the 3- 
disk problem, there are 9 distinct (symmetry unrelated) cycles of length 7 or less 
in full space, corresponding to 47 distinct periodic points. In the fundamental 
domain, we have 8 (distinct) periodic orbits up to length 4 and thus 22 different 
periodic points in 1/6-th the state space, i.e., an increase in density by a factor 3 
with the same numerical effort. 

We emphasize that the symmetry factorization (21.23) of the dynamical zeta 
function is intrinsic to the classical dynamics, and not a special property of quantal 
spectra. The factorization is not restricted to the Hamiltonian systems, or only 
to the configuration space symmetries; for example, the discrete symmetry can 
be a symmetry of the Hamiltonian phase space [4]. In conclusion, the manifold 
advantages of the symmetry reduced dynamics should thus be obvious; full state 
space cycle expansions, such as those of exercise 20.8, are useful only for cross- 
checking purposes. 
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Commentary 



Remark 21.1 Symmetry reductions in periodic orbit theory. This chapter is based 
on a collaborative effort with B. Eckhardt, ref. [1]. The group-theoretic factorizations of 
dynamical zeta functions that we develop here were first introduced and applied in ref. [ 2]. 
They are closely related to the symmetrizations introduced by Gutzwiller [ 3] in the context 
of the semiclassical periodic orbit trace formulas, put into more general group-theoretic 
context by Robbins [ ], whose exposition, together with Lauritzen's [5] treatment of the 
boundary orbits, has influenced the presentation given here. The symmetry reduced trace 
formula for a finite symmetry group G = {e,g2, ■ ■ ■ ,g\c\] with \G\ group elements, where 
the integral over Haar measure is replaced by a finite group discrete sum |G| _1 2t?ec = 1 ' 
was derived in ref. [ 1 ] . A related group-theoretic decomposition in context of hyperbolic 
billiards was utilized in ref. [6], and for the Selberg's zeta function in ref. [7]. One of its 
loftier antecedents is the Artin factorization formula of algebraic number theory, which 
expresses the zeta-function of a finite extension of a given field as a product of L-functions 
over all irreducible representations of the corresponding Galois group. 

The techniques of this chapter have been applied to computations of the 3-disk clas- 
sical and quantum spectra in refs. [8, 9], and to a "Zeeman effect" pinball and the x 2 y 2 
potentials in ref. [10, 11]. In a larger perspective, the factorizations developed above are 
special cases of a general approach to exploiting the group-theoretic invariances in spec- 
tra computations, such as those used in enumeration of periodic geodesies [6, 12, 13] for 
hyperbolic billiards [14] and Selberg zeta functions [15]. 

Remark 21.2 Other symmetries. In addition to the symmetries exploited here, time 
reversal symmetry and a variety of other non-trivial discrete symmetries can induce fur- 
ther relations among orbits; we shall point out several of examples of cycle degeneracies 
under time reversal. We do not know whether such symmetries can be exploited for fur- 
ther improvements of cycle expansions. 



Exercises 



21.1. Sawtooth map desymmetrization. Work out the 



Show that if a is a 2-dimensional representation, this is 
the An antisymmetric representation, and 



some of the shortest global cycles of different symme- 
tries and fundamental domain cycles for the sawtooth 
map of figure 9.4. Compute the dynamical zeta function 
and the spectral determinant of the Perron-Frobenius 
operator for this map; check explicitly the factorization 
(21.2). 



21.3. Characters of D3. (continued from exer- 



cise 9.5) D3 = C3,,, the group of symmetries of an equi- 
lateral triangle: has three irreducible representations, 
two one-dimensional and the other one of multiplicity 



2-dim: det (1-D a (h)t) = l-x a (h)t+XA 2 (h)t 2 .(21.28) 



21.2. 2-dimensional asymmetric representation. The 

above expressions can sometimes be simplified further 
using standard group-theoretical methods. For example, 



2. 



(a) All finite discrete groups are isomorphic to a per- 
mutation group or one of its subgroups, and ele- 
ments of the permutation group can be expressed 




antisymmetric part of the M x M Kronecker product. 
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as cycles. Express the elements of the group D3 
as cycles. For example, one of the rotations is 
(123), meaning that vertex 1 maps to 2, 2 — > 3, 
and 3 — > 1 . 

(b) Use your representation from exercise 9.5 to com- 
pute the D3 character table. 

(c) Use a more elegant method from the group-theory 
literature to verify your D 3 character table. 

(d) Two D3 irreducible representations are one dimen- 
sional and the third one of multiplicity 2 is formed 
by [2x2] matrices. Find the matrices for all six 
group elements in this representation. 

(Hint: get yourself a good textbook, like Hamermesh [ 2] 
or Tinkham [16], and read up on classes and charac- 
ters.) 

21.4. 3-disk desymmetrization. 



a) Work out the 3-disk symmetry factorization for 
the 0 and 1 cycles, i.e. which symmetry do they 
have, what is the degeneracy in full space and how 
do they factorize (how do they look in the A 1 , A 2 
and the E representations). 

b) Find the shortest cycle with no symmetries and 
factorize it as in a) 

c) Find the shortest cycle that has the property that 
its time reversal is not described by the same sym- 
bolic dynamics. 

d) Compute the dynamical zeta functions and the 
spectral determinants (symbolically) in the three 



21.5. 



21.6. 



21.7. 



representations; check the factorizations (21.15) 
and (21.17). 

(Per Rosenqvist) 

C2 factorizations: the Lorenz and Ising systems. In 

the Lorenz system the labels + and - stand for the left or 
the right lobe of the attractor and the symmetry is a rota- 
tion by n around the z-axis. Similarly, the Ising Hamil- 
tonian (in the absence of an external magnetic field) is 
invariant under spin flip. Work out the factorizations for 
some of the short cycles in either system. 

Ising model. The Ising model with two states e, = 
{+, -} per site, periodic boundary condition, and Hamil- 
tonian 



H(e) = ~JJ] s *a 



is invariant under spin-flip: +<-»—. Take advantage of 
that symmetry and factorize the dynamical zeta function 
for the model, i.e., find all the periodic orbits that con- 
tribute to each factor and their weights. 

One orbit contribution. If p is an orbit in the fun- 
damental domain with symmetry h, show that it con- 
tributes to the spectral determinant with a factor 



det 



where D(h) is the representation of h in the regular rep- 
resentation of the group. 
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What you know now is to partition topologically and invariantly the state space, 
compute a hierarchy of cycles, compute spectral determinants and their eigen- 
values. What next? 

1. Why cycle? (Chapter 22) 

2. Why does it work? (Chapter 23) 

3. When does it not work? (Chapter 24) 

4. When does it work? Deterministic diffusion and foundations of 'far for equilibrium' 
statistical mechanics (Chapter 25) 
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Chapter 22 



Why cycle? 



"Progress was a labyrinth ... people plunging blindly in 
and then rushing wildly back, shouting that they had found 
it ... the invisible king - the elan vital - the principle of 
evolution ... writing a book, starting a war, founding a 
school...." 



— F. Scott Fitzgerald, This Side of Paradise 



In the preceding chapters we have moved rather briskly through the evolution 
operator formalism. Here we slow down in order to develop some fingertip 
feeling for the traces of evolution operators. It is a melancholy task, as the 
"intuition" garnered by these heuristic approximations is in all ways inferior to 
the straightforward and exact theory developed so far. But, it has to be done, as 
there is immense literature out there that deploys these heuristic estimates, most 
of it of it uninspired, some of it plain wrong, and the reader should be able to 
understand and sort through that literature. We start out by explaining qualitatively 
how local exponential instability of topologically distinct trajectories leads to a 
global exponential instability. 



22.1 Escape rates 

We start by verifying the claim (17.1 1) that for a nice hyperbolic flow the trace of 
the evolution operator grows exponentially with time. Consider again the game 
of pinball in figure 1.1. Designate by M a region of state space that encloses the 
three disks, such as the surface of the table along with all pinball directions. The 
fraction of initial points whose trajectories start within M and recur within that 
region at the time t is given by 




(22.1) 
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This quantity is both measurable and physically interesting in a variety of prob- 
lems spanning nuclear physics to celestial mechanics. The integral over x takes 
care of all possible initial pinballs; the integral over y checks whether they are 
still within M by time t. If the dynamics is bounded, and M envelops the entire 
accessible state space, Tj^{t) = 1 for all t. However, if trajectories exit At, the 
recurrence fraction decreases with time. For example, any trajectory that falls off 
the pinball table in figure 1 . 1 is gone for good. 

These observations can be made more concrete by examining the pinball 
phase-space of figure 1.9. With each pinball bounce the initial conditions that 
survive get thinned out, each strip yielding two thinner strips within it. The total 
fraction of survivors (1.2) after n bounces is given by 



, in) 

f » = \M\^ lMil ' (222) 



where i is a binary label of the ith strip, and \A\\ is the area of the ith strip. Phase- 
space volume is preserved by the flow, so the strips of survivors are contracted 
along the stable eigen-directions and ejected along the unstable eigen-directions. 
As a crude estimate of the number of survivors in the ith strip, assume that a ray 
of trajectories spreads by a factor A after every bounce. The quantity A represents 
the mean value of the expanding eigenvalue of the corresponding Jacobian matrix 
of the flow. We replace |Afj| by the phase-space strip width estimate |Atf|/|At| ~ 
1/A,-, which is right in spirit but not without drawbacks. For example, in general 
the eigenvalues of a Jacobian matrix for a finite segment of a trajectory have no 
invariant meaning; they depend on the choice of coordinates. However, we saw 
in chapter 18 that neighborhood sizes are determined by Floquet multipliers of 
periodic points, which are invariant under smooth coordinate transformations. 

In the approximation t„ receives 2" contributions of equal size 

f i ~ | + | , • • • , f „ ~ ^ = e-"W = e-»T , (22.3) 
A A A" 



up to pre-exponential factors. We see here the interplay of the two key ingredients 
of chaos first mentioned in sect. 1.3.1: the escape rate y equals the local expansion 
rate (the Lyapunov exponent A - In A) minus the rate of global reinjection back 
into the system (the topological entropy h = In 2). 

At each bounce one routinely loses the same fraction of trajectories, so one 
expects the sum (22.2) to decay exponentially with n. More precisely, by the 
hyperbolicity assumption of sect. 18.1.1, the expanding eigenvalue of the Jacobian 
matrix of the flow is exponentially bounded from both above and below, 

1 < |A™„| < |A(x)| < lA,^ , (22.4) 
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and the area of each strip in (22.2) is bounded by \A m n ax \ < |At/| < |A~" |. Replac- 
ing \Mj\ in (22.2) by its estimates in terms of |A,„ ar | and |A m ,„| immediately leads 
to exponential bounds (2/|A, m , v |)" < T„ < (2/|A m! „|)", i.e., 



1 . 

ln|A mu ,|-ln2 > — lnT ;1 > In |A mi „| - In 2 . (22.5) 
n 



The argument based on (22.5) establishes only that the sequence y n - —j- lnT,, 
has a lower and an upper bound for any n. In order to prove that % converge to the 
limit y, we first show that for hyperbolic systems the sum over surviving intervals 
(22.2) can be replaced by a sum over periodic orbit stabilities. By (22.4) the size 
of the strip Mi can be bounded by the stability A of the ith periodic point: 



Cl _L<^d <C2 J_, (22 . 6) 
|A«| \M\ 2 |A;I 



for any periodic point i of period n, with constants Cj dependent on the dynamical 
system but independent of n. The meaning of these bounds is that for increasingly 
long cycles in a system of bounded hyperbolicity, the shrinking of the ith strip is 
better approximated by the derivatives evaluated on the periodic point within the 
strip. Hence, the survival probability can be bounded close to the periodic point 
stability sum 



(n) 



\ I At;i 

Cl r " < Z TM < Cl r " ' (22 - 7) 



I All 



(n) 

where Y n = £ 1/|A,-| is the asymptotic trace sum (18.26). This establishes that 
for hyperbolic systems the survival probability sum (22.2) can be replaced by the 
periodic orbit sum (18.26). 

We conclude that for hyperbolic, locally unstable flows the fraction (22.1) of 
initial x whose trajectories remain trapped within At up to time t is expected to 
decay exponentially, 

T M (t) oc e-*, 

where y is the asymptotic escape rate defined by 



y = - \im-\nT M {t). (22.8) 



exercise 22.1 
exercise 16.4 
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22.2 Natural measure in terms of periodic orbits 



Let us now refine the reasoning of sect. 22. 1 and argue that the trace is a discretized 
integral over state space. Consider the trace (18.7) in the large time limit (18.25): 



trX" = J dx6(x-f n (x)) ^ A " {x) * 2 



The factor 1/|A,| was interpreted in (22.2) as the area of the ith phase-space strip. 
Hence, the trX" represents a discrete version of { dxeP A "^ approximated by a 
tessellation into strips centered on periodic points x;, (see figure 1.11), with the 
volume of the ith neighborhood given by estimate ~ 1/|A,|, and e^ A "W es- 
timated by e^ A "(- T '), its value at the ith periodic point. If the symbolic dynam- 
ics is complete, any state space rectangle [s- m ■ ■ ■ so-*i*2 ••• s n ] always contains 
the periodic point s- m • ■ • $0*1*2 hence, although the periodic points are of 

measure zero (just like rationals in the unit interval), they are dense on the non- 
wandering set. Equipped with a measure for the associated rectangles, periodic 
orbits suffice to cover the entire non-wandering set. The average of S A " evaluated 
on the non-wandering set is therefore given by the trace, properly normalized so 
that(l) - 1: 



Z™ l/IA, 

Here /j.; is the normalized natural measure 



(n) 



section 12.3.1 



section 17.3 



2^< = 1. m = ^'/\Ai\, (22.10) 



which is correct both for closed systems as well as open systems. 

Unlike brute numerical slicing of the integration space into an arbitrary lattice 
(for a critique, see sect. 16.3), periodic orbit theory is smart, as it automatically 
partitions integrals according to the intrinsic topology of the flow, and assigns to 
each tile i the invariant natural measure 



22.2.1 Unstable periodic orbits are dense 



(L. Rondoni and P. Cvitanovic) 

Our goal in sect. 17.1 was to evaluate the space and time averaged expectation 
value (17.9). An average over all periodic orbits can accomplish the job only if 
the periodic orbits fully explore the asymptotically accessible state space. 
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Why should unstable periodic points end up being dense? The cycles are intu- 
itively expected to be dense because on a connected chaotic set a typical trajectory 
is expected to behave ergodically, and infinitely many times pass arbitrarily close 
to any point on the set (including the initial point of the trajectory itself). The 
argument proceeds more or less as follows. Partition At in arbitrarily small re- 
gions and consider particles that start in the region At , and return to it in n steps 
after some peregrination in the state space. For example, a trajectory might re- 
turn a little to the left of its original position, whereas a nearby neighbor might 
return a little to the right of its original position. By assumption, the flow is con- 
tinuous, so generically one expects to be able to gently move the initial point in 
such a way that the trajectory returns precisely to the initial point, i.e., one ex- 
pects a periodic point of period n in cell i. As we diminish the size of regions Af , 
aiming a trajectory that returns to At becomes increasingly difficult. Therefore, 
we are guaranteed that unstable orbits of increasingly large periods are densely 
interspersed in the asymptotic non-wandering set. 

The above argument is heuristic, by no means guaranteed to work, and it must 
be checked for the particular system at hand. A variety of ergodic but insuffi- 
ciently mixing counter-examples can be constructed - the most familiar being a 
quasiperiodic motion on a torus. 



22.3 Correlation functions 

The time correlation function Cab(0 of two observables A and B along the trajec- 
tory x{t) = f(xo) is defined as 



If the system is ergodic, with invariant continuous measure p){x)dx, then correla- 
tion functions do not depend on xq (apart from a set of zero measure), and may be 
computed by a state space average as well, 



For a chaotic system we expect that time evolution will lose the information con- 
tained in the initial conditions, so that Cab(0 will approach the uncorrelated limit 
{A) ■ (B). As a matter of fact the asymptotic decay of correlation functions 



for any pair of observables coincides with the definition of mixing, a fundamental 
property in ergodic theory. We now assume without loss of generality that (B) = 0. 






(22.12) 



Cab := C AB -{A){B) 



(22.13) 
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(Otherwise we may define a new observable by B{x) - (B).) Our purpose is now 
to connect the asymptotic behavior of correlation functions with the spectrum of 
the Perron-Frobenius operator £. We can write (22.12) as 

C AB ® = f dx f dyA(y)B(x)p 0 (x)S(y-f t (x)) 
JM JM 

and recover the evolution operator 

C AB (t) = [ dx { dyA(y)£ t (y,x)B(x)p 0 (x). 
JM JM 

Recall sect. 16.1, where we showed that p(x) is the eigenvector of £, corre- 
sponding to probability conservation: 

I dy £\x,y)p(y) = p(x) . 
JM 

We can expand the x-dependent part of this equation in terms of the eigenbasis of 
£: 

oo 

B(x)p 0 (x) = ^ C a pa(x) , 
a=0 

where po(x) is the natural measure. Since the average of the left hand side is zero 
the coefficient c$ must vanish. The action of £, can then be written as 

Cab® = Ye-*'c a f dy A(y) Pa (y). (22.14) 

exercise 22.2 

We see immediately that if the spectrum has a gap, i.e., if the second largest 
leading eigenvalue is isolated from the largest eigenvalue ($ = 0) then (22.14) 
implies exponential decay of correlations 

Cab® ~ e- vt . 

The conelation decay rate v = s\ then depends only on intrinsic properties of the 
dynamical system (the position of the next-to-leading eigenvalue of the Perron- 
Frobenius operator), and the choice of a particular observable influences only the 
prefactor. 

Correlation functions are often accessible from time series measurable in lab- 
oratory experiments and numerical simulations; moreover, they are intimately 
linked to transport exponents. 
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22.4 Trace formulas vs. level sums 



Benoit B. Mandelbrot: "I would be perfectly happy being 
Kepler" [to a coming fractals' Newton]. Referring to the 
broad array of things now described by fractals, he added, 
"I have been Kepler many times over." 

— J. Gleick, New York Times, January 22, 1985 



Trace formulas (18.10) and (18.23) diverge precisely where one would 
like to use them, at s equal to eigenvalues s a . To avoid this divergence, one can 
proceed as follows; according to (18.27) the "level" sums (all symbol strings of 
length n) are asymptotically dominated by the leading eigenvalue e* 0 " of the evo- 
lution operator 



> e Son 



so an nth order estimate S(„) of the leading eigenvalue so is fixed by the condition 

„/3A"(.Vi) p -s {ll) n 

'= Z S5— < 22 -' 5 ' 

ieFix/" 1 " 

The eigenvalue condition for the level sum (22. 1 5) can be written in the same form 
as the two conditions (20.18) and (20.19) given so far: 

(») 

0=1-^/,-, ti = Uifi, s{p)) , m = n. (22. 16) 



We do not recommended it as a computational method. The difficulty in estimat- 
ing the leading eigenvalue sq from this n — > oo limit is at least twofold: 

1 . Due to an exponential growth in the number of intervals and an exponential 
decrease in the attainable accuracy, the maximum n, achieved experimentally or 
numerically, is approximately between 5 and 20. 

2. The pre-asymptotic sequence of finite estimates s( n ) is not unique, because 
the sums T n depend on how we define the escape region, and because in gen- 
eral the areas \Mi\ in the sum (22.2) should be weighted by the density of initial 
conditions p(0). For example, an overall measuring unit rescaling |7Vf| — > a\Mj\ 
introduces l/n corrections in defined by the log of the sum (22.8): S(„) — » 
s^) + In a In. This problem can be ameliorated by defining a level average as a 
function of s, 

ieFix f" 1 " 
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and determining the nth level estimate s^ n ) by requiring that the ratios of successive 
levels satisfy 




This avoids the worst problem with formula (22.15), the \/n corrections due to 
its lack of rescaling invariance. However, even though much published ponder- 
ing of "chaos" relies on it, there is no need for such gymnastics: dynamical zeta 
functions and spectral determinants are already invariant not only under linear 
rescalings, but under all smooth nonlinear conjugacies x — > h(x), and require no 
n — > oo extrapolations to asymptotic times. Comparing this with cycle expansions 
(20.7), we see the difference; in the level sum approach, we keep increasing expo- 
nentially the number of terms with no reference to the fact that most are already 
known from shorter estimates, but in cycle expansions short terms dominate and 
longer ones enter only as exponentially small corrections. 



22.4.1 Flow conservation sum rules 



The trace formula version of the flow conservation sum rule (20. 17) comes in two 
varieties (one for maps and another for flows). By flow conservation, the leading 
eigenvalue is so = 0, which for maps (22.16) yields 

tr£" = V = 1 + e sin + ... . (22.18) 

*- Zj det (1 - M n ( Xi ))\ 

For flows, one can apply this rule by grouping together cycles from t - T to 
t = T + AT 



AT |det(l-M£)| ArJ T V ) 

1 _°° s a T 

= 1 + — V — (e s " AT - l) « 1 + e SlT + ■■■ .(22.19) 



As is usual for fixed level trace sums, the convergence of (22.18) is controlled 
by the gap between the leading and next-to-leading eigenvalues of the evolution 
operator. 



Resume 



We conclude this chapter by a general comment on the relation of finite trace 
sums such as (22.2) to spectral determinants and dynamical zeta functions. One 
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might be tempted to believe that given a deterministic rule, a sum like (£2.2) 
can be evaluated to any desired precision. For short times, this is indeed true: 
every region Mj in (22.2) can be accurately delineated, and there is no need for 
any fancy theory. However, if the dynamics is unstable, local variations in initial 
conditions grow exponentially and in finite time attain the size of the system. The 
difficulty with estimating the n — > oo limit from (22.2) is then at least twofold: 

1. Due to the exponential growth in number of intervals, and the exponen- 
tial decrease in attainable accuracy, the maximal n attainable experimentally or 
numerically is in practice of order of something between 5 to 20; 

2. The pre-asymptotic sequence of finite estimates y n is not unique, because 
the sums T n depend on how we define the escape region, and because in general 
the areas \Mj\ in the sum (22.2) should be weighted by the density of initial xq. 

In contrast, dynamical zeta functions and spectral determinants are invariant 
under all smooth nonlinear conjugacies x — > h(x), not only linear rescalings, and 
require no n — > oo extrapolations. 



Commentary 

Remark 22.1 Nonhyperbolic measures. The measure = 1/|A,| is the natural 
measure only for the strictly hyperbolic systems. For non-hyperbolic systems, the mea- 
sure might develop cusps. For example, for Ulam maps (unimodal maps with quadratic 
critical point mapped onto the "left" unstable fixed point xq, discussed in more detail in 
chapter 24), the measure develops a square-root singularity on the 0 cycle: 

~ • (22.20) 

Thermodynamic averages are still expected to converge in the "hyperbolic" phase in 
which the positive entropy of unstable orbits dominates the marginal orbits, but they fail 
in the "non-hyperbolic" phase. The general case remains unclear [ 12, 2, 3, 4, 6]. 

Remark 22.2 Trace formula periodic orbit averaging. The cycle averaging formulas 
are not the first thing one intuitively writes down; the approximate trace formulas are 
more accessibly heuristic ally. Trace formula for averaging (22.19) seems to have been 
discussed for the first time by Hannay and Ozorio de Almeida [9, 9]. Another novelty of 
cycle averaging formulas is one of their main virtues, in contrast to the explicit analytical 
results such as those of ref. [4] . Their evaluation does not require any explicit construction 
of the (coordinate dependent) eigenfunctions of the Perron-Frobenius operator (i.e., the 
natural measure po)- 

Remark 22.3 Role of noise in dynamical systems. In any physical application, 
the dynamics is always accompanied by external noise in addition to deterministic chaos. 
The former can be characterized by its strength cr and distribution. Lyapunov exponents, 



getuscd - 24dec2012 



ChaosBook.org versionl4.4.1, Apr 21 2013 



EXERCISES 



449 



correlation decay, and dynamo rate can be defined in this case the same way as in the 
deterministic case. One might think that noise completely destroys the results derived 
here. However, as we show chapter 28, deterministic formulas remain valid to accuracy 
comparable with noise width if the noise level is small. A small level of noise even helps, 
as it makes the dynamics more ergodic. Deterministically non-communicating parts of 
state space become weakly connected due to noise. This argument explains why periodic 
orbit theory is also applicable to non-ergodic systems. For small amplitude noise, one can 
expand perturbatively 



a = ao + a\cr 2 + aicr 4 + ... 



around the deterministic averages oq. The expansion coefficients 57, aT, ... can also be 
expressed in terms of periodic orbit formulas. Calculating these coefficients is one of the 
challenges facing periodic orbit theory, discussed in refs. [9, 10, 11]. 



Exercises 

22. 1 . Escape rate of the logistic map. 

(a) Calculate the fraction of trajectories remaining 
trapped in the interval [0, 1] for the logistic map 



/M=A(l-(2*-l) 2 ), 



(22.21) 



and determine the A dependence of the escape rate 
y(A) numerically. 

(b) Develop a numerical method for calculating the 
lengths of intervals of trajectories remaining stuck 
for n iterations of the map. 

(c) Describe the dependence of A near the critical 
value A c = 1? 



22.2. Four-scale map correlation decay rate. 

the piecewise-linear map 



Consider 





Vi / 

/ \. 
/ \ 


w/ 









00 01 



10 



11 



/(*) = 



' /oo = AqX 

foi = s Q i(x-b) + 1 
fu =Ai(jc-&) + 1 
fw = s w (x- 1) 



with a 4-interval state space Markov partition 

M = {Mqq,M 0 uMw,Mu} 

= {[Q,b/A 0 ],(b/A 0 ,b](b,c](c,l]}. 

(a) compute soi, «io, c. 

(b) Show that the 2-cycle Floquet multiplier does not 
depend on b, 



Aqi - soi^io 



AoAj 



(Ao - 1)(A, + 1) 



(c) Write down the [2x2] Perron-Frobenius operator 
acting on the space of densities piecewise constant 
over the four partitions. 

(d) Construct the corresponding transition graph. 

(e) Write down the corresponding spectral determin- 
ant. 

(f) Show that the escape rate vanishes, y — — ln(zo) = 
0. 

(g) Determine the spectrum of the Perron-Frobenius 
operator on the space of densities piecewise con- 
stant over the four partitions. Show that the second 
largest eigenvalue of the is — = — 1 + 4 — 4- ■ 

& & Zi Ao A, 

(h) Is this value consistent with the tent map value 
previously computed in exercise 16.4 (with the ap- 
propriate choice of {Ao, K\,c}). 
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(i) (optional) Is this next-to leading eigenvalue still 
correct if the Perron-Frobenius operator acts on 
the space of analytic functions? 

22.3. Lyapunov exponents for 1-dimensional maps. Ex- 
tend your cycle expansion programs so that the first and 
the second moments of observables can be computed. 
Use it to compute the Lyapunov exponent for the fol- 
lowing maps: 

(a) the piecewise-linear skew tent (flow conserving 
map) 



/(*) = 



AqX 



if 0<x< A" 1 , 
Ai(l -x) if A" 1 <x< 1. ' 



Ai = A 0 /(A„ - 1). 

(b) the Ulam map f(x) - 4x(l - x) . 

(c) the skew Ulam map 

/(je)=Ao*(l-Jc)(l-&0, 



(22.22) 



1/Ao = x c (l - x c )(l - bx c ) . In our numerical work 
we fix (arbitrarily, the value chosen in ref. [3]) 
b = 0.6, so 

/(*) = 0. 121 8 x(l - x)(l - 0.6 x) 

with a peak f(x c ) = 1 at x c — 0.7. 

(d) the repeller of f(x) = Ax( 1 - x), for either A = 9/2 
or A = 6 (this is a continuation of exercise 20.2). 



(e) the 2-branch flow conserving map 

/oW = ^[ h ~P+ yl(h-p) 2 + 4hx^ 

Mx) = l.(h + p-l) (22.23) 
2h 

+^^(h + p-iy- + 4h(x- P ), 

with a 2-interval state space Markov partition 
M = {Mo, All) = {[0,p],(p,l]}.This is a non- 
linear perturbation of the Bernoulli shift map, for 
which h — 0 (23.6); the first 15 eigenvalues of the 
Perron-Frobenius operator are listed in ref. [ 1] for 
p = 0.8, h = 0.1. Use these parameter values 
when computing the Lyapunov exponent. 

Cases (a) and (b) can be computed analytically; cases 
(c), (d) and (e) require numerical computation of cy- 
cle stabilities. Just to see whether the theory is worth 
the trouble, also check your cycle expansions results for 
cases (c) and (d) with Lyapunov exponents computed 
by direct numerical averaging along trajectories of ran- 
domly chosen initial points: 

(f) trajectory-trajectory separation ( 6. 1) (hint: rescale 
5x every so often, to avoid numerical overflows), 

(g) iterated stability (6. 11). 

How good is the numerical accuracy compared with pe- 
riodic orbit theory predictions for (a) - (g)? 
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Chapter 23 



Why does it work? 



Bloch: "Space is the field of linear operators." 
Heisenberg: "Nonsense, space is blue and birds fly 
through it." 

— Felix Bloch, Heisenberg and the early days of 
quantum mechanics 

(R. Artuso, H.H. Rugh and R Cvitanovic) 

As we shall see, the trace formulas and spectral determinants work well, 
sometimes very well. The question is: Why? And it still is. The heuris- 
tic manipulations of chapter 1 8 were naive and reckless, as we are facing 
infinite-dimensional vector spaces and singular integral kernels. 

We now outline the key ingredients of proofs that put the trace and determi- 
nant formulas on solid footing. This requires taking a closer look at the evolution 
operators from a mathematical point of view, since up to now we have talked 
about eigenvalues without any reference to what kind of a function space the cor- 
responding eigenfunctions belong to. We shall restrict our considerations to the 
spectral properties of the Perron-Frobenius operator for maps, as proofs for more 
general evolution operators follow along the same lines. What we refer to as a "the 
set of eigenvalues" acquires meaning only within a precisely specified functional 
setting: this sets the stage for a discussion of the analyticity properties of spectral 
determinants. In example 23.1 we compute explicitly the eigenspectrum for the 
three analytically tractable piecewise linear examples. In sect.23.3 we review the 
basic facts of the classical Fredholm theory of integral equations. The program 
is sketched in sect. 23.4, motivated by an explicit study of eigenspectrum of the 
Bernoulli shift map, and in sect. 23.5 generalized to piecewise real-analytic hy- 
perbolic maps acting on appropriate densities. We show on a very simple example 
that the spectrum is quite sensitive to the regularity properties of the functions 
considered. 

For expanding and hyperbolic finite-subshift maps analyticity leads to a very 
strong result; not only do the determinants have better analyticity properties than 



452 



CHAPTER 23. WHY DOES IT WORK? 



453 



the trace formulas, but the spectral determinants are singled out as entire functions 
in the complex s plane. 

The goal of this chapter is not to provide an exhaustive review of the rigorous the- 
ory of the Perron-Frobenius operators and their spectral determinants, but rather 
to give you a feeling for how our heuristic considerations can be put on a firm 
basis. The mathematics underpinning the theory is both hard and profound. 

If you are primarily interested in applications of the periodic orbit theory, you 
should skip this chapter on the first reading. 



23.1 Linear maps: exact spectra 

We start gently; in example 23.1 we work out the exact eigenvalues and eigen- 
functions of the Perron-Frobenius operator for the simplest example of unstable, 
expanding dynamics, a linear 1 -dimensional map with one unstable fixed point. 
Ref. [6] shows that this can be carried over to ^-dimensions. Not only that, but 
in example 23.5 we compute the exact spectrum for the simplest example of a 
dynamical system with an infinity of unstable periodic orbits, the Bernoulli shift. 

Example 23.1 The simplest eigenspectrum - a single fixed point: In order to get 
some feeling for the determinants defined so formally in sect. 19.2, let us work out a 
trivial example: a repeller with only one expanding linear branch 

f(x) = Ax, |A|>1, 

and only one fixed point x q = 0. The action of the Perron-Frobenius operator ( 16.10) is 



From this one immediately gets that the monomials y k are eigenfunctions: 

£ s=m yk ' k=0 ' h2 ' - 

What are these eigenfunctions? Think of eigenfunctions of the Schrodinger 
equation: k labels the fcth eigenfunction ^ in the same spirit in which the number 
of nodes labels the kth quantum-mechanical eigenfunction. A quantum-mechanical 
amplitude with more nodes has more variability, hence a higher kinetic energy. 
Analogously, for a Perron-Frobenius operator, a higher k eigenvalue \l\A\Pt is 
getting exponentially smaller because densities that vary more rapidly decay more 
rapidly under the expanding action of the map. 





(23.1) 
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Example 23.2 The trace formula for a single fixed point: The eigenvalues A * 1 
fall off exponentially with k, so the trace of £ is a convergent sum 



1 00 



- k _ i i 



|A|^ |A|(l-A->) |/(0)' 

in agreement with (18. 7). A similar result follows for powers of £,, yielding the single- 
fixed point version of the trace formula for maps (18.10): 



w Si w T 1 

V gg = V l e n = _J_ (23 3) 

lu\- ze s t Zj|i_A r |' |A|A* ' 



k=0 r=l 



The left hand side of (23.3) is a meromorphic function, with the leading zero 
at z = |A|. So what? 



Example 23.3 Meromorphic functions and exponential convergence: As an 

illustration of how exponential convergence of a truncated series is related to analytic 
properties of functions, consider, as the simplest possible example of a meromorphic 
function, the ratio 



Kz) 



with a,b real and positive and a < b. Within the spectral radius |z| < b the function h 
can be represented by the power series 



Kz) = 2 



k=Q 



where <tq = a/b, and the higher order coefficients are given by o-j - (a - b)/b i+1 . 
Consider now the truncation of order N of the power series 



V Jk a z(a-b)(\-z N lb N ) 

h N (z) = > o- k f = - + -j- — 

b b z (l- z/b) 



Let zn be the solution of the truncated series h N (z N ) = 0. To estimate the distance 
between a and zn it is sufficient to calculate h N {d). It is oforder{a/b) N+l , so finite order 
estimates converge exponentially to the asymptotic value. 



This example shows that: (1) an estimate of the leading pole (the leading 
eigenvalue of £) from a finite truncation of a trace formula converges exponen- 
tially, and (2) the non-leading eigenvalues of £ lie outside of the radius of con- 
vergence of the trace formula and cannot be computed by means of such cycle 
expansion. However, as we shall now see, the whole spectrum is reachable at no 
extra effort, by computing it from a determinant rather than a trace. 
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Figure 23.1: The Bernoulli shift map. 




Example 23.4 The spectral determinant for a single fixed point: 

determinant (19.3) follows from the trace formulas of example 23.2: 

CO 1 v CO 



The spectral 



(23.4) 



where the cummulants Q„ are given explicitly by the Euler formula 

1 A" 1 A~" +1 

Qn 



1 - A- 1 1 - A- 2 



1 - A"" 



exercise 23.3 
(23.5) 



The main lesson to glean from this simple example is that the cummulants Q, 
decay asymptotically faster than exponentially, as A _n ^ n_1 ^ 2 . For example, if we 
approximate series such as (23.4) by the first 10 terms, the error in the estimate of 
the leading zero is « 1/A 50 ! 

So far all is well for a rather boring example, a dynamical system with a single 
repelling fixed point. What about chaos? Systems where the number of unstable 
cycles increases exponentially with their length? We now turn to the simplest 
example of a dynamical system with an infinity of unstable periodic orbits. 



Example 23.5 Eigenfunction of Bernoulli shift map. (continued from example 11 .7) The 
Bernoulli shift map figure 23. 1 



/(*) = 



fo(x) = 2x, 



xe/ 0 = [0,1/2) 



f 1 (x) = 2x-l, xeh = (1/2,1] 



(23.6) 



models the 50-50% probability of a coin toss. The associated Perron-Frobenius oper- 
ator (16.9) assembles p(y) from its two prei mages 



(23.7) 



For this simple example the eigenfunctions can be written down explicitly: they coincide, 
up to constant prefactors, with the Bernoulli polynomials B n (x). These polynomials are 
generated by the Taylor expansion of the generating function 

t X ^ °° 1 

0{x,t) = = B k (x)-j-, B 0 (x) = l, B 1 (x)=x- 

e k=0 
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The Perron-Frobenius operator (23.7) acts on the generating function Q as 

l(te x " 2 te^e^X t e*« 2 ^ (t/2) k 

x ' k=l 

hence each B k (x) is an eigenfunction of £ with eigenvalue 1/2*. 

The full operator has two components corresponding to the two branches. For 
the n times iterated operator we have a full binary shift, and for each of the 2" branches 
the above calculations carry over, yielding the same trace (2" - 1) _1 for every cycle on 
length n. Without further ado we substitute everything back and obtain the determinant, 

det (1 - z£) = exp (- £ £ = f[ f 1 " Jf) • < 23 - 8 ) 

verifying that the Bernoulli polynomials are eigenfunctions with eigenvalues 1,1/2,..., 
1/2", .... 



The Bernoulli map spectrum looks reminiscent of the single fixed-point spec- 
trum (23.2), with the difference that the leading eigenvalue here is 1, rather than 
1/|A|. The difference is significant: the single fixed-point map is a repeller, with 
escape rate (1.7) given by the X leading eigenvalue y = In |A|, while there is no 
escape in the case of the Bernoulli map. As already noted in discussion of the 
relation (19.23), for bounded systems the local expansion rate (here In |A| = In 2) 
is balanced by the entropy (here In 2, the log of the number of preimages F s ), 
yielding zero escape rate. 

So far we have demonstrated that our periodic orbit formulas are correct for 
two piecewise linear maps in 1 dimension, one with a single fixed point, and one 
with a full binary shift chaotic dynamics. For a single fixed point, eigenfunctions 
are monomials in x. For the chaotic example, they are orthogonal polynomials on 
the unit interval. What about higher dimensions? We check our formulas on a 
2-dimensional hyperbolic map next. 



Example 23.6 The simplest of 2-dimensional maps - a single hyperbolic fixed 
point: We start by considering a very simple linear hyperbolic map with a single 
hyperbolic fixed point, 

fix) = (fi(xi,x 2 ),f2(xi,x 2 )) = (A s xi,A u x 2 ) , 0 < |A S | < 1 , |A„| > 1 . 

The Perron-Frobenius operator (16.10) acts on the 2-dimensional density functions as 

£p(xi , x 2 ) = —5— p( Xl / A,, x 2 /A u ) (23.9) 

What are good eigenfunctions? Cribbing the l-dimensional eigenfunctions for the sta- 
ble, contracting xi direction from example 23.1 is not a good idea, as under the iter- 
ation of £ the high terms in a Taylor expansion of p(x\,x 2 ) in the x\ variable would 
get multiplied by exponentially exploding eigenvalues 1/A*. This makes sense, as in 
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the contracting directions hyperbolic dynamics crunches up initial densities, instead of 
smoothing them. So we guess instead that the eigenfunctions are of form 

(fk l k 2 (xi,x 2 ) - x k 2 2 /x\ l+l , k\, k,2 = 0,1,2,... , (23.10) 

a mixture of the Laurent series in the contraction x\ direction, and the Taylor series in 
the expanding direction, the x 2 variable. The action of Perron-Frobenius operator on 
this set of basis functions 

cr A* 1 

Jjfhk 2 (x\, x 2 ) = — — YVhhixuXi), <t = A s /\A s \ 

is smoothing, with the higher k\, k 2 eigenvectors decaying exponentially faster, by 
Aj7A* 2+1 factor in the eigenvalue. One verifies by an explicit calculation (undoing 
the geometric series expansions to lead to (19.9)) that the trace of £ indeed equals 
l/|det(l - M)\ = 1 /|(1 - A„)(l - A s )| , from which it follows that all our trace and spectral 
determinant formulas apply. The argument applies to any hyperbolic map linearized 
around the fixed point of form f(x\....,x d ) = (A\X\, A 2 x 2 , . . . , A d x d ). 



So far we have checked the trace and spectral determinant formulas derived 
heuristic ally in chapters 18 and 19, but only for the case of 1 -dimensional and 
2-dimensional linear maps. But for infinite-dimensional vector spaces this game 
is fraught with dangers, and we have already been mislead by piecewise linear 
examples into spectral confusions: contrast the spectra of example 16.1 and ex- 
ample 17.4 with the spectrum computed in example 18.2. 

We show next that the above results do cany over to a sizable class of piece- 
wise analytic expanding maps. 



23.2 Evolution operator in a matrix representation 

The standard, and for numerical puiposes sometimes very effective way to look at 
operators is through their matrix representations. Evolution operators are moving 
density functions defined over some state space, and as in general we can imple- 
ment this only numerically, the temptation is to discretize the state space as in 
sect. 16.3. The problem with such state space discretization approaches that they 
sometimes yield plainly wrong spectra (compare example 17.4 with the result of 
example 18.2), so we have to think through carefully what is it that we really 
measure. 

An expanding map f(x) takes an initial smooth density (fh(x), defined on a 
subinterval, stretches it out and overlays it over a larger interval, resulting in a new, 
smoother density <f) n+ \(x). Repetition of this process smoothes the initial density, 
so it is natural to represent densities $,(x) by their Taylor series. Expanding 



My) = ^ ^(O)^ . <Pn + i(y) k = 2 €lx^ > 



k=0 
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<fi(0) = fdx 5 (t \y - f{x))U4 y=Q • * = ' 



and substitute the two Taylor series into (16.6): 



<Pn+\{y) = U<Pn){y)= f £k<5(y -/(*))&(*). 



The matrix elements follow by evaluating the integral 



y=0 



(23.11) 



we obtain a matrix representation of the evolution operator 



dx£(y,x)— = ^ — Ljet, fc,fc' = 0,1,2,, 



which maps the x* component of the density of trajectories <fh(x) into the y com- 
ponent of the density (f> n +i(y) one time step later, with y = f(x). 

We already have some practice with evaluating derivatives 6 c \y) = ^S(y) from 
sect. 16.2. This yields a representation of the evolution operator centered on the 
fixed point, evaluated recursively in terms of derivatives of the map /: 



dx5 (t \x- f{x)) 



k\ 



(23.12) 



1 



1 \ c x* 



\f'\\dxf'{x)J k\ 



■*=/« 



The matrix elements vanish for I < k, so L is a lower triangular matrix. The 
diagonal and the successive off-diagonal matrix elements are easily evaluated it- 
eratively by computer algebra 



1 



|A|A* ' 



jk + 2)\f" 
2£!|A|A*+ 2 ' 



For chaotic systems the map is expanding, |A| > 1. Hence the diagonal terms drop 
off exponentially, as 1 /|A|^ +1 , the terms below the diagonal fall off even faster, and 
truncating L to a finite matrix introduces only exponentially small errors. 

The trace formula (23.3) takes now a matrix form 



tr- 



z£_ 
l-z£ 



tr- 



1-zL 



(23.13) 
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1 



f(w) 
0.5- 



Figure 23.2: A nonlinear one-branch repeller with a 
single fixed point w q . 



0.5 



In order to illustrate how this works, we work out a few examples. 

In example 23.7 we show that these results carry over to any analytic single- 
branch 1 -dimensional repeller. Further examples motivate the steps that lead to 
a proof that spectral determinants for general analytic 1 -dimensional expanding 
maps, and - in sect. 23.5, for 1-dimensional hyperbolic mappings - are also entire 
functions. 



Example 23.7 Perron-Frobenius operator in a matrix representation: As in ex- 
ample 23.1, we start with a map with a single fixed point, but this time with a nonlin- 
ear piecewise analytic map f with a nonlinear inverse F = f~ l , sign of the derivative 
cr = cr(F') = F'/\F'\ , and the Perron-Frobenius operator acting on densities analytic in 
an open domain enclosing the fixed point x = w q , 

Jjf>(y) = J dxS(y-f(x))(f>(x) = crF'(y) t f>(F(y)). 

Assume that F is a contraction of the unit disk in the complex plane, i.e., 

\F(z)\ < <9 < 1 and \F'(z)\ < C < oo for \z\ < I , (23.14) 
and expand cp in a polynomial basis with the Cauchy integral formula 

dw (f>(w) f dw (p{w) 



n=0 



2ni w — z J 2ni w 



Combining this with (23.22), we see that in this basis Perron-Frobenius operator £ is 
represented by the matrix 

-C4>(W) = 2j W ^mn<t>n , L,„„ = (7) — — . (23.15) 

Taking the trace and summing we get: 

cr F'(w) 



n>0 - -■ 

This integral has but one simple pole at the unique fixed point w* = F(w*) = f(w*). 
Hence 



tr £ 



exercise 23.6 
cr F'(w*) 1 
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This super-exponential decay of cummulants Qt ensures that for a repeller 
consisting of a single repelling point the spectral determinant (23.4) is entire in 
the complex z plane. 

In retrospect, the matrix representation method for solving the density evolu- 
tion problems is eminently sensible — after all, that is the way one solves a close 
relative to classical density evolution equations, the Schrodinger equation. When 
available, matrix representations for £, enable us to compute many more orders 
of cumulant expansions of spectral determinants and many more eigenvalues of 
evolution operators than the cycle expensions approach. 

Now, if the spectral determinant is entire, formulas such as (19.25) imply that 
the dynamical zeta function is a meromorphic function. The practical import of 
this observation is that it guarantees that finite order estimates of zeroes of dyn- 
amical zeta functions and spectral determinants converge exponentially, or - in 
cases such as (23.4) - super-exponentially to the exact values, and so the cycle 
expansions to be discussed in chapter 20 represent a true perturbative approach to 
chaotic dynamics. 

Before turning to specifics we summarize a few facts about classical theory 
of integral equations, something you might prefer to skip on first reading. The 
purpose of this exercise is to understand that the Fredholm theory, a theory that 
works so well for the Hilbert spaces of quantum mechanics does not necessarily 
work for deterministic dynamics - the ergodic theory is much harder. 

fast track: 
sect. 23.4, p. 462 

23.3 Classical Fredholm theory 

He who would valiant be 'gainst all disaster 
Let him in constancy follow the Master. 
— John Bunyan, Pilgrim 's Progress 




The Perron-Frobenius operator 



£<Kx) = J dy6(x-f(y))<f>(y) 
has the same appearance as a classical Fredholm integral operator 

<K<p(x) = f dy<K(x,y)<p(y), (23.16) 
Jm 

and one is tempted to resort to classical Fredholm theory in order to establish 
analyticity properties of spectral determinants. This path to enlightenment is 
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blocked by the singular nature of the kernel, which is a distribution, whereas the 
standard theory of integral equations usually concerns itself with regular kernels 
<K(x,y) e L (M 2 ). Here we briefly recall some steps of Fredholm theory, before 
working out the example of example 23.5. 

The general form of Fredholm integral equations of the second kind is 

<p(x) = f dy<K(x,y)(p(y) + (23.17) 
Jm 

where g(x) is a given function in L 2 (Af) and the kernel y) € L 2 (M 2 ) (Hilbert- 
Schmidt condition). The natural object to study is then the linear integral operator 
(23. 16), acting on the Hilbert space L 2 (M): the fundamental property that follows 
from the L 2 (Q) nature of the kernel is that such an operator is compact, that is 
close to a finite rank operator.A compact operator has the property that for every 
5 > 0 only a finite number of linearly independent eigenvectors exist correspond- 
ing to eigenvalues whose absolute value exceeds 8, so we immediately realize 
(figure 23.5) that much work is needed to bring Perron-Frobenius operators into 
this picture. 

We rewrite (23.17) in the form 

Tip = T = \-<K. (23.18) 

The Fredholm alternative is now applied to this situation as follows: the equation 
7~<p = £, has a unique solution for every £ e l}{M) or there exists a non-zero 
solution of T<po = 0, with an eigenvector of 7C corresponding to the eigenvalue 1 . 
The theory remains the same if instead of T we consider the operator T\ = 1-AK 
with A ± 0. As "7C is a compact operator there is at most a denumerable set of A for 
which the second part of the Fredholm alternative holds: apart from this set the 
inverse operator ( \ -AT)~ x exists and is bounded (in the operator sense). When A 
is sufficiently small we may look for a perturbative expression for such an inverse, 
as a geometric series 

(1-A9Q- 1 - 1 + A<K + A 2< K 2 + ■ ■ ■ - 1 + A<W, (23.19) 

where 7C" is a compact integral operator with kernel 

<K n {x, y) = f dzi ... dz n - 1 K(x, zi ) ■ ■ ■ 'Kiz.n- i,y), 
Jm"- [ 

and is also compact, as it is given by the convergent sum of compact operators. 
The problem with (23.19) is that the series has a finite radius of convergence, 
while apart from a denumerable set of /{'s the inverse operator is well defined. 
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A fundamental result in the theory of integral equations consists in rewriting the 
resolving kernel 'W as a ratio of two analytic functions of A 



W&y) = 



£>(*,yU) 
D(A) 



If we introduce the notation 



7C 



yi 



K(xuyi) ... K(xuyn) 

c K{x n ,y l ) ... K.{x n ,y n ) 



we may write the explicit expressions 



D(A) 



oo „ 

1 + ^(-1)"-: dzi ...dZn'K 
frf n\ J M „ 



Zl ■ ■ ■ Zn 
Z\...Z n 



exp 



- V -trTT 

m 



V m=l 



(23.20) 



<K\ y I 



y r 



Z n 



The quantity D{A) is known as the Fredholm determinant (see (19.24)):it is an 
entire analytic function of A, and D(A) - 0 if and only if 1 /A is an eigenvalue of 
<K. 

Worth emphasizing again: the Fredholm theory is based on the compactness 
of the integral operator, i.e., on the functional properties (summability) of its ker- 
nel. As the Perron-Frobenius operator is not compact, there is a bit of wishful 
thinking involved here. 



23.4 Analyticity of spectral determinants 



They savored the strange warm glow of being much more 
ignorant than ordinary people, who were only ignorant of 
ordinary things. 

— Terry Pratchett 

Spaces of functions integrable Is, or square-integrable L 2 on interval [0, 1] 
are mapped into themselves by the Perron-Frobenius operator, and in both cases 
the constant function 0o = 1 is an eigenf unction with eigenvalue 1. If we focus 
our attention on L 1 we also have a family of L 1 eigenf unctions, 

<M)0 = Y, ex P( 2nik y)jT7Z (23.21) 
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with complex eigenvalue 2 , parameterized by complex 9 with Re 9 > 0. By 
varying 9 one realizes that such eigenvalues fill out the entire unit disk. Such 
essential spectrum, the case k = 0 of figure 23.5, hides all fine details of the 
spectrum. 

What's going on? Spaces L 1 and L 2 contain arbitrarily ugly functions, allow- 
ing any singularity as long as it is (square) integrable - and there is no way that 
expanding dynamics can smooth a kinky function with a non-differentiable singu- 
larity, let's say a discontinuous step, and that is why the eigenspectrum is dense 
rather than discrete. Mathematicians love to wallow in this kind of muck, but there 
is no way to prepare a nowhere differentiable L 1 initial density in a laboratory. The 
only thing we can prepare and measure are piecewise smooth (real-analytic) den- 
sity functions. 

For a bounded linear operator on a Banach space Q, the spectral radius 
is the smallest positive number p spec such that the spectrum is inside the disk of 
radius p spec , while the essential spectral radius is the smallest positive number 
p ess such that outside the disk of radius p ess the spectrum consists only of isolated 
eigenvalues of finite multiplicity (see figure 23.5). 



We may shrink the essential spectrum by letting the Perron-Frobenius oper- 
ator act on a space of smoother functions, exactly as in the one -branch repeller 
case of sect. 23.1. We thus consider a smaller space, C* + ' f , the space of k times 
differentiable functions whose &'th derivatives are Holder continuous with an ex- 
ponent 0 < a < 1 : the expansion property guarantees that such a space is mapped 
into itself by the Perron-Frobenius operator. In the strip 0 <Re 9 < k + a most <fe 
will cease to be eigenf unctions in the space (* +a ; the function <p n survives only for 
integer valued 0 - n. In this way we arrive at a finite set of isolated eigenvalues 
1, 2 _1 , • • • , 2~ k , and an essential spectral radius p ess = 2~^ k+a \ 

We follow a simpler path and restrict the function space even further, namely 
to a space of analytic functions, i.e., functions for which the Taylor expansion is 
convergent at each point of the interval [0, 1]. With this choice things turn out easy 
and elegant. To be more specific, let <p be a holomorphic and bounded function on 
the disk D = B(0, R) of radius R > 0 centered at the origin. Our Perron-Frobenius 
operator preserves the space of such functions provided (1 + R)/2 < R so all we 
need is to choose R > 1. If F s , s 6 {0, 1}, denotes the s inverse branch of the 
Bernoulli shift (23.6), the corresponding action of the Perron-Frobenius operator 
is given by £ s h(y) = cr F' s (y) h o F s (y), using the Cauchy integral formula along 
the 3D boundary contour: 



For reasons that will be made clear later we have introduced a sign cr = ± 1 of the 
given real branch \F'(y)\ = cr F'(y). For both branches of the Bernoulli shift s = 1, 
but in general one is not allowed to take absolute values as this could destroy 
analyticity. In the above formula one may also replace the domain D by any 




(23.22) 



converg - 9nov2008 



ChaosBook.org version 14.4. 1 , Apr 2 1 20 1 3 



CHAPTER 23. WHY DOES IT WORK? 



464 



domain containing [0, 1] such that the inverse branches maps the closure of D into 
the interior of D. Why? simply because the kernel remains non-singular under 
this condition, i.e., w - F(y) + 0 whenever w € dD and y € CI D. The problem 
is now reduced to the standard theory for Fredholm determinants, sect. 23.3. The 
integral kernel is no longer singular, traces and determinants are well-defined, and 
we can evaluate the trace of £p by means of the Cauchy contour integral formula: 

£ dw crF'Cw) 

tr £ F = CD ■ 

J 2m w - F(w) 

Elementary complex analysis shows that since F maps the closure of D into its 
own interior, F has a unique (real-valued) fixed point / with a multiplier strictly 
smaller than one in absolute value. Residue calculus therefore yields 



tr£ F = 



exercise 23.6 
o-F'(x*) 1 



1-F'(x*) |/'(x*)-l|' 
justifying our previous ad hoc calculations of traces using Dirac delta functions. 

Example 23.8 Perron-Frobenius operator in a matrix representation: As in ex- 
ample 23.1, we start with a map with a single fixed point, but this time with a nonlin- 
ear piecewise analytic map f with a nonlinear inverse F = f~ l , sign of the derivative 

o- = o-(F') = F'/\F'\ 

£<P(z) = J<k6(z- f(x)) 4>(x) = cr F'(z) <f>(F(z)) . 

Assume that F is a contraction of the unit disk, i.e., 

\F(z)\ <6<l and \F'(z)\ < C < oo for \z\ < I , (23.23) 
and expand 4> in a polynomial basis by means of the Cauchy formula 

Combining this with (23.22), we see that in this basis £ is represented by the matrix 

£<f>(w) = ^mn<f>n , Km = CD ~ ■ (23.24) 

Taking the trace and summing we get: 

tr£=>L ml = (()— ±-£. 

J 2m w - r (w) 

«>o " x ' 

This integral has but one simple pole at the unique fixed point w* = F(w*) = f(w*). 
Hence 

„ o- F'(w*) 1 
tr £ 



1-F'(w*) \f'(w*)-l\ 



converg - 9nov2008 



ChaosBook.org version 14.4. 1 , Apr 2 1 20 1 3 



CHAPTER 23. WHY DOES IT WORK? 



465 



We worked out a very specific example, yet our conclusions can be gener- 
alized, provided a number of restrictive requirements are met by the dynamical 
system under investigation: 



exercise 23.6 



1) the evolution operator is multiplicative along the flow, 

2) the symbolic dynamics is a. finite subshift, 

3) all cycle eigenvalues are hyperbolic (exponentially bounded in 
magnitude away from 1), 

4) the map (or the flow) is real analytic, i.e., it has a piecewise ana- 
lytic continuation to a complex extension of the state space. 



These assumptions are romantic expectations not satisfied by the dynamical 
systems that we actually desire to understand. Still, they are not devoid of physical 
interest; for example, nice repellers like our 3-disk game of pinball do satisfy the 
above requirements. 

Properties 1 and 2 enable us to represent the evolution operator as a finite 
matrix in an appropriate basis; properties 3 and 4 enable us to bound the size 
of the matrix elements and control the eigenvalues. To see what can go wrong, 
consider the following examples: 

Property 1 is violated for flows in 3 or more dimensions by the following 
weighted evolution operator 

Z t {y,x) = \M{x)f6(y-f t {x)) , 

where A\x) is an eigenvalue of the Jacobian matrix transverse to the flow. Semi- 
classical quantum mechanics suggest operators of this form with fi = l/2.The 
problem with such operators arises from the fact that when considering the Ja- 
cobian matrices J a b = J a Jb for two successive trajectory segments a and b, the 
corresponding eigenvalues are in general not multiplicative, Afo + A a Af, (unless 
a, b are iterates of the same prime cycle p, so J a Jb = Jp + ' b )- Consequently, this 
evolution operator is not multiplicative along the trajectory. The theorems require 
that the evolution be represented as a matrix in an appropriate polynomial basis, 
and thus cannot be applied to non-multiplicative kernels, i.e., kernels that do not 
satisfy the semi-group property £ = X . 

Property 2 is violated by the 1-dimensional tent map (see figure 23.3 (a)) 



f{x) = a{\ - |1 - 2*|) , 1/2 < a < 1 . 



All cycle eigenvalues are hyperbolic, but in general the critical point x- - 1/2 
is not a pre -periodic point, so there is no finite Markov partition and the sym- 
bolic dynamics does not have a finite grammar (see sect. 12.4 for definitions). In 
practice, this means that while the leading eigenvalue of £, might be computable, 
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Figure 23.3: (a) A (hyperbolic) tent map without 
a finite Markov partition, (b) A Markov map with 
a marginal fixed point. 




the rest of the spectrum is very hard to control; as the parameter a is varied, the 
non-leading zeros of the spectral determinant move wildly about. 

Property 3 is violated by the map (see figure 23.3 (b)) 




x + 2x 2 , x e I 0 - [0, A] 
2 - 2x , xeh - [i,l] 



Here the interval [0, 1] has a Markov partition into two subintervals ^ and I\, and 
/ is monotone on each. However, the fixed point at x = 0 has marginal stability 
Ao = 1, and violates condition 3. This type of map is called "intermittent" and 
necessitates much extra work. The problem is that the dynamics in the neighbor- 
hood of a marginal fixed point is very slow, with correlations decaying as power 
laws rather than exponentially. We will discuss such flows in chapter 24. 

Property 4 is required as the heuristic approach of chapter 1 8 faces two major 
hurdles: 

1. The trace (18.8) is not well defined because the integral kernel is singular. 

2. The existence and properties of eigenvalues are by no means clear. 

Actually, property 4 is quite restrictive, but we need it in the present approach, 
so that the Banach space of analytic functions in a disk is preserved by the Perron- 
Frobenius operator. 

In attempting to generalize the results, we encounter several problems. First, 
in higher dimensions life is not as simple. Multi-dimensional residue calculus is 
at our disposal but in general requires that we find poly-domains (direct product 
of domains in each coordinate) and this need not be the case. Second, and per- 
haps somewhat surprisingly, the 'counting of periodic orbits' presents a difficult 
problem. For example, instead of the Bernoulli shift consider the doubling map 
(11.8) of the circle, x i-> 2x mod 1, x € R/Z. Compared to the shift on the interval 
[0, 1] the only difference is that the endpoints 0 and 1 are now glued together. Be- 
cause these endpoints are fixed points of the map, the number of cycles of length 
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n decreases by 1. The determinant becomes: 



det(l -z£) = exp 



= 1 -z. 



(23.25) 



n 2" - 1 



The value z = 1 still comes from the constant eigenfunction, but the Bernoulli 
polynomials no longer contribute to the spectrum (as they are not periodic). Proofs 
of these facts, however, are difficult if one sticks to the space of analytic functions. 

Third, our Cauchy formulas a priori work only when considering purely ex- 
panding maps. When stable and unstable directions co-exist we have to resort to 
stranger function spaces, as shown in the next section. 



Proceeding to hyperbolic systems, one faces the following paradox: If / is an 
area-preserving hyperbolic and real-analytic map of, for example, a 2-dimensional 
torus then the Perron-Frobenius operator is unitary on the space of Z? functions, 
and its spectrum is confined to the unit circle. On the other hand, when we 
compute determinants we find eigenvalues scattered around inside the unit disk. 
Thinking back to the Bernoulli shift example 23.5 one would like to imagine 
these eigenvalues as popping up from the L 2 spectrum by shrinking the function 
space. Shrinking the space, however, can only make the spectrum smaller so this 
is obviously not what happens. Instead one needs to introduce a 'mixed' function 
space where in the unstable direction one resorts to analytic functions, as before, 
but in the stable direction one instead considers a 'dual space' of distributions on 
analytic functions. Such a space is neither included in nor includes Z? and we 
have thus resolved the paradox. However, it still remains to be seen how traces 
and determinants are calculated. 

The linear hyperbolic fixed point example23.6 is somewhat misleading, as we 
have made explicit use of a map that acts independently along the stable and unsta- 
ble directions. For a more general hyperbolic map, there is no way to implement 
such direct product structure, and the whole argument falls apart. Her comes an 
idea; use the analyticity of the map to rewrite the Perron-Frobenius operator acting 
as follows (where cr denotes the sign of the derivative in the unstable direction): 



23.5 Hyperbolic maps 



I can give you a definion of a Banach space, but I do not 
know what that means. 

— Federico Bonnetto, Banach space 



(H.H. Rugh) 



£Kzi,Z2) = 




(zi -/i(wi,w 2 )(/2(wi,W2) -Zi) 2m 2m 



cr h(w\,W2) dw\ d\V2 



(23.26) 
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Figure 23.4: For an analytic hyperbolic map, specify- 
ing the contracting coordinate w/, at the initial rectangle 
and the expanding coordinate z,. at the image rectangle 
defines a unique trajectory between the two rectangles. 
In particular, w v and zh ( n °t shown) are uniquely spec- 
ified. 




Here the function cf> should belong to a space of functions analytic respectively 
outside a disk and inside a disk in the first and the second coordinates; with the 
additional property that the function decays to zero as the first coordinate tends 
to infinity. The contour integrals are along the boundaries of these disks. It is 
an exercise in multi-dimensional residue calculus to verify that for the above lin- 
ear example this expression reduces to (23.9). Such operators form the building 
blocks in the calculation of traces and determinants. One can prove the following: 

Theorem: The spectral determinant for 2- dimensional hyperbolic analytic maps 
is entire. 

The proof, apart from the Markov property that is the same as for the purely 
expanding case, relies heavily on the analyticity of the map in the explicit con- 
struction of the function space. The idea is to view the hyperbolicity as a cross 
product of a contracting map in forward time and another contracting map in back- 
ward time. In this case the Markov property introduced above has to be elaborated 
a bit. Instead of dividing the state space into intervals, one divides it into rectan- 
gles. The rectangles should be viewed as a direct product of intervals (say hori- 
zontal and vertical), such that the forward map is contracting in, for example, the 
horizontal direction, while the inverse map is contracting in the vertical direction. 
For Axiom A systems (see remark 23.8) one may choose coordinate axes close 
to the stable/unstable manifolds of the map. With the state space divided into 
N rectangles [M\,M2, ■ ■ ■ . Atjv}, Mi = if x I v one needs a complex extension 
D h t X D v ., with which the hyperbolicity condition (which simultaneously guaran- 
tees the Markov property) can be formulated as follows: 

Analytic hyperbolic property: Either /(Afc) n Int(Aly) = 0, or for each pair 
wh € C1(D| ! ), z v £ Cl(Dp there exist unique analytic functions of v%,z v : w v = 
w v (w h ,Zv) e Int(DJ'), Zh = Zh(wh,z v ) e Int(D^), such that f(w h ,w v ) = (zh,z v ). 
Furthermore, if w/, € if and z. v £ Tj, then w v e IJ and zh £ I h j (see figure 23.4). 

In plain English, this means for the iterated map that one replaces the coor- 
dinates Zh,z v at time n by the contracting pair Zh,w v , where w v is the contracting 
coordinate at time n + 1 for the 'partial' inverse map. 

In two dimensions the operator in (23.26) acts on functions analytic outside 
D' ! in the horizontal direction (and tending to zero at infinity) and inside U in 
the vertical direction. The contour integrals are precisely along the boundaries of 
these domains. 
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A map / satisfying the above condition is called analytic hyperbolic and the 
theorem states that the associated spectral determinant is entire, and that the trace 
formula (18.8) is correct. 

Examples of analytic hyperbolic maps are provided by small analytic pertur- 
bations of the cat map, the 3-disk repeller, and the 2-dimensional baker's map. 



23.6 Physics of eigenvalues and eigenfunctions 



^■r By now we appreciate that any honest attempt to look at the spectral prop- 
erties of the Perron-Frobenius operator involves hard mathematics, but the reward 
is of this effort is that we are able to control the analyticity properties of dynamical 
zeta functions and spectral determinants, and thus substantiate the claim that these 
objects provide a powerful and well-founded theory. 

Often (see chapter 17) physically important part of the spectrum is just the 
leading eigenvalue, which gives us the escape rate from a repeller, or, for a gen- 
eral evolution operator, formulas for expectation values of observables and their 
higher moments. Also the eigenfunction associated to the leading eigenvalue has 
a physical interpretation (see chapter 16): it is the density of the natural measures, 
with singular measures ruled out by the proper choice of the function space. This 
conclusion is in accord with the generalized Perron-Frobenius theorem for evolu- 
tion operators. In a finite dimensional setting, the statement is: 



• Perron-Frobenius theorem: Let L ; y be a non-negative matrix, such that 
some finite n exists for which any initial state has reached any other state, 
(L n )ij > 0 ViJ: then 

1. The maximal modulus eigenvalue is non-degenerate, real, and posi- 



2. The corresponding eigenvector (defined up to a constant) has non- 
negative coordinates. 

We may ask what physical information is contained in eigenvalues beyond the 
leading one: suppose that we have a probability conserving system (so that the 
dominant eigenvalue is 1), for which the essential spectral radius satisfies 0 < 
Pess < # < 1 on some Banach space S. Denote by P the projection corresponding 
to the part of the spectrum inside a disk of radius 9. We denote by \, A% . . . , Am 
the eigenvalues outside of this disk, ordered by the size of their absolute value, 
with A.\ = 1. Then we have the following decomposition 



remark 23.7 



tive, 



M 




(23.27) 
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when Lj are (finite) matrices in Jordan canomical form (Lq - 0 is a [1 x 1] matrix, 
as Aq is simple, due to the Perron-Frobenius theorem), whereas ifc is a row vector 
whose elements form a basis on the eigenspace corresponding to A, and i]/* is 
a column vector of elements of & (the dual space of linear functionals over S) 
spanning the eigenspace of ■£* corresponding to Aj. For iterates of the Perron- 
Frobenius operator, (23.27) becomes 



M 

£> - 2 WiLltftp + P£'V ■ (23.28) 

(=i 

If we now consider, for example, correlation between initial ip evolved n steps and 
final 

<^|X M |^> - f dymtt n <P)(y) = f dw(Zof)( w )<p(w), (23.29) 
Jm Jm 

it follows that 

L 

<£|£» = X[io^, <p) + Y, <p) + Oiff 1 ) , (23.30) 



1=2 



where 



(n) 

a> 



Jm 



The eigenvalues beyond the leading one provide two pieces of information: 
they rule the convergence of expressions containing high powers of the evolution 
operator to leading order (the A\ contribution). Moreover if a>\(^,ip) = 0 then 
(23.29) defines a correlation function: as each term in (23.30) vanishes exponen- 
tially in the n — > oo limit, the eigenvalues A2, ... ,Am determine the exponential 
decay of correlations for our dynamical system. The prefactors oj depend on the 
choice of functions, whereas the exponential decay rates (given by logarithms of 
Ai) do not: the correlation spectrum is thus a universal property of the dynamics 
(once we fix the overall functional space on which the Perron-Frobenius operator 
acts). 



exercise 23.7 



Example 23.9 Bernoulli shift eigenfunctions: Let us revisit the Bernoulli shift ex- 
ample (23.6) on the space of analytic functions on a disk: apart from the origin we have 
only simple eigenvalues A k = 2~ k , k = 0, 1, . . .. The eigenvalue A u = 1 corresponds to 
probability conservation: the corresponding eigenfunction B 0 (x) = 1 indicates that the 
natural measure has a constant density over the unit interval. If we now take any ana- 
lytic function n(x) with zero average (with respect to the Lebesgue measure), it follows 
that cj\(n,n) = 0, and from (23.30) the asymptotic decay of the correlation function is 
(unless also u>i(n, rj) = 0) 

C^in) ~ exp(-n log 2). (23.31) 
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Thus, -log/li gives the exponential decay rate of correlations (with a prefactor that 
depends on the choice of the function). Actually the Bernoulli shift case may be treated 
exactly as for analytic functions we can employ the Euler-MacLaurin summation for- 
mula 

rj{z) = dwrjiw) +Vi ^— £ —B m (z). (23.32) 

Jo ml 

As we are considering functions with zero average, we have from (23.29) and the fact 
that Bernoulli polynomials are eigenvectors of the Perron-Frobenius operator that 

r ^ V (2- m )"(/7 (m) (D-77 (m) (0)) f 1 . 

C,,,(h) = ) ; dZT](z)B m (z) . 

The decomposition (23.32) is also useful in realizing that the linear functional ij/* are 
singular objects: if we write it as 

CO 

Viz) = BmbWJfl, 
m=0 

we see that these functionals are of the form 

i[f*[s] = f i/wTi(w)£(w), 
Jo 

where 

*¥i(w) = [^- l \w - 1) - 6 {i - 1} (w)) , (23.33) 

when i > 1 and v Fo(w) = 1. This representation is only meaningful when the function s 
is analytic in neighborhoods of w, w - 1. 



23.7 Troubles ahead 



The above discussion confirms that for a series of examples of increasing gener- 
ality formal manipulations with traces and determinants are justified: the Perron- 
Frobenius operator has isolated eigenvalues, the trace formulas are explicitly ver- 
ified, and the spectral determinant is an entire function whose zeroes yield the 
eigenvalues. Real life is harder, as we may appreciate through the following 
considerations: 



• Our discussion tacitly assumed something that is physically entirely reason- 
able: our evolution operator is acting on the space of analytic functions, i.e., 
we are allowed to represent the initial density p(x) by its Taylor expansions 
in the neighborhoods of periodic points. This is however far from being the 
only possible choice: mathematicians often work with the function space 
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Figure 23.5: Spectrum of the Perron-Frobenius oper- 
ator acting on the space of C k+a Holder-continuous 
functions: only k isolated eigenvalues remain between 
the spectral radius, and the essential spectral radius 
which bounds the "essential," continuous spectrum. 




isolated eigenvalue 



C k+a , i.e., the space of k times differentiable functions whose k'th deriva- 
tives are Holder continuous with an exponent 0 < a < 1 : then every f with 
Re x] > k is an eigenfunction of the Perron-Frobenius operator and we have 

w = m?- " £C - 

This spectrum differs markedly from the analytic case: only a small number 
of isolated eigenvalues remain, enclosed between the spectral radius and a 
smaller disk of radius 1/|A|* +1 , see figure 23.5. In literature the radius of 
this disk is called the essential spectral radius. 

In sect. 23.4 we discussed this point further, with the aid of a less trivial 
1 -dimensional example. The physical point of view is complementary to 
the standard setting of ergodic theory, where many chaotic properties of a 
dynamical system are encoded by the presence of a continuous spectrum, 
used to prove asymptotic decay of correlations in the space of L 2 square - 
integrable functions. 

A deceptively innocent assumption is hidden beneath much that was dis- 
cussed so far: that (23.1) maps a given function space into itself. The ex- 
panding property of the map guarantees that: if f(x) is smooth in a do- 
main D then f(x/A) is smooth on a larger domain, provided |A| > 1. For 
higher-dimensional hyperbolic flows this is not the case, and, as we saw in 
sect. 23.5, extensions of the results obtained for expanding 1 -dimensional 
maps are highly nontrivial. 

It is not at all clear that the above analysis of a simple one-branch, one fixed 
point repeller can be extended to dynamical systems with Cantor sets of 
periodic points: we showed this in sect. 23.4. 



exercise 23.2 



Resume 



Examples of analytic eigenf unctions for 1 -dimensional maps are seductive, and 
make the problem of evaluating ergodic averages appear easy; just integrate over 
the desired observable weighted by the natural measure, right? No, generic natural 
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measure sits on a fractal set and is singular everywhere. The point of this book 
is that you never need to construct the natural measure, cycle expansions will do 
that job. 

A theory of evaluation of dynamical averages by means of trace formulas 
and spectral determinants requires a deep understanding of their analyticity and 
convergence. We worked here through a series of examples: 

1 . exact spectrum (but for a single fixed point of a linear map) 

2. exact spectrum for a locally analytic map, matrix representation 

3. rigorous proof of existence of discrete spectrum for 2-dimensional hyper- 
bolic maps 

In the case of especially well-behaved "Axiom A" systems, where both the 
symbolic dynamics and hyperbolicity are under control, it is possible to treat 
traces and determinants in a rigorous fashion, and strong results about the ana- 
lyticity properties of dynamical zeta functions and spectral determinants outlined 
above follow. 

Most systems of interest are not of the "axiom A" category; they are neither 
purely hyperbolic nor (as we have seen in chapters 11 and 12 ) do they have 
finite grammar. The importance of symbolic dynamics is generally grossly under 
appreciated; the crucial ingredient for nice analyticity properties of zeta functions 
is the existence of a finite grammar (coupled with uniform hyperbolicity). 

The dynamical systems which are really interesting - for example, smooth 
bounded Hamiltonian potentials - are presumably never fully chaotic, and the 
central question remains: How do we attack this problem in a systematic and 
controllable fashion? 
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Theorem: Conjecture 3 with technical hypothesis is true 
in a lot of cases. 
— M. Shub 



Commentary 

Remark 23.1 Surveys of rigorous theory. We recommend the references listed in 
remark 1.1 for an introduction to the mathematical literature on this subject. For a physi- 
cist, Driebe's monograph [ 19] might be the most accessible introduction into mathemat- 
ics discussed briefly in this chapter. There are a number of reviews of the mathematical 
approach to dynamical zeta functions and spectral determinants, with pointers to the orig- 
inal references, such as refs. [1, 2]. An alternative approach to spectral properties of the 
Perron-Frobenius operator is given in ref. [3]. 

Ergodic theory, as presented by Sinai [14] and others, tempts one to describe the 
densities on which the evolution operator acts in terms of either integrable or square- 
integrable functions. For our purposes, as we have already seen, this space is not suitable. 
An introduction to ergodic theory is given by Sinai, Kornfeld and Fomin [15]; more ad- 
vanced old-fashioned presentations are Walters [ 1 2] and Denker, Grillenberger and Sig- 
mund [ I (>]; and a more formal one is given by Peterson [17]. 

Remark 23.2 Fredholm theory. Our brief summary of Fredholm theory is based on 
the exposition of ref. [4]. A technical introduction of the theory from an operator point of 
view is given in ref. [5]. The theory is presented in a more general form in ref. [ 6]. 

Remark 23.3 Bernoulli shift. For a more in-depth discussion, consult chapter 3 
of ref. [19]. The extension of Fredholm theory to the case or Bernoulli shift on C k+a 
(in which the Perron-Frobenius operator is not compact - technically it is only quasi- 
compact. That is, the essential spectral radius is strictly smaller than the spectral radius) 
has been given by Ruelle [7]: a concise and readable statement of the results is contained 
in ref. [8]. We see from (23.31) that for the Bernoulli shift the exponential decay rate 
of correlations coincides with the Lyapunov exponent: while such an identity holds for a 
number of systems, it is by no means a general result, and there exist explicit counterex- 
amples. 

Remark 23.4 Hyperbolic dynamics. When dealing with hyperbolic systems one 
might try to reduce to the expanding case by projecting the dynamics along the unstable 
directions. As mentioned in the text this can be quite involved technically, as such unstable 
foliations are not characterized by strong smoothness properties. For such an approach, 
see ref. [3]. 

Remark 23.5 Spectral determinants for smooth flows. The theorem on page 467 also 
applies to hyperbolic analytic maps in d dimensions and smooth hyperbolic analytic flows 
in (of + 1) dimensions, provided that the flow can be reduced to a piecewise analytic map 
by a suspension on a Poincare section, complemented by an analytic "ceiling" function 
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(3.5) that accounts for a variation in the section return times. For example, if we take 
as the ceiling function g(x) = e sT( - x) , where T(x) is the next Poincare section time for a 
trajectory staring at x, we reproduce the flow spectral determinant ( 19.13). Proofs are 
beyond the scope of this chapter. 

Remark 23.6 Explicit diagonalization. For 1 -dimensional repellers a diagonalization 
of an explicit truncated L mn matrix evaluated in a judiciously chosen basis may yield 
many more eigenvalues than a cycle expansion (see refs. [ 10, 1 1]). The reasons why one 
persists in using periodic orbit theory are partially aesthetic and partially pragmatic. The 
explicit calculation of L„,„ demands an explicit choice of a basis and is thus non-invariant, 
in contrast to cycle expansions which utilize only the invariant information of the flow. In 
addition, we usually do not know how to construct L„,„ for a realistic high-dimensional 
flow, such as the hyperbolic 3-disk game of pinball flow of sect. 1.3, whereas periodic 
orbit theory is true in higher dimensions and straightforward to apply. 

Remark 23.7 Perron-Frobenius theorem. A proof of the Perron-Frobenius the- 
orem may be found in ref. [12]. For positive transfer operators, this theorem has been 
generalized by Ruelle [13]. 

Remark 23.8 Axiom A systems. The proofs in sect. 23.5 follow the thesis 

work of H.H. Rugh [9, 18, 19]. For a mathematical introduction to the subject, consult 
the excellent review by V. Baladi [ 1 ]. It would take us too far afield to give and explain 
the definition of Axiom A systems (see refs. [27, 28]). Axiom A implies, however, the 
existence of a Markov partition of the state space from which the properties 2 and 3 
assumed on page 456 follow. 

Remark 23.9 Left eigenfunctions. We shall never use an explicit form of left eigen- 
functions, corresponding to highly singular kernels like (23.33). Many details have been 
elaborated in a number of papers, such as ref. [20], with a daring physical interpretation. 

Remark 23.10 Ulam's idea. The approximation of Perron-Frobenius operator defined 
by (16.14) has been shown to reproduce the spectrum for expanding maps, once finer 
and finer Markov partitions are used [21]. The subtle point of choosing a state space 
partitioning for a "generic case" is discussed in ref. [22]. 
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Exercises 



23.1. What space does £ act on? Show that (23.2) is a 
complete basis on the space of analytic functions on a 
disk (and thus that we found the complete set of eigen- 
values). 

23.2. What space does £. act on? What can be said about 
the spectrum of (23.1) on L l [Q, 1]? Compare the result 
with figure 23.5. 

23.3. Euler formula. Derive the Euler formula (23.5), 
\u\ < 1: 



ria + m k ) = i + — — + 

u 1 - u 



(1 - K)(l - U 2 ) 



(1- M )(1-M 2 )(1-M 3 ) 

CO Ki-l) 

yv «j 

4-i (l -«)•••(! -«*) 



i=0 



23.4. 2 -dimensional product expansion**. We conjecture 
that the expansion corresponding to exercise 23.3 is in 
the 2-dimensional case given by 



~[(1 + tu k ) 



k\k+l 



- S 

k=0 
= 1 + 



F k (u) 



(1 - w) 2 (l - u 2 ) 2 ■ ■ ■ (1 - u k ) 2 



2u 



(1-m) 2 (1 -m) 2 (1 -m 2 ) 2 
m 2 (1 +4u + u 1 ) 



(1 - M) 2 (l - « 2 ) 2 (1 - W 3 ) 2 



r 3 + 



i=0 



^(m) is a polynomial in u, and the coefficients fall off 

3/2 

asymptotically as C„ ~ m" . Verify; if you have a proof 
to all orders, e-mail it to the authors. (See also solu- 
tion 23.3). 

23.5. Bernoulli shift on L spaces. Check that the family 
(23.21) belongs to L'([0, 1]). What can be said about 
the essential spectral radius on L 2 ([0, 1])? A useful ref- 
erence is [24] . 

23.6. Cauchy integrals. Rework all complex analysis steps 
used in the Bernoulli shift example on analytic functions 
on a disk. 

23.7. Escape rate. Consider the escape rate from a strange 
repeller: find a choice of trial functions £ and if such 
that (23.29) gives the fraction on particles surviving after 
n iterations, if their initial density distribution is po(x). 
Discuss the behavior of such an expression in the long 
time limit. 
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Chapter 24 
Intermittency 



Sometimes They Come Back 
— Stephen King 

(R. Artuso, P. Dahlqvist, G. Tanner and P. Cvitanovic) 




n the theory of chaotic dynamics developed so far we assumed that the evolu- 
tion operators have discrete spectra {zo,Zi,Z2, ■ ■ -I given by the zeros of 



* 

The assumption was based on the tacit premise that the dynamics is everywhere 
exponentially unstable. Real life is nothing like that - state spaces are gener- 
ically infinitely interwoven patterns of stable and unstable behaviors. The 
stable (in the case of Hamiltonian flows, integrable) orbits do not communicate 
with the ergodic components of the phase space, and can be treated by classical 
methods. In general, one is able to treat the dynamics near stable orbits as well 
as chaotic components of the phase space dynamics well within a periodic orbit 
approach. Problems occur at the borderline between chaos and regular dynamics 
where marginally stable orbits and manifolds present difficulties and still unre- 
solved challenges. 

We shall use the simplest example of such behavior - intermittency in 1- 
dimensional maps - to illustrate effects of marginal stability. The main message 
will be that spectra of evolution operators are no longer discrete, dynamical zeta 
functions exhibit branch cuts of the form 

i/iiz) = o-o + (i-zro-o, 

and correlations decay no longer exponentially, but as power laws. 
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Figure 24.1: Typical phase space for an area- 
preserving map with mixed phase space dynamics; 
here the standard map for k = 1.2 . 




24.1 Intermittency everywhere 

In many fluid dynamics experiments one observes transitions from regular behav- 
iors to behaviors where long time intervals of regular behavior ("laminar phases") 
are interrupted by fast irregular bursts. The closer the parameter is to the onset of 
such bursts, the longer are the intervals of regular behavior. The distributions of 
laminar phase intervals are well described by power laws. 

This phenomenon is called intermittency, and it is a very general aspect of 
dynamics, a shadow cast by non-hyperbolic, marginally stable state space regions. 
Complete hyperbolicity assumed in (18.5) is the exception rather than the rule, 
and for almost any dynamical system of interest (dynamics in smooth potentials, 
billiards with smooth walls, the infinite horizon Lorentz gas, etc.) one encounters 
mixed state spaces with islands of stability coexisting with hyperbolic regions, 
see figure 24.1 andexample 7.7. Wherever stable islands are interspersed with 
chaotic regions, trajectories which come close to the stable islands can stay 'glued' 
for arbitrarily long times. These intervals of regular motion are interrupted by 
irregular bursts as the trajectory is re-injected into the chaotic part of the phase 
space. How the trajectories are precisely 'glued' to the marginally stable region is 
often hard to describe. What coarsely looks like a border of an island will under 
magnification dissolve into infinities of island chains of decreasing sizes, broken 
tori and bifurcating orbits, as illustrated in figure 24. 1 . 

Intermittency is due to the existence of fixed points and cycles of marginal 
stability (5.8), or (in studies of the onset of intermittency) to the proximity of a 
nearly marginal complex or unstable orbits. In Hamiltonian systems intermittency 
goes hand in hand with the existence of (marginally stable) KAM tori. In more 
general settings, the existence of marginal or nearly marginal orbits is due to in- 
complete intersections of stable and unstable manifolds in a Smale horseshoe type 
dynamics (see figure 12.1 1). Following the stretching and folding of the invariant 
manifolds in time one will inevitably find state space points at which the stable and 
unstable manifolds are almost or exactly tangential to each other, implying non- 
exponential separation of nearby points in state space or, in other words, marginal 
stability. Under small parameter perturbations such neighborhoods undergo tan- 
gent bifurcations - a stable/unstable pair of periodic orbits is destroyed or created 
by coalescing into a marginal orbit, so the pruning which we shall encounter in 

chapter 12, and the intermittency discussed here are two sides of the same coin. 

r section 12.4 
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f(x) 



Figure 24.2: A complete binary repeller with a 
marginal fixed point. 



Figure 24.3: (a) A tent map trajectory, (b) A 
Farey map trajectory. 




0 50 100 150 200 250 300 0 200 400 600 



How to deal with the full complexity of a typical Hamiltonian system with 
mixed phase space is a very difficult, still open problem. Nevertheless, it is pos- 
sible to learn quite a bit about intermittency by considering rather simple exam- 
ples. Here we shall restrict our considerations to 1 -dimensional maps which in the 
neighborhood of a single marginally stable fixed point at x=0 take the form 

x ^ f{x) = x + 0(x l+s ) , (24.1) 



and are expanding everywhere else. Such a map may allow for escape, like the 
map shown in figure 24.2 or the dynamics may be bounded, like the Farey map 



Figure 24.3 compares a trajectory of the tent map (11.4) side by side with a 
trajectory of the Farey map. In a stark contrast to the uniformly chaotic trajectory 
of the tent map, the Farey map trajectory alternates intermittently between slow 
regular motion close to the marginally stable fixed point, and chaotic bursts. 



section 24.3.4 
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The presence of marginal stability has striking dynamical consequences: cor- 
relation decay may exhibit long range power law asymptotic behavior and diffu- 
sion processes can assume anomalous character. Escape from a repeller of the 
form figure 24.2 may be algebraic rather than exponential. In long time explo- 
rations of the dynamics intermittency manifests itself by enhancement of natural 
measure in the proximity of marginally stable cycles. 

The questions we shall address here are: how does marginal stability affect 
zeta functions or spectral determinants? And, can we deduce power law decays of 
correlations from cycle expansions? 

In example 23.5 we saw that marginal stability violates one of the conditions 
which ensure that the spectral determinant is an entire function. Already the sim- 
ple fact that the cycle weight 1/|1 - A£| in the trace (18.3) or the spectral determi- 
nant (19.3) diverges for marginal orbits with |A p | = 1 tells us that we have to treat 
these orbits with care. 

In the following we will incorporate marginal stability orbits into cycle-expansions 
in a systematic manner. To get to know the difficulties lying ahead, we will start 
in sect. 24.2 with a piecewise linear map, with the asymptotics (24.1). We will 
construct a dynamical zeta function in the usual way without worrying too much 
about its justification and show that it has a branch cut singularity. We will cal- 
culate the rate of escape from our piecewise linear map and find that it is charac- 
terized by decay, rather than exponential decay, a power law. We will show that 
dynamical zeta functions in the presence of marginal stability can still be written 
in terms of periodic orbits, exactly as in chapters 17 and 22, with one exception: 
the marginally stable orbits have to be explicitly excluded. This innocent looking 
step has far reaching consequences; it forces us to change the symbolic dynamics 
from a finite to an infinite alphabet, and entails a reorganization of the order of 
summations in cycle expansions, sect. 24.2.4. 

Branch cuts are typical also for smooth intermittent maps with isolated marginally 
stable fixed points and cycles. In sect. 24.3, we discuss the cycle expansions and 
curvature combinations for zeta functions of smooth maps tailored to intermit- 
tency. The knowledge of the type of singularity one encounters enables us to 
develop the efficient resummation method presented in sect. 24.3.1. 

Finally, in sect. 24.4, we discuss a probabilistic approach to intermittency that 
yields approximate dynamical zeta functions and provides valuable information 
about more complicated systems, such as billiards. 



24.2 Intermittency for pedestrians 

Intermittency does not only present us with a large repertoire of interesting dy- 
namics, it is also at the root of many sorrows such as slow convergence of cycle 
expansions. In order to get to know the kind of problems which arise when study- 
ing dynamical zeta functions in the presence of marginal stability we will consider 
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Figure 24.4: A piecewise linear intermittent map of 
(24.3) type: more specifically, the map piecewise lin- 
ear over intervals (24.9) of the toy example studied be- 
low, a = .5, b = .6, s = 1.0. 




an artfully concocted piecewise linear model first. From there we will move on to 
the more general case of smooth intermittant maps, sect. 24.3. 



24.2.1 A toy map 

The Bernoulli shift map (23.6) is an idealized, but highly instructive, example 
of a hyperbolic map. To study intermittency we will now construct a likewise 
piecewise linear model, an intermittent map stripped down to its bare essentials. 

Consider a map x i-> f(x) on the unit interval At - [0, 1] with two monotone 
branches 



/(*) = 



fo(x) for x e Ato = [0, a] 
/i(jc) for xeMi = [b,l] 



(24.3) 



The two branches are assumed complete, that is fo(Mo) = /i(Ati) = At. The map 
allows escape if a < b and is bounded if a = b (see figure 24.2 and figure 24.4). 
We take the right branch to be expanding and linear: 



fi(x) 



1 



1 -b 



(x-b). 



Next, we will construct the left branch in a way, which will allow us to model 
the intermittent behavior (24.1) near the origin. We chose a monotonically de- 
creasing sequence of points q n in [0, a] with q\ - a and q n — » 0 as n — > oo. 
This sequence defines a partition of the left interval At into an infinite number of 
connected intervals At,, n > 2 with 



At„ = ]q n ,q n -\] and Ato = |J At„. 



(24.4) 



n=2 



The map fo(x) is now specified by the following requirements 



fo(x) is continuous. 
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• fo(x) is linear on the intervals At, for n > 2. 

• Mq n ) - q n -\, that is M n = / 0 ~ " +1 ([a, 1]) . 

This fixes the map for any given sequence {§,}. The last condition ensures the 
existence of a simple Markov partition. The slopes of the various linear segments 
are 

/o« = M Vli {gn) = W forxeM n , n>3 

/o« = /0( t ) lf g2) = forxeM 2 (24.5) 

with \M n \ = q n -\ - qn for n > 2. Note that we do not require as yet that the map 
exhibit intermittent behavior. 

We will see that the family of periodic orbits with code 10' plays a key role 
for intermittent maps of the form (24.1). An orbit 10" enters the intervals All — > 
M n +i — > At,, —>...—» AI2 successively and the family approaches the marginal 
stable fixed point at x - 0 for n —> 00. The stability of a cycle 10 1 for /1 > 1 is 
given by the chain rule (4.42), 

Aur = /oCt« + i)/o(*«) • • • /o(*2)/i(*i) = * , (24.6) 

with x,- € A!,-. 

The properties of the map (24.3) are completely determined by the sequence 
{q n \. By choosing q n = 2~", for example, we recover the uniformly hyperbolic 
Bernoulli shift map (23.6). An intermittent map of the form (24.4) having the 
asymptotic behavior (24.1) can be constructed by choosing an algebraically de- 
caying sequence {q n } behaving asymptotically like 

q n ~ 4t7, (24.7) 

n i/s 

where s is the intermittency exponent in (24.1). Such a partition leads to intervals 
whose length decreases asymptotically like a power-law, that is, 

\M n \ ~ . (24.8) 

As can be seen from (24.6), the Floquet multipliers of periodic orbit families ap- 
proaching the marginal fixed point, such as the 10 1 family increase in turn only 
algebraically with the cycle length. 

It may now seem natural to construct an intermittent toy map in terms of a 
partition |Al„| = l/n l+l ^', that is, a partition which follows (24.8) exactly. Such 
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a choice leads to a dynamical zeta function which can be written in terms of so- 
called Jonquiere functions (or polylogarithms) which arise naturally also in the 
context of the Farey map (24.2), and the anomalous diffusion of sect. 25.3. We 
will, however, not go along this route here; instead, we will engage in a bit of 
reverse engineering and construct a less obvious partition which will simplify the 
algebra considerably later without loosing any of the key features typical for inter- 
mittent systems. We fix the intermittent toy map by specifying the intervals Af, 
in terms of Gamma functions according to 



T(n + m- l/s- 1) 

\M n \=C-—- '- for n>2, (24.9) 

1 (n + m) 



where m = [l/s] denotes the integer part of l/s and C is a normalization constant 
fixed by the condition Y^=2 \M n \ = q\ - a, that is, 



C = a 



T(n- l/s) 



T(n + 1) 

n=m+l 



(24.10) 



Using Stirling's formula for the Gamma function 

r(z) ~ e- z z z ~ 1/2 V5r(l + l/12z + . . .) , 

we verify that the intervals decay asymptotically like tf (l+l ^ s \ as required by the 
condition (24.8). 

Next, let us write down the dynamical zeta function of the toy map in terms 
of its periodic orbits, that is 



W(d= n('-£) 



remark 25.7 



One may be tempted to expand the dynamical zeta function in terms of the binary 
symbolic dynamics of the map; we saw, however, in sect. 20.6 that such cycle ex- 
pansion converges extremely slowly. The shadowing mechanism between orbits 
and pseudo-orbits fails for orbits of the form 10 1 with stabilities given by (24.6), 
due to the marginal stability of the fixed point 0. It is therefore advantageous to 
choose as the fundamental cycles the family of orbits with code 10 1 or, equiva- 
lently, switch from the finite (binary) alphabet to an infinite alphabet given by 



1(f -i 



Due to the piecewise-linear form of the map which maps intervals A\, exactly 
onto M n -\, all periodic orbits entering the left branch at least twice are canceled 
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exactly by pseudo cycles, and the cycle expanded dynamical zeta function depends 
only on the fundamental series 1, 10, 100, . . .: 



The fundamental term (20.7) consists here of an infinite sum over algebraically 
decaying cycle weights. The sum is divergent for \z\ > 1. We will see that this 
behavior is due to a branch cut of 1 starting at z = 1 - We need to find analytic 
continuations of sums over algebraically decreasing terms in (24.11). Note also 
that we omitted the fixed point 0 in the above Euler product; we will discussed 
this point as well as a proper derivation of the zeta function in more detail in 
sect. 24.2.4. 



24.2.2 Branch cuts 

Starting from the dynamical zeta function (24.11), we first have to worry about 
finding an analytical continuation of the sum for |z| > 1. We do, however, get this 
part for free here due to the particular choice of interval lengths made in (24.9). 
The sum over ratios of Gamma functions in (24.1 1) can be evaluated analytically 
by using the following identities valid for l/s = a > 0 (the famed binomial 
theorem in disguise), 

• a non-integer 





(24.11) 




(24.12) 



• a integer 



(i 




(24.13) 



n=l 



+ 




with 
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In order to simplify the notation, we restrict the intermittency parameter to the 
range 1 < l/s < 2 with [l/s] —m— 1. All what follows can easily be generalized 
to arbitrary s > 0 using equations (24.12) and (24.13). The infinite sum in (24.11) 
can now be evaluated with the help of (24.12) or (24.13), that is, 

y Y{n- l/s) n = ( r(-i) [(1 - z) 1,s - 1 + \z] for 1 < l/s < 2; 
^T(n + l) Z \ (1 -z)log(l -z) + z for s=l. 

The normalization constant C in (24.9) can be evaluated explicitly using (24.10) 
and the dynamical zeta function can be given in closed form. We obtain for 1 < 
l/s < 2 

1/S(z) = 1 - (1 - b)z - ,/ , \~ b ((1 - z) i/s - 1 + -z) . (24.14) 
l/s - 1 1 - a \ s J 

and f or s = I, 

l/az) = 1 - (1 - b)z - a-f^- ((1 - z )log(l -z)+z). (24.15) 

I - a 

It now becomes clear why the particular choice of intervals At, made in the last 
section is useful; by summing over the infinite family of periodic orbits 0 1 1 ex- 
plicitly, we have found the desired analytical continuation for the dynamical zeta 
function for \z\ > 1. The function has a branch cut stalling at the branch point z- I 
and running along the positive real axis. That means, the dynamical zeta function 
takes on different values when approaching the positive real axis for Re z > 1 from 
above and below. The dynamical zeta function for general s > 0 takes on the form 

l/az) = l-{l-b)z- "\~ b * ((1- z) l/s - g s (zj) (24.16) 
g s (l) 1 -az m - 1 v ' 

for non-integer s with m — [l/s] and 

l/az) - 1-(1 -b)z ~t7t~~~~y ((1 - z) m log(l -z) - gmiz)) (24.17) 

g m (l) 1 -az' n ~ l 

for l/s = m integer and g s (z) are polynomials of order m = [l/s] which can 
be deduced from (24.12) or (24.13). We thus find algebraic branch cuts for non 
integer intermittency exponents 1/s and logarithmic branch cuts for l/s integer. 
We will see in sect. 24.3 that branch cuts of that form are generic for 1 -dimensional 
intermittent maps. 

Branch cuts are the all important new feature of dynamical zeta functions due 
to intermittency. So, how do we calculate averages or escape rates of the dynamics 
of the map from a dynamical zeta function with branch cuts? We take 'a learning 
by doing' approach and calculate the escape from our toy map for a < b. 
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Figure 24.5: The survival probability r„ calcu- 
lated by contour integration; integrating (24.18) 
inside the domain of convergence \z\ < 1 (shaded 
area) of l/£(z) in periodic orbit representation 
yields (18.26). A deformation of the contour -f r 
(dashed line) to a larger circle y~ R gives contribu- 
tions from the poles and zeros (x) of l/f(z) be- 
tween the two circles. These are the only contribu- 
tions for hyperbolic maps (a), for intermittent sys- 
tems additional contributions arise, given by the 
contour y cu , running along the branch cut (b). 



(a) 




24.2.3 Escape rate 

Our starting point for the calculation of the fraction of survivors after n time steps, 
is the integral representation (19.19) 



where the contour encircles the origin in the clockwise direction. If the contour 
lies inside the unit circle \z\ = 1, we may expand the logarithmic derivative of 
£~ l (z) as a convergent sum over all periodic orbits. Integrals and sums can be 
interchanged, the integrals can be solved term by term, and the formula (18.26) 
is recovered. For hyperbolic maps, cycle expansion methods or other techniques 
may provide an analytic extension of the dynamical zeta function beyond the lead- 
ing zero; we may therefore deform the original contour into a larger circle with ra- 
dius R which encircles both poles and zeros of C 1 ^), see figure 24.5 (a). Residue 
calculus turns this into a sum over the zeros Za and poles zp of the dynamical zeta 
function, that is 



where the last term gives a contribution from a large circle y R . We thus find 
exponential decay of T n dominated by the leading zero or pole of £~ l (z)- 

Things change considerably in the intermittent case. The point z = 1 is a 
branch cut singularity and there exists no Taylor series expansion of ^ 1 around 
z — 1. Second, the path deformation that led us to (24.19) requires more care, as it 
must not cross the branch cut. When expanding the contour to large |z| values, we 
have to deform it along the branch Re (z) > 1 , Im (z) = 0 encircling the branch cut 
in anti-clockwise direction, see figure 24.5 (b). We will denote the detour around 
the cut as y cut . We may write symbolically 




(24.18) 




(24.19) 




zeros poles 
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where the sums include only the zeros and the poles in the area enclosed by the 
contours. The asymptotics is controlled by the zero, pole or cut closest to the 
origin. 

Let us now go back to our intermittent toy map. The asymptotics of the sur- 
vival probability of the map is here governed by the behavior of the integrand 
4- log £~ l in (24.18) at the branch point z = 1. We restrict ourselves again to the 
case 1 < l/s < 2 first and write the dynamical zeta function (24.14) in the form 

l/<T(z) = a 0 + ai(l - z) + b Q (l - z) lls = G(l - z) 

and 

b — a , a \ — b 



a 0 



I — a' 1 — l/s 1 — a 

Setting u — 1 - z, we need to evaluate 



£ (! - «0~" t~ !og G(u)du (24.20) 
2m J 7cut du 



where y cut goes around the cut (i.e., the negative u axis). Expanding the integrand 
d_ 

du 



4- log G{u) - G'(u)/G(u) in powers of u and m 1 ^ at u - 0, one obtains 



^-logG(n) = ^ + -^u 1 ^ 1 + 0{u) . (24.21) 



The integrals along the cut may be evaluated using the general formula 



— £ u a {\- uy n du = F( " a 1} ~ ^-(1 + 0(1 In)) (24.22) 



which can be obtained by deforming the contour back to a loop around the point 
u = 1, now in positive (anti-clockwise) direction. The contour integral then picks 
up the («-l)st term in the Taylor expansion of the function if at u = 1, cf. (24. 12). 
For the continuous time case the corresponding formula is 



Plugging (24.21) into (24.20) and using (24.22) we get the asymptotic result 



b 0 1 1 1 a 1 - b 1 1 

p - - (24 24) 

" ao sT(l - l/s)n l ' s s-lb-aT(l-l/s) n V s ' 
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Figure 24.6: The asymptotic escape from an intermit- 
tent repeller is a power law. Normally it is preceded 
by an exponential, which can be related to zeros close 
to the cut but beyond the branch point z = 1, as in 
figure 24.5 (b). 




200 400 600 800 1000 



We see that, asymptotically, the escape from an intermittent repeller is described 
by power law decay rather than the exponential decay we are familial - with for 
hyperbolic maps; a numerical simulation of the power-law escape from an inter- 
mittent repeller is shown in figure 24.6. 

For general non-integer l/s > 0, we write 

l/£( z ) = A(u) + (u) l/s B(u) = G(u) 

with u = 1 - z and A(u), B{u) are functions analytic in a disc of radius 1 around 
u = 0. The leading terms in the Taylor series expansions of A(u) and B(u) are 



b — a a I — 

a 0 



I -a' g s (l)l-a' 



see (24.16). Expanding J^logG(w) around u = 0, one again obtains leading or- 
der contributions according to (24.21) and the general result follows immediately 
using (24.22), that is, 

a 1 -b 1 1 

r " ~ sg s (l)b-aT(l - l/s)n^ s ' (24.25) 



Applying the same arguments for integer intermittency exponents l/s = m, one 
obtains 

r „~ (24.26) 
sgmW b-an m 

So far, we have considered the survival probability for a repeller, that is we 
assumed a < b. The formulas (24.25) and (24.26) do obviously not apply for the 
case a = b, that is, for the bounded map. The coefficient ao = (b — a)/(l - a) 
in the series representation of G{u) is zero, and the expansion of the logarithmic 
derivative of G(u) (24.21) is no longer valid. We get instead 



d [ f (l + 0{u lls l )) s< I 

du 5 ( ( + 0(« 7 )) s > 1 



inter - 19nov20I2 



ChaosBook.org version 14.4. 1, Apr 21 2013 



CHAPTER 24. INTERMITTENCY 



491 



assuming non-integer l/s for convenience. One obtains for the survival probabil- 
ity. 




1 + 0(n 1_1/s ) s < 1 
lls + 0{n { l s ~ l ) s>\ 



For s > 1 , this is what we expect. There is no escape, so the survival probability 
is equal to 1 , which we get as an asymptotic result here. The result for s > 1 is 
somewhat more worrying. It says that r„ defined as sum over the instabilities of 
the periodic orbits as in (22. 1 8) does not tend to unity for large n. However, the 
case s > 1 is in many senses anomalous. For instance, the invariant density cannot 
be normalized. It is therefore not reasonable to expect that periodic orbit theories 
will work without complications. 

24.2.4 Why does it work (anyway)? 

Due to the piecewise linear nature of the map constructed in the previous section, 
we had the nice property that interval lengths did exactly coincide with the inverse 
of the stability of periodic orbits of the system, that is 



There is thus no problem in replacing the survival probability Y n given by (1.2), 
(22.2), that is the fraction of state space At surviving n iterations of the map, 



by a sum over periodic orbits of the form (18.26). The only orbit to worry about is 
the marginal fixed point 0 itself which we excluded from the zeta function (24. 11). 

For smooth intermittent maps, things are less clear and the fact that we had to 
prune the marginal fixed point is a warning sign that interval estimates by periodic 
orbit stabilities might go horribly wrong. The derivation of the survival probability 
in terms of cycle stabilities in chapter 22 did indeed rely heavily on a hyperbolicity 
assumption which is clearly not fulfilled for intermittent maps. We therefore have 
to carefully reconsider this derivation in order to show that periodic orbit formulas 
are actually valid for intermittent systems in the first place. 

We will for simplicity consider maps, which have a finite number of say s 
branches defined on intervals M s and we assume that the map maps each inter- 
val M s onto At, that is f(M s ) = At. This ensures the existence of a complete 
symbolic dynamics - just to make things easy (see figure 24.2). 



\M n \ = 1/IAiol 



in-l 




(») 
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The generating partition is composed of the domains M- The «th level parti- 
tion C ( ") - {Mi} can be constructed iteratively. Here fs are words i — S1S2 ■ ■ ■ s„ 
of length n, and the intervals M are constructed recursively 

M sj = f;\M j ), (24.27) 

where sj is the concatenation of letter s with word j of length nj < n. 

In what follows we will concentrate on the survival probability T n , postponing 
other quantities of interest, such as averages, to later considerations. In establish- 
ing the equivalence of the survival probability and the periodic orbit formula for 
the escape rate for hyperbolic systems we have assumed that the map is expand- 
ing, with a minimal expansion rate \f'(x)\ > A m j n > 1. This enabled us to bound 
the size of every survivor strip M by (22.6), the stability A,- of the periodic orbit i 
within the Mi, and bound the survival probability by the periodic orbit sum (22.7). 



The bound (22.6) 

c _L<^d <r J_ 

|A,-| \M\ 2 |A«| 

relies on hyperbolicity, and is thus indeed violated for intermittent systems. The 
problem is that now there is no lower bound on the expansion rate, the minimal 
expansion rate is Ar^n = 1. The survivor strip Myi which includes the marginal 
fixed point is thus completely overestimated by 1/|Aq»| = 1 which is constant for 
all n. 

However, bounding survival probability strip by strip is not what is required 
for establishing the bound (22.7). For intermittent systems a somewhat weaker 
bound can be established, saying that the average size of intervals along a periodic 
orbit can be bounded close to the stability of the periodic orbit for all but the 
interval My. The weaker bound applies to averaging over each prime cycle p 
separately 

Cl J_<±y^d <C2 J_, (2 4.28) 
\ A p\ n P^\M\ |A/ 

where the word i represents a code of the periodic orbit p and all its cyclic permu- 
tations. It can be shown that one can find positive constants Q, C2 independent 
of p. Summing over all periodic orbits leads then again to (22.7). 

To study averages of multiplicative weights we follow sect. 17. 1 and introduce 
a state space observable a{x) and the integrated quantity 



n- 1 



exercise 19.7 



A n {x) = Y j a{f\x)). 



k=0 
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o 

Figure 24.7: Transition graph corresponding to the al- 
phabet (0*- 1 1;0, k> 1} 




This leads us to introduce the generating function (17.10) 

where (.) denote some averaging over the distribution of initial points, which we 
choose to be uniform (rather than the a priori unknown invariant density). Again, 
all we have to show is, that constants C\ , C2 exist, such that 



Cl j£<±£ » f (24.29) 
|A.p| n p \M\ J Mq |A P | 

is valid for all p. After performing the above average one gets 

C i r »^)<^7 f eP A( - x ' n) dx<C 2 T n (J3), (24.30) 
\Jy\\ Jm 



with 

' M = ^K\- <2431) 

and a dynamical zeta function can be derived. In the intermittent case one can 
expect that the bound (24.29) holds using an averaging argument similar to the 
one discussed in (24.28). This justifies the use of dynamical zeta functions for 
intermittent systems. 

One lesson we should have learned so far is that the natural alphabet to use 
is not {0,1} but rather the infinite alphabet {&~ l l,0; k > 1}. The symbol 0 
occurs unaccompanied by any l's only in the 0 marginal fixed point which is 
disconnected from the rest of the transition graph, see figure24.7. 

What happens if we remove a single prime cycle from a dynamical zeta func- 
tion? In the hyperbolic case such a removal introduces a pole in the 1 /£ and slows 
down the convergence of cycle expansions. The heuristic interpretation of such a 
pole is that for a subshift of finite type removal of a single prime cycle leads to 
unbalancing of cancellations within the infinity of of shadowing pairs. Neverthe- 
less, removal of a single prime cycle is an exponentially small perturbation of the 
trace sums, and the asymptotics of the associated trace formulas is unaffected. 



chapter 12 



chapter 23 



inter - 19nov20I2 



ChaosBook.org version 14.4. 1, Apr 21 2013 



CHAPTER 24. INTERMITTENCY 



494 



In the intermittent case, the fixed point 0 does not provide any shadowing , 
and a statement such as 



is meaningless. It seems therefore sensible to take out the factor (1 - = 1 - z 
from the product representation of the dynamical zeta function (19.15), that is, to 
consider a pruned dynamical zeta function 1 /£ n ter(z) defined by 



We saw in the last sections, that the zeta function 1 IQnteriz) has all the nice prop- 
erties we know from the hyperbolic case, that is, we can find a cycle expansion 
with - in the toy model case - vanishing curvature contributions and we can calcu- 
late dynamical properties like escape after having understood, how to handle the 
branch cut. But you might still be worried about leaving out the extra factor 1 - z 
all together. It turns out, that this is not only a matter of convenience, omitting 
the marginal 0 cycle is a dire necessity. The cycle weight = 1 overestimates 
the corresponding interval length of A\y> in the partition of the state space M by 
an increasing amount thus leading to wrong results when calculating escape. By 
leaving out the 0 cycle (and thus also the A\y< contribution), we are guaranteed to 
get at least the right asymptotical behavior. 

Note also, that if we are working with the spectral determinant (19.3), given 
in product form as 



for intermittent maps the marginal stable cycle has to be excluded. It introduces 
an (unphysical) essential singularity at z - 1 due the presence of a factor (1 - zT 
stemming from the 0 cycle. 



24.3 Intermittency for cyclists 

Admittedly, the toy map is what is says - a toy model. The piece wise linear- 
ity of the map led to exact cancellations of the curvature contributions leaving 
only the fundamental terms. There are still infinitely many orbits included in the 
fundamental term, but the cycle weights were chosen in such a way that the zeta 
function could be written in closed form. For a smooth intermittent map this all 
will not be the case in general; still, we will argue that we have already seen al- 
most all the fundamentally new features due to intermittency. What remains are 
technicalities - not necessarily easy to handle, but nothing very surprise any more. 



l/<T(z) = (l-z)l/<W(z). 
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In the following we will sketch, how to make cycle expansion techniques work 
for general 1 -dimensional maps with a single isolated marginal fixed point. To 
keep the notation simple, we will consider two-branch maps with a complete bi- 
nary symbolic dynamics as for example the Farey map, figure24.3, or the repeller 
depicted in figure 24.2. We again assume that the behavior near the fixed point is 
given by (24.1). This implies that the stability of a family of periodic orbits ap- 
proaching the marginally stable orbit, as for example the family 10\ will increase 
only algebraically, that is we find again for large n 



1 1 

Aio" n l+l/s 



where s denotes the intermittency exponent. 

When considering zeta functions or trace formulas, we again have to take out 
the marginal orbit 0; periodic orbit contributions of the form fy>\ are now unbal- 
anced and we arrive at a cycle expansion in terms of infinitely many fundamental 
terms as for our toy map. This corresponds to moving from our binary symbolic 
dynamics to an infinite symbolic dynamics by making the identification 

10"-' -> n; 10"-' lO'"" 1 -> nm; 10"" 1 lO™" 1 If/" 1 -> nmk; . . . 



see also table 24. 1 . The topological length of the orbit is thus no longer determined 
by the iterations of our two-branch map, but by the number of times the cycle 
goes from the right to the left branch. Equivalently, one may define a new map, 
for which all the iterations on the left branch are done in one step. Such a map is 
called an induced map and the topological length of orbits in the infinite alphabet 
corresponds to the iterations of this induced map. 

For generic intermittent maps, curvature contributions in the cycle expanded 
zeta function will not vanish exactly. The most natural way to organize the cycle 
expansion is to collect orbits and pseudo orbits of the same topological length 
with respect to the infinite alphabet. Denoting cycle weights in the new alphabet 

as £ nm ... = fion-no*-!..., one obtains 



r 1 = ~](l-t p ) = l-J]ce (24.32) 

p?0 n=\ 

oo oo 



— 1 ^ ] tn ^ ] ^ ] 2 (?mn tn ) 
n=l m=l n=\ 

CO co co ^ j ^ CO CO CO CO 

~ 2 2 X ( 3 ?fa "" ~ 2 Wa + 6 htmtn) ~ 2 2 2 2 ■ 

k=l m=\ n=l " 1=1 k=\ m=l n=l 



exercise 12.1 



The first sum is the fundamental term, which we have already seen in the toy 
model, (24.11). The curvature terms c„ in the expansion are now e-fold infinite 
sums where the prefactors take care of double counting of prime periodic orbits. 
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Table 24.1: Infinite alphabet versus the original binary alphabet for the shortest periodic 
orbit families. Repetitions of prime cycles (11 = 1 2 , 0101 = 01 2 , . . .) and their cyclic 
repeats (110 = 101,1110 = 1101,...) are accounted for by cancelations and combination 
factors in the cycle expansion (24.32). 



oo - alphabet 






binary alphabet 








n = 1 


n = 2 


n = 3 


n = 4 


n = 5 


1 -cycles 


n 


1 


10 


100 


1000 


10000 


2-cycles 


mn 














In 


11 


110 


1100 


11000 


110000 




In 


101 


0101 


10100 


101000 


1010000 




3n 


1001 


10010 


100100 


1001000 


10010000 




4n 


10001 


100010 


1000100 


10001000 


100010000 


3-cycles 


kmn 














Un 


111 


1110 


11100 


111000 


1110000 




Yin 


1101 


11010 


110100 


1101000 


11010000 




\3n 


11001 


110010 


1100100 


11001000 


110010000 




2\n 


1011 


10110 


101100 


1011000 


10110000 




22n 


10101 


101010 


1010100 


10101000 


101010000 




23n 


101001 


1010010 


10100100 


101001000 


1010010000 




3ln 


10011 


100110 


1001100 


10011000 


100110000 




32n 


100101 


1001010 


10010100 


100101000 


1001010000 




33n 


1001001 


10010010 


100100100 


1001001000 


10010010000 



Let us consider the fundamental term first. For generic intermittent maps, we 
can not expect to obtain an analytic expression for the infinite sum of the form 

oo 

Rz) = Y J h nZ n - (24-33) 

n=0 

with algebraically decreasing coefficients 

h n with a > 0 

n a 

To evaluate the sum, we face the same problem as for our toy map: the power 
series diverges for z > 1, that is, exactly in the 'interesting' region where poles, 
zeros or branch cuts of the zeta function are to be expected. By carefully subtract- 
ing the asymptotic behavior with the help of (24. 12) or (24. 13), one can in general 
construct an analytic continuation of f(z) around z = 1 of the form 



f(z) ~ A{z) + {1 - zf~ l B(z) a£N (24.34) 

f(z) ~ A(z) + (I - z)°" 1 HI - z) aeM, 

where A(z) and B(z) are functions analytic in a disc around z = 1 - We thus again 
find that the zeta function (24.32) has a branch cut along the real axis Rez > 1. 
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From here on we can switch to auto-pilot and derive algebraic escape, decay of 
correlation and all the rest. We find in particular that the asymptotic behavior 
derived in (24.25) and (24.26) is a general result, that is, the survival probability 
is given asymptotically by 

r„ ~ C-^- (24.35) 

for all 1 -dimensional maps of the form (24.1). We have to work a bit harder if 
we want more detailed information like the prefactor C, exponential precursors 
given by zeros or poles of the dynamical zeta function or higher order collections. 
This information is buried in the functions A(z) and B(z) or more generally in the 
analytically continued zeta function. To get this analytic continuation, one may 
follow either of the two different strategies which we will sketch next. 

24.3.1 Resummation 

One way to get information about the zeta function near the branch cut is to de- 
rive the leading coefficients in the Taylor series of the functions A(z) and B(z) in 
(24.34) at z = 1- This can be done in principle, if the coefficients \ in sums like 
(24.33) are known (as for our toy model). One then considers a resummation of 
the form 

CO CO CO 

J] h jZ j = £ a/1 - z) j + (1 - zf~ l £ bj(l - z) j , (24.36) 

j=0 j=0 j=0 

and the coefficients aj and bj are obtained in terms of the hfs by expanding (1-z) 7 
and (1 - z) J+ on the right hand side around z = 0 using (24.12) and equating 
the coefficients. 

In practical calculations one often has only a finite number of coefficients 
hj, 0 < j < N, which may have been obtained by finding periodic orbits and 
their stabilities numerically. One can still design a resummation scheme for the 
computation of the coefficients aj and bj in (24.36). We replace the infinite sums 
in (24.36) by finite sums of increasing degrees i\, and rib, and require that 

n a n b N 

J] ai(l ~ zj + (1 - zf~ l ^ bi(l - zf - J] hd + 0(z N+i ) . (24.37) 

i=0 i=0 (=0 

One proceeds again by expanding the right hand side around z = 0, skipping all 
powers z N+l and higher, and then equating coefficients. It is natural to require that 
\rib + a — 1 - n a \ < 1, so that the maximal powers of the two sums in (24.37) are 
adjacent. If one chooses n a + + 2 = N + \, then, for each cutoff length N, the 
integers n a and are uniquely determined from a linear system of equations. The 
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price we pay is that the so obtained coefficients depend on the cutoff N. One can 
now study convergence of the coefficients aj, and bj, with respect to increasing 
values of N, or various quantities derived from aj and bj. Note that the leading 
coefficients oq and bo determine the prefactor C in (24.35), cf. (24.24). The re- 
summed expression can also be used to compute zeros, inside or outside the radius 
of convergence of the cycle expansion £ hjZ*. 

The scheme outlined in this section tacitly assumes that a representation of 
form (24.34) holds in a disc of radius 1 around z - 1. Convergence is improved 
further if additional information about the asymptotics of sums like (24.33) is used 
to improve the ansatz (24.36). 



24.3.2 Analytical continuation by integral transformations 

We will now introduce a method which provides an analytic continuation of sums 
of the form (24.33) without explicitly relying on an ansatz (24.36). The main 
idea is to rewrite the sum (24.33) as a sum over integrals with the help of the 
Poisson summation formula and find an analytic continuation of each integral by 
contour deformation. In order to do so, we need to know the n dependence of 
the coefficients h n = h(n) explicitly for all n. If the coefficients are not known 
analytically, one may proceed by approximating the large n behavior in the form 

h(n) =n" a (Ci +C 2 n~ l + ...), n * 0 , 

and determine the constants C, numerically from periodic orbit data. By using the 
Poisson resummation identity 

oo oo 

V 6(x - n) = V exp(2mmx) , (24.38) 

n=—oo m=—oo 

we may write the sum as (24.33) 

1 °° fOO 

f(z) = -K0)+ V dxe 2 « imx h{x)z x . (24.39) 

The continuous variable x corresponds to the discrete summation index n and it 
is convenient to write z = r exp(/cr) from now on. The integrals are still not con- 
vergent for r > 0, but an analytical continuation can be found by considering the 
contour integral, where the contour goes out along the real axis, makes a quarter 
circle to either the positive or negative imaginary axis and goes back to zero. By 
letting the radius of the circle go to infinity, we essentially rotate the line of inte- 
gration from the real onto the imaginary axis. For the m = 0 term in (24.39), we 
transform x — > ix and the integral takes on the form 

/~«oo /~»O0 

I dxh(x)r x e ixoT - i \ dxh(ix)r Lx e- X(T . 
Jo Jo 
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The integrand is now exponentially decreasing for all r > 0 and cr ± 0 or 2/r. The 
last condition reminds us again of the existence of a branch cut at Re z > 1 - By 
the same technique, we find the analytic continuation for all the other integrals in 
(24.39). The real axis is then rotated according to x — > sign(m)ix where sign(wi) 
refers to the sign of m. 



I 

Jo 



dxe 



±2m\m\x 



h(x) rV' 



Jo 



dxh(±ix) r~ e 



±ix —x(2n\m\±cr) 



Changing summation and integration, we can carry out the sum over \m\ explicitly 
and one finally obtains the compact expression 



1 r°° 

f( z ) = -h(0) + i dxhiix)^^ 

2 Jo 



(24.40) 



+ i 



dx - - [h(i X y x e- X(T - h{-ix)r- ix e xcr ] . 



The transformation from the original sum to the two integrals in (24.40) is exact 
for r < 1, and provides an analytic continuation for r > 0. The expression (24.40) 
is especially useful for an efficient numerical calculations of a dynamical zeta 
function for \z\ > 1, which is essential when searching for its zeros and poles. 



24.3.3 Curvature contributions 



So far, we have discussed only the fundamental term Z^Li t n in (24.32), and 
showed how to deal with such power series with algebraically decreasing coef- 
ficients. The fundamental term determines the main structure of the zeta function 
in terms of the leading order branch cut. Corrections to both the zeros and poles 
of the dynamical zeta function as well as the leading and subleading order terms 
in expansions like (24.34) are contained in the curvature terms in (24.32). The 
first curvature correction is the 2-cycle sum 



m=l n=l 



with algebraically decaying coefficients which again diverge for \z\ > 1. The 
analytically continued curvature terms have as usual branch cuts along the positive 
real z axis. Our ability to calculate the higher order curvature terms depends on 
how much we know about the cycle weights t„ m . The form of the cycle stability 
(24.6) suggests that t mn decrease asymptotically as 



1 



(nm) l+l ' s 



(24.41) 
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for 2-cycles, and in general for rc-cycles as 



1 



t, 



m\rri2...m n 



{jn\m% . . . m„) 1+1 ^ 



If we happen to know the cycle weights 4„ im2 ... mn analytically, we may proceed as 
in sect. 24.3.2, transform the multiple sums into multiple integrals and rotate the 
integration contours. 

We have reached the edge of what has been accomplished so far in computing 
and what is worth the dynamical zeta functions from periodic orbit data. In the 
next section, we describe a probabilistic method applicable to intermittent maps 
which does not rely on periodic orbits. 

24.3.4 Stability ordering for intermittent flows 



Longer but less unstable cycles can give larger contributions to a cycle 
expansion than short but highly unstable cycles. In such situations, truncation by 
length may require an exponentially large number of very unstable cycles before 
a significant longer cycle is first included in the expansion. This situation is best 
illustrated by intermittent maps. The simplest of these is the Farey map 



For the Farey map, the symbolic dynamics is of complete binary type, so the 
lack of shadowing is not due to the lack of a finite grammar, but rather to the 
intermittency caused by the existence of the marginal fixed point j§ = 0, for which 
the stability multiplier is Ao = 1. This fixed point does not participate directly in 
the dynamics and is omitted from cycle expansions. Its presence is, however, 
very much felt instead in the stabilities of neighboring cycles with n consecutive 
iterates of the symbol 0, whose stability falls of only as A ~ w, in contrast to 
the most unstable cycles with n consecutive l's, which are exponentially unstable, 



The symbolic dynamics is of complete binary type. A quick count in the style 
of sect. 15.7.2 leads to a total of 74,248,450 prime cycles of length 30 or less, 
not including the marginal point xq = 0. Evaluating a cycle expansion to this 
order is an impressive computational feat. However, stability of the least unstable 
cycle omitted is roughly A 10 3o ~ 30 2 = 900, so it yields a 0.1% correction. The 
situation may be much worse than this estimate suggests, because the next 
cycle contributes a similar amount, and could easily reinforce the error. Adding 
up all such omitted terms, we arrive at an estimated error of about 3%, for a cycle- 
length truncated cycle expansion based on more than Iff pseudo-cycle terms! On 
the other hand, if one truncates by stability at Anax = 3000, only 409 prime cycles 
suffice to attain the same accuracy of about a 3% error, figure 24.8. 




0 < x < 1/2 
1/2 < x< 1 



(24.42) 



|AoH~[(V5 + l)/2] 
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Figure 24.8: Comparison of cycle expansion trun- 
cation schemes for the Farey map (24.42); the 
deviation of the truncated cycles expansion for 
|l/£v(0)| from the exact flow conservation value 
l/£(0) = 0 is a measure of the accuracy of the 
truncation. The jagged line is the logarithm of 
the stability ordering truncation error; the smooth 
line is smoothed according to sect. 20.6.2; the di- 
amonds indicate the error due to the topological 
length truncation, with the maximal cycle length N 
shown. They are placed along the stability cutoff 
axis at points determined by the condition that the 
total number of cycles is the same for both trunca- 
tion schemes. 
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As the Farey map maps the unit interval onto itself, the leading eigenvalue 
of the Perron-Frobenius operator should equal $ = 0, so 1/^(0) = 0. Devia- 
tions from this exact result give an indication of the convergence of a given cycle 
expansion. Errors corresponding to different truncation schemes are indicated in 
figure 24.8. We see that topological length truncation schemes are hopelessly bad 
in this case; stability length truncations are somewhat better, but still rather bad. 
In simple cases like this one, where intermittency is caused by a single marginal 
fixed point, convergence can be improved by going to infinite alphabets. 



^■r So far we have focused on 1-d models as the simplest setting in which to 
investigate dynamical implications of marginal fixed points. We now take an al- 
together different track and describe how probabilistic methods may be employed 
in order to write down approximate dynamical zeta functions for intermittent sys- 
tems. 

We will discuss the method in a very general setting, for a flow in arbitrary 
dimension. The key idea is to introduce a surface of section P such that all tra- 
jectories traversing this section will have spent some time both near the marginal 
stable fixed point and in the chaotic phase. An important quantity in what follows 
is (3.5), the first return time t(x), or the time of flight of a trajectory starting in 
x to the next return to the surface of section P. The period of a periodic orbit p 
intersecting the P section n p times is 



where f(x) is the Poincare map, and x p € P is a periodic point. The dynamical 



24.4 BER zeta functions 




k=0 
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zeta function (19.15) 



/ n P( J3A p - S T p X "Pj 

l/az, s,P) = J J 1 — , A p = 2, a(j\x p )), (24.43) 

n \ \ p\ I k=Q 



chapter 17 

associated with the observable a(x) captures the dynamics of both the flow and the 
Poincare map. The dynamical zeta function for the flow is obtained as \l((s,f3) = 
1 /£(1, s,0), and the dynamical zeta function for the discrete time Poincare map is 

Our basic assumption will be probabilistic. We assume that the chaotic in- 
terludes render the consecutive return (or recurrence) times T(%), T(xi+\) and ob- 
servables a{x{}, a(xi+\) effectively uncorrected. Consider the quantity $ A (xo,n)-sT(x 0 ,ri) 
averaged over the surface of section P. With the above probabilistic assumption 
the large n behavior is 



{ ^A( X0 ,n)-sT( Xa ,n) )p „ U ^sr^A" 



where p(x) is the invariant density of the Poincare map. This type of behavior is 
equivalent to there being only one zero ®(s,P) = f (fi a{x) - ST{x) p{x)dxof 1/£( Z , s,P) 
in the z-/3 plane. In the language of Ruelle-Pollicott resonances this means that 
there is an infinite gap to the first resonance. This in turn implies that 1 (z, s,f5) 
may be written as 



remark 17.1 



_ j J3a(x)-ST(x] 



\/az,s,P) = z- e Baix) - ST{x) p(x)dx , (24.44) 



where we have neglected a possible analytic and non-zero prefactor. The dynam- 
ical zeta function of the flow is now 



l/S(s,fi) = l/(;(l,s,p) = l- f (P a{x) p{x)e- ST(x) dx . (24.45) 

Normally, the best one can hope for is a finite gap to the leading resonance of 
the Poincare map. with the above dynamical zeta function only approximatively 
valid. As it is derived from an approximation due to Baladi, Eckmann, and Ruelle, 
we shall refer to it as the BER zeta function 1 /^ber( s ^) m what follows. 

A central role is played by the probability distribution of return times 

(Kt) - f S(t - T(x))p(x)dx (24.46) 
Jp 



exercise 25.6 
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The BER zeta function at /3 = 0 is then given in terms of the Laplace transform of 
this distribution 

1/&ErC*) = 1- Hr)e' ST dT. 
Jo 

exercise 24.5 

Example 24.1 Return times for the Bernoulli map. For the Bernoulli shift map 
(23.6) 

x h-> f(x) = 2x mod 1 , 

one easily derives the distribution of return times 
1 

tl/ n = — n > 1 . 

The BER zeta function becomes (by the discrete Laplace transform ( 18.9)) 

co oo _ 

n=l n=l 

= ^—^:=r\z)/(l-z/A 0 ). (24.47) 
1 - z/2 

Thanks to the uniformity of the piecewise linear map measure (16.13) the "approximate^ 
zeta function is in this case the exact dynamical zeta function, with the periodic point 0 
pruned. 



Example 24.2 Return times for the model of sect. 24.2.1. For the toy model of 
sect. 24.2. 1 one gets = \M\\, and iff„ = \M„\(l - b)/(l - a), for n > 2, leading to a 
BER zeta function 

co 

m B ER(z) = l-z\Mi\-Y J \M n \z", 

n=2 

which again coincides with the exact result, (24. 11). 

It may seem surprising that the BER approximation produces exact results in 
the two examples above. The reason for this peculiarity is that both these systems 
are piecewise linear and have complete Markov partitions. As long as the map 
is piecewise linear and complete, and the probabilistic approximation is exactly 
fulfilled, the cycle expansion curvature terms vanish. The BER zeta function and 
the fundamental part of a cycle expansion discussed in sect. 20. 1 . 1 are indeed 
intricately related, but not identical in general. In particular, note that the BER zeta 
function obeys the flow conservation sum rule (20.17) by construction, whereas 
the fundamental part of a cycle expansion as a rule does not. 
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Resume 

The presence of marginally stable fixed points and cycles changes the analytic 
structure of dynamical zeta functions and the rules for constructing cycle ex- 
pansions. The marginal orbits have to be omitted, and the cycle expansions 
now need to include families of infinitely many longer and longer unstable or- 
bits which accumulate toward the marginally stable cycles. Correlations for such 
non-hyperbolic systems may decay algebraically with the decay rates controlled 
by the branch cuts of dynamical zeta functions. Compared to pure hyperbolic 
systems, the physical consequences are drastic: exponential decays are replaced 
by slow power-law decays, and transport properties, such as the diffusion may 
become anomalous. 



Commentary 

Remark 24.1 What about the evolution operator formalism? The main virtue of 
evolution operators was their semigroup property ( 17.27). This was natural for hyper- 
bolic systems where instabilities grow exponentially, and evolution operators capture this 
behavior due to their multiplicative nature. Whether the evolution operator formalism is a 
good way to capture the slow, power law instabilities of intermittent dynamics is less clear. 
The approach taken here leads us to a formulation in terms of dynamical zeta functions 
rather than spectral determinants, circumventing evolution operators altogether. It is not 
known if the spectral determinants formulation would yield any benefits when applied to 
intermittent chaos. Some results on spectral determinants and intermittency can be found 
in [2]. A useful mathematical technique to deal with isolated marginally stable fixed point 
is that of inducing, that is, replacing the intermittent map by a completely hyperbolic map 
with infinite alphabet and redefining the discrete time; we have used this method implic- 
itly by changing from a finite to an infinite alphabet. We refer to refs. [ 3, 20] for detailed 
discussions of this technique, as well as applications to 1-dimensional maps. 

Remark 24.2 Intermittency. Intermittency was discovered by Manneville andPomeau [ 1] 
in their study of the Lorenz system. They demonstrated that in neighborhood of parameter 
value r c = 166.07 the mean duration of the periodic motion scales as (r— r c )^ 2 . In ref. [5] 
they explained this phenomenon in terms of a 1-dimensional map (such as (24.1)) near 
tangent bifurcation, and classified possible types ofintermittency. 

Piecewise linear models like the one considered here have been studied by Gaspard 
and Wang [6]. The escape problem has here been treated following ref. [ 7], resummations 
following ref. [8]. The proof of the bound (24.28) is given in P. Dahlqvist's notes, see 
ChaosBook . org/extras/PDahlqvistEscape . pdf. 

Farey map (24.42) has been studied widely in the context of intermittent dynamics, 
for example in refs. [16, 17, 3, 18, 19, 16, 2]. The Fredholm determinant and the dyn- 
amical zeta functions for the Farey map (24.42) and the related Gauss shift map (16.47) 
have been studied by Mayer [16]. He relates the continued fraction transformation to the 
Riemann zeta function, and constructs a Hilbert space on which the evolution operator is 
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self-adjoint, and its eigenvalues are exponentially spaced, just as for the dynamical zeta 
functions [24] for "Axiom A" hyperbolic systems. 



Remark 24.3 Tauberian theorems. In this chapter we used Tauberian theorems for 
power series and Laplace transforms: Feller's monograph [9] is a highly recommended 
introduction to these methods. 



Remark 24.4 Probabilistic methods, BER zeta functions. Probabilistic description 
of intermittent chaos was introduced by Geisal and Thomae [ ]. The BER approximation 
studied here is inspired by Baladi, Eckmann and Ruelle [ 14], with further developments 
inrefs. [13, 15]. 
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Exercises 



24.1. Integral representation of Jonquiere functions. 24.6. 
Check the integral representation 



24.4. 



J(Z, a) 



7 /^*° 



d£ 



'f -7 



for a > 0 .(24.48) 



Note how the denominator is connected to Bose- 
Einstein distribution. Compute J(x + ie) — J(x - is) for 
a real x > 1 . 

24.2. Power law correction to a power law. Expand 
(24.21) further and derive the leading power law correc- 
tion to (24.24). 

24.3. Power-law fall off. In cycle expansions the stabilities 24.7. 
of orbits do not always behave in a geometric fashion. 
Consider the map / 




This map behaves as / — » x as x — > 0. Define a sym- 
bolic dynamics for this map by assigning 0 to the points 
that land on the interval [0, 1/2) and 1 to the points that 
land on (1/2, 1]. Show that the stability of orbits that 
spend a long time on the 0 side goes as n 2 . In particular, 
show that 



A 



00-0 1 



Power law fall-off of Floquet multipliers in the sta- 
dium billiard**. From the cycle expansions point 
of view, the most important consequence of the shear in 
J" for long sequences of rotation bounces in (8.13) 
is that the A„ grows only as a power law in number of 
bounces: 



A„ oc n 



(24.49) 



Check. 



24.5. Probabilistic zeta function for maps. Derive the 
probabilistic zeta function for a map with recurrence dis- 
tribution iff n . 



Accelerated diffusion. Consider a map h, such that 
h = /, but now running branches are turner into stand- 
ing branches and vice versa, so that 1,2, 3,4 are stand- 
ing while 0 leads to both positive and negative jumps. 
Build the corresponding dynamical zeta function and 
show that 



a-\t) 



I for a > 2 

tint for a = 2 

for ore (1,2) 
for a — 1 
for a € (0, 1) 



t 2 /\nt 



Anomalous diffusion (hyperbolic maps). Anoma- 
lous diffusive properties are associated to deviations 
from linearity of the variance of the phase variable we 
are looking at: this means the diffusion constant ( 17.13) 
either vanishes or diverges. We briefly illustrate in this 
exercise how the local local properties of a map are cru- 
cial to account for anomalous behavior even for hyper- 
bolic systems. 

Consider a class of piecewise linear maps, relevant to 
the problem of the onset of diffusion, defined by 



/*(*) = 



Ax 

a — A e>r |x — x + | 
1 - A'(x - x+) 
1 - a + A £ , r |x - 
1 + A(x - 1) 



for 


x e 


0,4] 


for 


x e 


X i ' x 2 


for 


x e 


x 2 ' X \ 


for 


x e 


x \ ' x 2 


for 


X € 


x-,l] 



where A = (1/3 - e lly Y\ A' = (1/3 - 2e 1/y ), A f , r 



l+e,x + = 1/3,4 



i/y 



and the usual symmetry properties (25.1 1) are satisfied. 

Thus this class of maps is characterized by two escap- 
ing windows (through which the diffusion process may 
take place) of size 2e'^ r : the exponent y mimicks the or- 
der of the maximum for a continuous map, while piece- 
wise linearity, besides making curvatures vanish and 
leading to finite cycle expansions, prevents the appear- 
ance of stable cycles. The symbolic dynamics is eas- 
ily described once we consider a sequence of param- 
eter values {e,„}, where e m = A~ (m+1) : we then par- 
tition the unit interval though the sequence of points 
0, 4> 4' 4' ^ ' x 2 ' 1 an< ^ l arj el the corresponding 
sub-intervals 1, s a , Sb, 2, <4, d a , 3: symbolic dynamics is 
described by an unrestricted grammar over the following 
set of symbols 



(l,2,3,s # - V,d#-3 k 



# — a,b i,k — m,m + 1 
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This leads to the following dynamical zeta function: a matter of fact, from (24.51) we get the asymptotic be- 

2 m+i / [haviorD ~ e'' r , which shows how the onset of diffusion 

<fn 1 (z,a) = 1 4 coshfaO&J/ 7-1 1 1 ) is governed by the order of the map at its maximum. 

i0 A A' A" \ A/ 

from which, by (25.8) we get _ „ miM _ _ 

Remark 24.5 Onset of diffusion for continuous maps. 

2e„/ r_1 A _m (l - l/A) -1 The zoology of behavior for continuous maps at the on- 

i _ i i , i/r-i / m+i i \^ ' lei of diffusion is described in refs. [15, 16, 25]: our 

A A' I A"'M-1/A~t A" ,+I II-I/AI 2 / • a a • 

v v ' ' ' ' treatment tor piecewise linear maps was introduced in 

The main interest in this expression is that it allows ex- ref. [26]. 
ploring how D vanishes in the f h 0 (m h ra) limit: as 



D = 
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Chapter 25 



Deterministic diffusion 



This is a bizzare and discordant situation. 
— M.V. Berry 



(R. Artuso and P. Cvitanovic) 




he advances in the theory of dynamical systems have brought a new life to 



Boltzmann's mechanical formulation of statistical mechanics. Sinai, Ruelle 



and Bowen (SRB) have generalized Boltzmann's notion of ergodicity for a 
constant energy surface for a Hamiltonian system in equilibrium to dissipative sys- 
tems in nonequilibrium stationary states. In this more general setting the attractor 
plays the role of a constant energy surface, and the SRB measure of sect. 16.1 is 
a generalization of the Liouville measure. Such measures are purely microscopic 
and indifferent to whether the system is at equilibrium, close to equilibrium or far 
from it. "Far for equilibrium" in this context refers to systems with large devia- 
tions from Maxwell's equilibrium velocity distribution. Furthermore, the theory 
of dynamical systems has yielded new sets of microscopic dynamics formulas for 
macroscopic observables such as diffusion constants and the pressure, to which 
we turn now. 

We shall apply cycle expansions to the analysis of transport properties of 
chaotic systems. 

The resulting formulas are exact; no probabilistic assumptions are made, and 
the all correlations are taken into account by the inclusion of cycles of all periods. 
The infinite extent systems for which the periodic orbit theory yields formulas for 
diffusion and other transport coefficients are spatially periodic, the global state 
space being tiled with copies of a elementary cell. The motivation are physical 
problems such as beam defocusing in particle accelerators or chaotic behavior of 
passive tracers in 2-dimensional rotating flows, problems which can be described 
as deterministic diffusion in periodic arrays. 

In sect. 25.1 we derive the formulas for diffusion coefficients in a simple phys- 
ical setting, the 2-dimensional periodic Lorentz gas. This system, however, is not 
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Figure 25.1: Deterministic diffusion in a finite horizon 
periodic Lorentz gas. (T. Schreiber) 




the best one to illustrate the theory, due to its complicated symbolic dynamics. 
Therefore we apply the theory first to diffusion induced by a 1 -dimensional maps 
in sect. 25.2. 



The 2-dimensional Lorentz gas is an infinite scatterer array in which diffusion of a 
light molecule in a gas of heavy scatterers is modeled by the motion of a point par- 
ticle in a plane bouncing off an array of reflecting disks. The Lorentz gas is called 
"gas" as one can equivalently think of it as consisting of any number of pointlike 
fast "light molecules" interacting only with the stationary "heavy molecules" and 
not among themselves. As the scatterer array is built up from only defocusing 
concave surfaces, it is a pure hyperbolic system, and one of the simplest non- 
trivial dynamical systems that exhibits deterministic diffusion, figure 25.1. We 
shall now show that the periodic Lorentz gas is amenable to a purely determin- 
istic treatment. In this class of open dynamical systems quantities characterizing 
global dynamics, such as the Lyapunov exponent, pressure and diffusion constant, 
can be computed from the dynamics restricted to the elementary cell. The method 
applies to any hyperbolic dynamical system that is aperiodic tilingVVl - Uner M« 
of the dynamical state space M by translates Afy, of an elementary cell At, with 
T the abelian group of lattice translations. If the scattering array has further dis- 
crete symmetries, such as reflection symmetry, each elementary cell may be built 
from a. fundamental domain M by the action of a discrete (not necessarily abelian) 
group G. The symbol M refers here to the full state space, i.e.,, both the spatial 
coordinates and the momenta. The spatial component of At is the complement of 
the disks in the whole space. 



25.1 Diffusion in periodic arrays 



Chaos happens - let's make a better use of it. 
— Edward Tenner 
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Figure 25.2: Tiling of M, a periodic lattice of reflect- 
ing disks, by the fundamental domain M. Indicated is 
an example of a global trajectory x(f) together with the 
corresponding elementary cell trajectory x(t) and the 
fundamental domain trajectory x(t). (Courtesy of J.-P. 
Eckmann) 



We shall now relate the dynamics in At to diffusive properties of the Lorentz 
gas in At. 

These concepts are best illustrated by a specific example, a Lorentz gas based 
on the hexagonal lattice Sinai billiard of figure 25.2. We distinguish two types 
of diffusive behavior; the infinite horizon case, which allows for infinite length 
flights, and the finite horizon case, where any free particle trajectory must hit a 
disk in finite time. In this chapter we shall restrict our consideration to the finite 
horizon case, with disks sufficiently large so that no infinite length free flight is 
possible. In this case the diffusion is normal, with x(tf growing like t. We shall 
discuss the anomalous diffusion case in sect. 25.3. 

As we will work with three kinds of state spaces, good manners require that 
we repeat what tildes, nothings and hats atop symbols signify: 

fundamental domain, triangle in figure 25.2 

elementary cell, hexagon in figure 25.2 

full state space, lattice in figure 25.2 (25.1) 

It is convenient to define an evolution operator for each of the 3 cases of fig- 
ure 25.2. x(t) = f'(x) denotes the point in the global space At reached by the 
flow in time t. x{t) - f'(xo) denotes the corresponding flow in the elementary 
cell; the two are related by 

h t (xo) = f(x 0 ) - f(x 0 ) € T , (25.2) 

the translation of the endpoint of the global path into the elementary cell At. The 
quantity x(t) = f\x) denotes the flow in the fundamental domain At; f\x) is 
related to f'(x) by a discrete symmetry g e G which maps x(t) e At to x(t) € At . 

Fix a vector ft e W 1 , where d is the dimension of the state space. We will 
compute the diffusive properties of the Lorentz gas from the leading eigenvalue of 
the evolution operator (17.11) 

s(fi) = \im-\og{^ m - x) ) M , (25.3) 
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where the average is over all initial points in the elementary cell, x € At. 

If all odd derivatives vanish by symmetry, there is no drift and the second 
derivatives 



d_d_ 



2dD U = — — S W 



= lim -<m) - xum - x)j) M 



yield a diffusion matrix. This symmetric matrix can, in general, be anisotropic 
(i.e., have d distinct eigenvalues and eigenvectors). The spatial diffusion constant 
is then given by the Einstein relation (17.13) 



1 v- d 2 



1 , 

= lim xjliiqit) -q) )m , 



where the i sum is restricted to the spatial components q of the state space vectors 
x = (q, p), i.e., if the dynamics is Hamiltonian, the sum is over the d the degrees 
of freedom. 

We now turn to the connection between (25.3) and periodic orbits in the ele- 
mentary cell. As the full At — > At reduction is complicated by the non-abelian 
nature of G, we discuss only the abelian M — > M reduction. 



25.1.1 Reduction from M to M 

The key idea follows from inspection of the relation 

yeM 

\M\ - J M dx is the volume of the elementary cell At. Due to translational symme- 
try, it suffices to start with a density of trajectories defined over a single elementary 
cell At. As in sect. 17.2, we have used the identity 1 = J M dy 8{y - x{t)) to moti- 
vate the introduction of the evolution operator £(y, x). There is a unique lattice 
translation h such that y — y — h, with the endpoint y e At translated back to the 
elementary cell, and f(x) given by (25.2). The difference is a translation by a 
constant lattice vector h, and the Jacobian for changing integration from dy to dy 
equals unity. Therefore, and this is the main point, translation invariance can be 
used to reduce this average to the elementary cell: 

(e^CSO-*)) = 1 f dxdy^ x) - x) 8(y-f{x)). (25.4) 

|At| J x , y eM 

As this is a translation, the Jacobian is \dy/dy\ = 1. In this way the global f' (x) 
flow, infinite volume state space averages can be computed by following the flow 
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f'(xo) restricted to the compact, finite volume elementary cell At. The equation 
(25.4) suggests that we study the evolution operator 



-C'(y,x) 



Km-*) 



5{y-f\x)), 



(25.5) 



where x(t) = f (x) e At is the displacement in the full space, but x, fix), y e At. 
It is straightforward to check that this operator satisfies the semigroup property 
(17.27), 



f. 

JM 



dz£' 2 (y,z)£' l (z,x) = £ 2+tl (y,x) 



For (3 = 0, the operator (25.5) is the Perron-Frobenius operator (16.10), with the 
leading eigenvalue e 10 = 1 because there is no escape from this system (see the 
flow conservation sum rule (20.17)). 

The rest is old hat. The spectrum of X is evaluated by taking the trace 



section 18.2 



tr£' = f dx^ h ' {x) 5{x-x{t)) . 

JM 



Here h t (x) is the discrete lattice translation defined in (25.2). Two kinds of orbits 
periodic in the elementary cell contribute. A periodic orbit is called standing 
if it is also periodic orbit of the infinite state space dynamics, j^'ix) = x, and it 
is called running if it corresponds to a lattice translation in the dynamics on the 
infinite state space, f T, '(x) — x + h p . We recognize the shortest repeating segment 
of a running orbit as our old 'relative periodic orbit' friend from chapter9. In the 
theory of area-preserving maps such as the standard map of example 7.7 these 
orbits are called accelerator modes, as the diffusion takes place along the momen- 
tum rather than the position coordinate. The traveled distance hp = fiT p (xo) is 
independent of the starting point xo, as can be easily seen by continuing the path 
periodically in At. 

The final result is the spectral determinant (19.6) 



det(jCS)-^) - [~[exp - 



I e iP-n p -sT p )r 



ti r det(l-Afj)! 



(25.6) 



or the corresponding dynamical zeta function (19.15) 



nm S ) = Y] i- 



? (p-n p -sT p ) 

|An| 



(25.7) 
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The dynamical zeta function cycle averaging formula (20.24) for the diffusion 
constant (17.13), zero mean drift (x,) = 0 , is given by 



- = 2d7rT,L ia_. ...a_.i ■ (25 - 8) 



2d (T) ( 2d(T) ( ^ \A pr --A 



where the sum is over all distinct non-repeating combination of prime cycles. The 
derivation is standard, still the formula is strange. Diffusion is unbounded motion 
across an infinite lattice; nevertheless, the reduction to the elementary cell enables 
us to compute relevant quantities in the usual way, in terms of periodic orbits. 

A sleepy reader might protest that x(T p ) - jc(0) is manifestly equal to zero for 
a periodic orbit. That is correct; h p in the above formula refers to a displacement 
x(T p ) on the infinite periodic lattice, while p refers to closed orbit of the dynamics 
fix) reduced to the elementary cell, with x p a periodic point in the closed prime 
cycle p. 

Even so, this is not an obvious formula. Globally periodic orbits have £ = 0, 
and contribute only to the time normalization (TV. The mean square displace- 
ment (x 2 ^i gets contributions only from the periodic runaway trajectories; they 
are closed in the elementary cell, but on the periodic lattice each one grows like 
%(t) 2 = (np/T p ) 2 t 2 = v 2 p t 2 . So the orbits that contribute to the trace formulas 
and spectral determinants exhibit either ballistic transport or no transport at all: 
diffusion arises as a balance between the two kinds of motion, weighted by the 
l/|Ap| measure. If the system is not hyperbolic such weights may be abnormally 
large, with l/|A p | « l/T p a rather than l/|A p | « e~ T ' ,A , where A is the Lyapunov 
exponent, and they may lead to anomalous diffusion - accelerated or slowed down 
depending on whether the probabilities of the running or the standing orbits are 
enhanced. 

We illustrate the main idea, tracking of a globally diffusing orbit by the as- 
sociated confined orbit restricted to the elementary cell, with a class of simple 
1 -dimensional dynamical systems where all transport coefficients can be evalu- 
ated analytically. 



section 25.3 



25.2 Diffusion induced by chains of 1 -dimensional maps 

In a typical deterministic diffusive process, trajectories originating from a given 
scatterer reach a finite set of neighboring scatterers in one bounce, and then the 
process is repeated. As was shown in chapter 1 1 , the essential part of this pro- 
cess is the stretching along the unstable directions of the flow, and in the crud- 
est approximation the dynamics can be modeled by 1 -dimensional expanding 
maps. This observation motivates introduction of a class of particularly simple 
1 -dimensional systems. 
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Figure 25.3: (a) fix), the full space sawtooth map 
(25.9), A > 2. (b) f(x), the sawtooth map re- 
stricted to the unit circle (25.12), A = 6. 




Example 25.1 Chains of piecewise linear maps. We start by defining the map f on 
the unit interval as 

a»-{£ + i-a itftpft ■ A > 2 ' < 25 " 

and then extending the dynamics to the entire real line, by imposing the translation 
property 

f{x + h) = f(x) + h h 6 Z . (25.10) 

As the map is discontinuous at x = 1/2, /(1/2) is undefined, and the x = 1/2 point 
has to be excluded from the Markov partition. The map is antisymmetric under the 
x-coordinate flip 

fix) = -fi-x), (25.11) 

so the dynamics will exhibit no mean drift; all odd derivatives of the generating function 
(17.11) with respect to p, evaluated atp = 0, will vanish. 

The map (25.9) is sketched in figure 25.3(a). Initial points sufficiently close to 
either of the fixed points in the initial unit interval remain in the elementary cell for one 
iteration; depending on the slope A, other points jump h cells, either to the right or to 
the left. Repetition of this process generates a random walk for almost every initial 
condition. 

The translational symmetry (25.10) relates the unbounded dynamics on the 
real line to dynamics restricted to the elementary cell - in the example at hand, the unit 
interval curled up into a circle. Associated to f (x) we thus also consider the circle map 

fix) = fix) - [fix)] , x = X-[x]e [0, 1] (25. 12) 

figure 25.3(b), where [■ ■ ■] stands for the integer part (25.2). For the piecewise linear 
map of figure 25.3 we can evaluate the dynamical zeta function in closed form. Each 
branch has the same value of the slope, and the map can be parameterized by a single 
parameter, for example its critical value a = f(l/2), the absolute maximum on the 
interval [0, 1] related to the slope of the map by a - A/2. The larger A is, the stronger 
is the stretching action of the map. 
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As noted in sect. 25.1.1, the elementary cell cycles correspond to either stand- 
ing or running orbits for the map on the full line: we shall refer to ^ € Z as the 
jumping number of the p cycle, and take as the cycle weight 

tp = fr<J*r/\A p \. (25.13) 

The diffusion constant formula (25.8) for 1 -dimensional maps is 

D =2«7- <2514) 

where the "mean cycle time" is given by (20.25) 
d 1 



<«> ? = z- 



z =i ^ I a p.--- a aI 



and the "mean cycle displacement squared" by (20.28) 



/ o\ d 2 1 

n = ^iwv, . = - z ^ ■A.-.A.? ■ (25 - i6) 



<5/? 2 ^06, 1) 



(n pj +■■■ + %) 



0=0 l A Pi ' ' ' A A 



the primed sum indicating all distinct non-repeating combinations of prime cy- 
cles. The evaluation of these formulas for the simple system of example 25.1 will 
require nothing more than pencil and paper. 



Example 25.2 Unrestricted symbolic dynamics. Whenever A is an integer num- 
ber, the symbolic dynamics is exceedingly simple. For example, for the case A = 6 illus- 
trated in figure 25.3(b), the elementary cell map consists of 6 full branches, with uniform 
stretching factor A = 6. The branches have different jumping numbers: for branches 1 
and 2 we have h - 0, for branch 3 we have h = +1, for branch 4 h = -1, and finally for 
branches 5 and 6 we have respectively h = +2 and h = -2. The same structure reap- 
pears whenever A is an even integer A = 2a: all branches are mapped onto the whole 
unit interval and we have two h = 0 branches, one branch for which h = +1 and one for 
which h = -1, and so on, up to the maximal jump \h\ = a - 1. The symbolic dynamics 
is thus full, unrestricted shift in 2a symbols {0+, 1+, . . . , (a - 1) + , (a - 1)_, . . . , 1_, 0_}, 
where the symbol indicates both the length and the direction of the corresponding jump. 

For the piecewise linear maps with uniform stretching the weight associated 
with a given symbol sequence is a product of weights for individual steps, t sq = t s t q . For 
the map of figure 25.3 there are 6 distinct weights (25.13): 

t\ = t 2 = z/A 

h = e^z/A, t A = e~ p zlA, f 5 =e 2/3 z/A, t 6 = e~ v z/A. 
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The piecewise linearity and the simple symbolic dynamics lead to the full cancelation 
of all curvature corrections in (20. 7). The exact dynamical zeta function ( 15. 15) is given 
by the fixed point contributions: 



l/£(/3,Z) = l-fo + -fO_ f(a-l) + - *(a-l)_ 



1 + J] coshOS;) 



7=1 



(25.17) 



The leading (and only) eigenvalue of the evolution operator (25.5) is 



a 



1 + J] coshOS;) 



7=1 



A = 2a, a integer. 



(25.18) 



The flow conservation (20. 1 7) sum rule is manifestly satisfied, so s(0) = 0. The first 
derivative s(0)' vanishes as well by the left/right symmetry of the dynamics, implying 
vanishing mean drift (x) = 0. The second derivative s(p)" yields the diffusion constant 
(25.14): 



(n) i = 2a- = l, (x 2 ) =2- + 2- + 2- + -.. + 2^^ 

( A \'f A A A A 



A " \- It "A "A "A 
Using the identity YJl=\ ^ = n ( n + 1)( 2 « + l)/6 we obtain 



D = l(A-l)(A-2), 



A even integer. 



Similar calculation for odd integer A = 2k - 1 y/'e/ds 



£=^(A 2 -lh 



A odd integer. 



(25.19) 



(25.20) 

exercise 25.1 
(25.21) 



25.2.1 Higher order transport coefficients 

The same approach yields higher order transport coefficients 

S 2 = D, (25.22) 

/?=o 

known for k > 2 as the Burnett coefficients. The behavior of the higher or- 
der coefficients yields information on the relaxation to the asymptotic distribution 
function generated by the diffusive process. Here % is the relevant dynamical 
variable and S^s are related to moments of arbitrary order. 

Were the diffusive process purely Gaussian 

e tm = _L= f d x^e- fliADt) - <? Dt (25.23) 

the only St coefficient different from zero would be 3i - D. Hence, nonvan- 
ishing higher order coefficients signal deviations of deterministic diffusion from a 
Gaussian stochastic process. 



1 d k 



diffusion - 12jan2009 



ChaosBook.org versionl4.4.1, Apr 21 2013 



CHAPTER 25. DETERMINISTIC DIFFUSION 



518 



Figure 25.4: (a) A partition of the unit interval 
into six intervals, labeled by the jumping number 
h(x) I = (0 + , 1 + , 2+, 2_, 1_, 0_). The partition is 
Markov, as the critical point is mapped onto the 
right border of Mi + . (b) The transition graph for 
this partition, (c) The transition graph in the com- 
pact notation of (25.26) (introduced by Vadim Mo- 
roz). 



(a) 




Example 25.3 &4 Burnett coefficient. For the map under consideration the first 
Burnett coefficient coefficient £4 (or kurtosis (17.21)) is easily evaluated. For example, 
using (25.18) in the case of even integer slope A = 2a we obtain 



£4 



1 



41-60 



(a - l)(2a- l)(4a 2 -9fl + 7). 



exercise 25.2 



(25.24) 



We see that deterministic diffusion is not a Gaussian stochastic process. Higher 
order even coefficients may be calculated along the same lines. 



25.2.2 Finite Markov partitions 



For piecewise-linear maps exact results may be obtained whenever the critical 
points are mapped in finite numbers of iterations onto partition boundary points, 
or onto unstable periodic orbits. We will work out here an example for which 
this occurs in two iterations, leaving other cases as exercises. The key idea 
is to construct a Markov partition (11.2), with intervals mapped onto unions of 
intervals. 



Example 25.4 A finite Markov partition. As an example we determine a value 
of the parameter 4 < A < 6 for which /(/(1/2)) = 0. As in the integer A case, 
we partition the unit interval into six intervals, labeled by the jumping number h(x) e 
{v\4o + , M\ + ,Mi + , Mi_, Ati_, MoJ, ordered by their placement along the unit interval, 
figure 25.4 (a). 

In general the critical value a = f(l/2) will not correspond to an interval border, 
but now we choose a such that the critical point is mapped onto the right border of 
M\ + . Equating /(1/2) with the right border of M\ + , x = 1/A, we obtain a quadratic 
equation with the expanding solution A = 2( V2 + 1). For this parameter value f(M\J = 
M) + UMi + , f(M2_) = Mo_ \JMi_, while the remaining intervals map onto the whole 
unit interval M. The transition matrix (14.1) is given by 



b' = T<p = 



^ 1 1 1 0 1 
1110 1 



10 0 1 

10 0 1 

10 11 

10 11 



<P\- 



(25.25) 
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One could diagonalize (25.25) on a computer, but, as we saw in chapter 14, the tran- 
sition graph of figure 25.4 (b) corresponding to map figure 25.4 (a) offers more insight 
into the dynamics. Figure 25.4 (b) can be redrawn more compactly as transition graph 
figure 25.4 (c) by replacing parallel lines in a graph by their sum 

— « >>> . = h + h + h ■ (25.26) 

The dynamics is unrestricted in the alphabet 

m = {0 + , 1+, 2 + 0+, 2+1+, 2_1_, 2_0_, 1_, 0_) . 

Applying the loop expansion (15.15) of sect. 15.3, we are led to the dynamical zeta 
function 

l/Z(P,z) = 1 - 1 0+ - t u - t 2 +o + - h+i + - h_\_ - 1 2 _ 0 _ - h_ - 1 0 _ 

= 1-^(1+ coshOS)) - %r (cosh(2/3) + cosh(3/3)) . (25.27) 




For grammar as simple as this one, the dynamical zeta function is the sum over fixed 
points of the unrestricted alphabet. As the first check of this expression for the dynam- 
ical zeta function we verify that 

1/^(0,1) =l-±--^ = 0, 



as required by the flow conservation (20. 1 7). Conversely, we could have started by 
picking the desired Markov partition, writing down the corresponding dynamical zeta 
function, and then fixing A by the 1 /£(0, 1) = 0 condition. For more complicated transi- 
tion graphs this approach, together with the factorization (25.35), is helpful in reducing 
the order of the polynomial condition that fixes A. 

The diffusion constant follows from (25. 14) 



exercise 25.3 



1 2 / ,\ l 2 2 2 3 2 

^ = 4 A + V' n = 2 A +2 A^ +2 A^ 

D = H±l^. (25.28) 
16 + 8 V2 



It is by now clear how to build an infinite hierarchy of finite Markov partitions: 
tune the slope in such a way that the critical value /( 1 /2) is mapped into the fixed 
point at the origin in a finite number of iterations p / p (l/2) = 0. By taking higher 
and higher values of p one constructs a dense set of Markov parameter values, 
organized into a hierarchy that resembles the way in which rationals are densely 
embedded in the unit interval. For example, each of the 6 primary intervals can 
be subdivided into 6 intervals obtained by the 2-nd iterate of the map, and for the 
critical point mapping into any of those in 2 steps the grammar (and the corre- 
sponding cycle expansion) is finite. So, if we can prove continuity of D = D(A), 
we can apply the periodic orbit theory to the sawtooth map (25.9) for a random 
"generic" value of the parameter A, for example A = 4.5. The idea is to bracket 
this value of A by a sequence of nearby Markov values, compute the exact diffu- 
sion constant for each such Markov partition, and study their convergence toward 



diffusion - 12jan2009 



ChaosBook.org version!4.4.1, Apr 21 2013 



CHAPTER 25. DETERMINISTIC DIFFUSION 



520 




the value of D for A = 4.5. Judging how difficult such problem is already for a 
tent map (see sect. 15.5), this is not likely to take only a week of work. 

Expressions like (25.20) may lead to an expectation that the diffusion coeffi- 
cient (and thus transport properties) are smooth functions of parameters control- 
ling the chaoticity of the system. For example, one might expect that the diffusion 
coefficient increases smoothly and monotonically as the slope A of the map (25.9) 
is increased, or, perhaps more physically, that the diffusion coefficient is a smooth 
function of the Lyapunov exponent A. This turns out not to be true: D as a func- 
tion of A is a fractal, nowhere differentiable curve illustrated in figure 25. 5. The 
dependence of D on the map parameter A is rather unexpected - even though for 
larger A more points are mapped outside the unit cell in one iteration, the diffusion 
constant does not necessarily grow. 
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Figure 25.6: (a) A map with marginal fixed point, 
(b) The map restricted to the unit circle. 



(a) 



F(X) 




(b) -1/2 



f (X) 



1/2 




This is a consequence of the lack of structural stability, even of purely hyper- 
bolic systems such as the Lozi map and the 1 -dimensional diffusion map (25.9). 
The trouble arises due to non-smooth dependence of the topological entropy on 
system parameters - any parameter change, no mater how small, leads to creation 
and destruction of infinitely many periodic orbits. As far as diffusion is concerned 
this means that even though local expansion rate is a smooth function of A, the 
number of ways in which the trajectory can re-enter the initial cell is an irregular 
function of A. 

The lesson is that lack of structural stability implies lack of spectral stability, 
and no global observable is expected to depend smoothly on the system parame- 
ters. If you want to master the material, working through one of the deterministic 
diffusion projects on ChaosBook . org/pages is strongly recommended. 



25.3 Marginal stability and anomalous diffusion 

What effect does the intermittency of chapter 24 have on transport properties? A 
marginal fixed point affects the balance between the running and standing orbits, 
thus generating a mechanism that may result in anomalous diffusion. 

Example 25.5 Anomalous diffusion. Consider a 1 -dimensional map of the real line 
on itself shown in figure 25.6 (a), with the same properties as in sect. 25.2, except for a 
marginal fixed point at x = 0. The corresponding circle map is given in figure 25.6 (b). 
As in sect. 24.2.1, a branch with support in Mi, i = 1,2,3,4 has constant slope A„ 
while J\m 0 is of intermittent form. To keep you nimble, this time we take a slightly 
different choice of slopes. The toy example of sect. 24.2. 1 was cooked up so that the 
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1 Is branch cut in dynamical zeta function was the whole answer. Here we shall take a 

slightly different route, and pick piecewise constant slopes such that the dynamical zeta 

function for intermittent system can be expressed in terms of the Jonquiere function , nr . _, 
3 ^ ^ remark 25.7 



(25.29) 



k=\ 



Once the 0 fixed point is pruned away, the symbolic dynamics is given by the 
infinite alphabet {1,2,3,4, 0''l , 0 j 2, 0*3, 0'4), i, j,k,l = 1,2,... (compare with table 24. 1 ). 
The partitioning of the subinterval Mo is induced by M 0 i( right) = f^ ght) (M3 
(where f^ gM) denotes the inverse of the right branch of /\m 0 ) ar, d the same reason- 
ing applies to the leftmost branch. These are regions over which the slope of /\m 0 is 
constant. Thus we have the following stabilities and jumping numbers associated to 
letters: 



0*3, 0*4 




q/2 


h p = 


1 




O'l, 0'2 




,l« 
- q/2 


hp = 


-1 




3, 4 


A, 


= +A 


h p = 


1 




2, 1 


A, 


= +A 


h p = 


-1 , 


(25.30) 



where a = 1/s is determined by the intermittency exponent (24.1), while q is to be 
determined by the flow conservation (20. 1 7) for f: 



- + 2q{(a + 1) = 1 
A 

(where £ is the Riemann zeta function), so that q - {A- 4)/(2A£(a + 1 )). The dynamical 
zeta function picks up contributions just by the alphabet's letters, as we have imposed 
piecewise linearity, and can be expressed in terms of a Jonquiere function (25.29): 



l/ftfcjS) = 1 -yz cosh/3 - JV 4 zcoshff-Jfog + l). 
A A£(l + a) 

Its first zero z(fi) is determined by 

4 A-4 1 
—z + , — zz ' J{z, a + 1) = 



(25.31) 



A A£(l+a) 



cosh/3 



D vanishes by the implicit function theorem, ^'(J3)\p =l = 0 when a < 1. The 
physical interpretation is that a typical orbit will stick for long times near the 0 
marginal fixed point, and the 'trapping time' will be larger for higher values of 
the intermittency parameter s (recall a = l/s). As always, we need to look more 
closely at the behavior of traces of high powers of the transfer operator. 

The evaluation of transport coefficient requires one more derivative with re- 
spect to expectation values of state space observables (see sect. 25.1): if we use 
the diffusion dynamical zeta function (25.7), we may write the diffusion coeffi- 
cient as an inverse Laplace transform, in such a way that the distinction between 
maps and flows has vanished. In the case of 1 -dimensional diffusion we thus have 



D = lim 

t— >0O 



d 2 i r 

dp 2 2ni J a _ 



dse 



(25.32) 



/?=() 
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where the refers to the derivative with respect to s. 

The evaluation of inverse Laplace transforms for high values of the argument 
is most conveniently performed using Tauberian theorems. We shall take 



Jo 



cj(A) =| dxe Ax u(x), 



with u(x) monotone as x — » oo; then, as A i-> 0 and x \-* oo respectively (and 
P e (0, oo), 

if and only if 

1 

u{x) ~ —— XT L(x), 
T(p) 

where L denotes any slowly varying function with linv^oo L(ty)/L(t) = 1. Now 

\lti(e-\{5) _ (x + aTTO C^j a + 0 + cosh l 
l/Zo(e~',J3) 1 - coshyS - ^^e-^g-*, a + 1) cosh/?/ ' 

Taking the second derivative with respect to /? we obtain 

d 2 



-g a (s). (25.33) 



(l-^-'-Xatfe'-'^-'.a+D) 



The asymptotic behavior of the inverse Laplace transform (25.32) may then be 
evaluated via Tauberian theorems, once we use our estimate for the behavior of 
Jonquiere functions near z = 1 - The deviations from normal behavior correspond 
to an explicit dependence of D on time. Omitting prefactors (which can be calcu- 
lated by the same procedure) we have 

( s~ 2 for a > 1 

ga(s) ~ for a e (0, 1) 

{ l/(s 2 lns) for ar= 1. 

The anomalous diffusion exponents follow: 

exercise 25.6 

for are (0,1) 

({x-x Q y) t ~ { f/lnf for a=\ (25.34) 
t for a > 1 . 
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Resume 

Perfection itself is imperfection. 
— Vladimir Horowitz 

With initial data accuracy 8x = |<5x(0)| and system size L, a trajectory is predictable 
only to the finite Lyapunov time 7Ly ap » AT 1 In \L/6x\ . Beyond that, chaos rules. 
We have discussed the implications in sect. 1.8: chaos is good news for prediction 
of long term observables such as transport in statistical mechanics. 

The classical Boltzmann equation for evolution of 1 -particle density is based 
on stosszahlansatz, neglect of particle correlations prior to, or after a 2-particle 
collision. It is a very good approximate description of dilute gas dynamics, but 
a difficult starting point for inclusion of systematic corrections. In the theory 
developed here, no correlations are neglected - they are all included in the cycle 
averaging formula such as the cycle expansion for the diffusion constant 

n = 1 1 y'( uk + l ( h Pi +---+n Pk ) 2 
2d(T)^ K \^ Pl ---A Pk \ ■ 

Such formulas are exact, the issue in their applications is what are the most ef- 
fective schemes of estimating the infinite cycle sums required for their evaluation. 
Unlike most statistical mechanics, here there are no phenomenological macro- 
scopic parameters; quantities such as transport coefficients are calculable to any 
desired accuracy from the microscopic dynamics. 

For systems of a few degrees of freedom these results are on rigorous footing, 
but there are indications that they capture the essential dynamics of systems of 
many degrees of freedom as well. 

Though superficially indistinguishable from the probabilistic random walk 
diffusion, deterministic diffusion is quite recognizable, at least in low dimen- 
sional settings, through fractal dependence of the diffusion constant on the system 
parameters, and through non-Gaussion relaxation to equilibrium (non-vanishing 
Burnett coefficients). 

That Smale's "structural stability" conjecture turned out to be wrong is not a 
bane of chaotic dynamics - it is actually a virtue, perhaps the most dramatic ex- 
perimentally measurable prediction of chaotic dynamics. As long as microscopic 
periodicity is exact, the prediction is counterintuitive for a physicist - transport 
coefficients are not smooth functions of system parameters, rather they are non- 
monotonic, nowhere differentiable functions. 

Actual evaluation of transport coefficients is a test of the techniques developed 
above in physical settings. In cases of severe pruning the trace formulas and er- 
godic sampling of dominant cycles might be more effective strategy than the cycle 
expansions of dynamical zeta functions and systematic enumeration of all cycles. 
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Commentary 

Remark 25.1 Lorentz gas. The original pinball model proposed by Lorentz [4] 
consisted of randomly, rather than regularly placed scatterers. 

Remark 25.2 Who's dunnit? Cycle expansions for the diffusion constant of a parti- 
cle moving in a periodic array have been introduced by R. Artuso [ 5] (exact dynamical 
zeta function for 1-dimensional chains of maps (25.8)), by W.N. Vance [6], and by P. Cvi- 
tanovic, J. -P. Eckmann, and P. Gaspard [ 7] (the dynamical zeta function cycle expansion 
(25.8) applied to the Lorentz gas). 

Remark 25.3 Lack of structural stability for D. Expressions like (25.20) may lead 
to an expectation that the diffusion coefficient (and thus transport properties) are smooth 
functions of the chaoticity of the system (parameterized, for example, by the Lyapunov 
exponent A = In A). This turns out not to be true: D as a function of A is a fractal, nowhere 
differentiable curve shown in figure 25.5. The dependence of D on the map parameter A 
is rather unexpected - even though for larger A more points are mapped outside the unit 
cell in one iteration, the diffusion constant does not necessarily grow. We refer the reader 
to refs. [15, 16] for early work on the deterministic diffusion induced by 1-dimensional 
maps. The sawtooth map (25.9) was introduced by Grossmann and Fujisaka [ 17] who 
derived the integer slope formulas (25.20) for the diffusion constant. The sawtooth map is 
also discussed in refs. [18]. The fractal dependence of diffusion constant on the map pa- 
rameter is discussed in refs. [9, 8, 10]. Sect. 1.8 gives a brief summary of the experimental 
implications; for the the current state of the art of fractal transport coefficients consult the 
first part of Klage's monograph [12]. Would be nice if someone would eventually check 
these predictions in experiments... Statistical mechanicians tend to believe that such com- 
plicated behavior is not to be expected in systems with very many degrees of freedom, as 
the addition to a large integer dimension of a number smaller than 1 should be as unno- 
ticeable as a microscopic perturbation of a macroscopic quantity. No fractal-like behavior 
of the conductivity for the Lorentz gas has been detected so far [14]. (P. Cvitanovic and 
L. Rondoni) 

Remark 25.4 Symmetry factorization in one dimension. In the ft = 0 limit the 
dynamics (25.1 1) is symmetric under x — » -x, and the zeta functions factorize into prod- 
ucts of zeta functions for the symmetric and antisymmetric subspaces, as described in 
sect. 21.1.1: 

1 1 1 



— - - — — + — — - (25 35) 

8z f ( s dz £„ £j dz ( s ' 

The leading (material flow conserving) eigenvalue z = 1 belongs to the symmetric sub- 
space l/£j(0, 1) = 0, so the derivatives (25.15) also depend only on the symmetric sub- 
space: 

dz((0,z) z= i 
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length 


# cycles 


«0,0) 


/I 


1 


5 


-1.216975 




2 


10 


-0.024823 


1.745407 


3 


32 


-0.021694 


1.719617 


4 


104 


0.000329 


1.743494 


5 


351 


0.002527 


1.760581 


6 


1243 


0.000034 


1.756546 



Table 25.1: Fundamental domain, w=0.3 . 



1 d 
■ z- 



1 



U0,z) dz f,(0,z) 



(25.36) 



Implementing the symmetry factorization is convenient, but not essential, at this level of 
computation. 



Remark 25.5 Lorentz gas in the fundamental domain. The vector valued nature of 
the generating function (25.3) in the case under consideration makes it difficult to perform 
a calculation of the diffusion constant within the fundamental domain. Yet we point out 
that, at least as regards scalar quantities, the full reduction to M. leads to better estimates. 
A proper symbolic dynamics in the fundamental domain has been introduced in ref. [19]. 

In order to perform the full reduction for diffusion one should express the dynamical 
zeta function (25.7) in terms of the prime cycles of the fundamental domain M. of the 
lattice (see figure 25.2) rather than those of the elementary (Wigner-Seitz) cell M. This 
problem is complicated by the breaking of the rotational symmetry by the auxiliary vector 
/?, or, in other words, the non-commutativity of translations and rotations: see ref. [ 7], 



Remark 25.6 Anomalous diffusion. Anomalous diffusion for 1-dimensional inter- 
mittent maps was studied in the continuous time random walk approach in refs. [ 10, 1 1]. 
The first approach within the framework of cycle expansions (based on truncated dynam- 
ical zeta functions) was proposed in ref. [ 12]. Our treatment follows methods introduced 
in ref. [13], applied there to investigate the behavior of the Lorentz gas with unbounded 
horizon. 



Remark 25.7 Jonquiere functions. In statistical mechanics Jonquiere function 
(25.29) appears in the theory of free Bose-Einstein gas, see refs. [22, 23]. 
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Exercises 



25.1. Diffusion for odd integer A. Show that when the 
slope A = 2k - 1 in (25.9) is an odd integer, the dif- 
fusion constant is given by D = (A 2 - l)/24, as stated in 
(25.21). 

25.2. Fourth-order transport coefficient. Verify (25.24). 
You will need the identity 



S' 

k=i 



1 9 

— n(n + l)(2n + l)(3n 2 + 3n 
30 



1). 



versa, so that 1 , 2, 3, 4 are standing while 0 leads to both 
positive and negative jumps. Build the corresponding 
dynamical zeta function and show that 



<T 2 {f) 



for a > 2 

for a - 2 

f" for a e (1,2) 

t 2 1 In t for a — 1 

r 2 for a e (0, 1) 



t 

tint 

f3-o 



25.3. Finite Markov partitions. 



25.4. Maps with variabie peak shape: 

lowing piecewise linear map 



25.5. 



Verify (25.28). 

Consider the fol- 



fs(x) = 



3.x 
IS 
3 



1 



x e M\ 

xeM 2 (25.37) 
x e A-I3 



where Mi = [0, ±(1 - 5)], M 2 = [|(1 - S), \{2 + 5)\ 

M3 = [|(2 + 6), ij, and the map in [1/2, 1] is obtained 
by antisymmetry with respect to x = 1 /2, y = 1/2, Write 
the corresponding dynamical zeta function relevant to 
diffusion and then show that 



D = 



5(2 + 5) 
4(1-5) 



See refs. [2 1 , 22] for further details. 

Two-symboi cycies for the Lorentz gas. Write down 
all cycles labeled by two symbols, such as (0 6), (1 7), 
(1 5) and (0 5). 

ChaosBook.org/pages offers several project-length 
deterministic diffusion exercises. 



25.6. Acceierated diffusion, (medium difficulty) Consider 
a map h, such that h-fof figure 25.6 (b), but now run- 
ning branches are turned into standing branches and vice 



25.7. Recurrence times for Lorentz gas with infinite hori- 
zon. Consider the Lorentz gas with unbounded 
horizon with a square lattice geometry, with disk ra- 
dius R and unit lattice spacing. Label disks accord- 
ing to the (integer) coordinates of their center: the se- 
quence of recurrence times {t j} is given by the set of 
collision times. Consider orbits that leave the disk sit- 
ting at the origin and hit a disk far away after a free 
flight (along the horizontal corridor). Initial conditions 
are characterized by coordinates (<p, a) (</> determines the 
initial position along the disk, while a gives the an- 
gle of the initial velocity with respect to the outward 
normal: the appropriate measure is then d(p cos a da 
(<p e [0, 2k), a e [—n/2, n/2]. Find how (p(T) scales 
for large values of T: this is equivalent to investigating 
the scaling of portions of the state space that lead to a 
first collision with disk (n, 1), for large values of n (as 
b h 00 n ^ T). 

25.8. Diffusion reduced to the fundamental domain. 



Maps such as figure 25.3 are antisymmetric. Re- 
duce such antisymmetric maps as in example 9.4, and 
write down the formula (25.14) for the diffusion con- 
stant D in terms of the fundamental domain cycles (rela- 
tive periodic orbits) alone (P. Gaspard says it cannot be 
done [7]). 
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Turbulence? 



I am an old man now, and when I die and go to Heaven 
there are two matters on which I hope enlightenment. One 
is quantum electro-dynamics and the other is turbulence of 
fluids. About the former, I am rather optimistic. 

— Sir Horace Lamb 



There is only one honorable cause that would justify sweating through so much 
formalism - this is but the sharpening of a pencil in order that we may attack 
the Navier-Stokes equation, 

— + u- Vu = -— + vV 2 u + f, (26.1) 
ot p 

and solve the problem of turbulence. 

Flows described by partial differential equations [PDEs] are said to be 'in- 
finite dimensional' because if one writes them down as a set of ordinary dif- 
ferential equations [ODEs], one needs infinitely many of them to represent the 
dynamics of one partial differential equation. Even though the state space is 
infinite-dimensional, the long-time dynamics of many systems of physical inter- 
est is finite-dimensional, contained within an inertial manifold. 

Being realistic, we are not so foolhardy to immediately plunge into the prob- 
lem - there are too many dimensions and indices. Instead, we start small, in one 
spatial dimension, u — > u, u • Vu —> ud x , assume constant p, forget about the pres- 
sure p, and so on. This line of reasoning, as well as many other equally sensible 
threads of thought, such as the amplitude equations obtained via weakly nonlin- 
ear stability analysis of steady flows, leads to a small set of frequently studied 
nonlinear PDEs, like the one that we turn to now. 
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26.1 Fluttering flame front 



Romeo: 'Misshapen chaos of well seeming forms!' 

— W. Shakespeare, Romeo and Julliet, Act I, Scene I 



The Kuramoto-Sivashinsky [KS] system describes the flame front flutter of 
gas burning on your kitchen stove, figure 26.1 (a), and many other problems of 
greater import, is one of the simplest nonlinear systems that exhibit 'turbulence' 
(in this context often referred to more modestly as 'spatiotemporally chaotic be- 
havior'). The time evolution of the 'flame front velocity' u - u{x, t) on a periodic 
domain u(x, t) = u(x + L, t) is given by 

u t + \{u 2 ) x + u xx + vu xxxx = 0 , x e [0, L] . (26.2) 

In this equation t > 0 is the time and x is the spatial coordinate. The subscripts x 
and t denote partial derivatives with respect to x and t: u, = du/dd, u xxxx stands 
for the 4th spatial derivative of u = u(x, t) at position x and time t. In what follows 
we use interchangeably the 'dimensionless system size'L, or the periodic domain 
size L = 2nL, as the system parameter. We take note, as in the Navier-Stokes 
equation (26.1), of the 'inertial' term ud x u, the 'anti-diffusive' term (f x u (with a 
"wrong" sign), '(hyper-) viscosity' v, etc.. 

The Kuramoto-Sivashinsky equation (26.2) is space translationally invariant, 
time translationally invariant, and invariant under reflection x — > -x, u — > -u. 
Comparing u t and (u 2 ) x terms we note that u has dimensions of [x]/[f], hence u is 
the 'velocity', rather than the 'height' of the flame front. Indeed, the Kuramoto- 
Sivashinsky equation is Galilean invariant: if u(x, t) is a solution, then v + u(x - 
vt, t), with v an arbitrary constant velocity, is also a solution. Without loss of 
generality, in our calculations we shall work in the mean velocity zero frame 



In terms of the system size L, the only length scale available, the dimensions of 
terms in (26.2) are [x] = L, [t] = L 2 , [u] = L , [v] = L 2 . Scaling out the 
"viscosity" v by x —> xv 1 ^ 2 , t — » tv , u —> uv~ 1 ^ 2 , brings the Kuramoto-Siva- 
shinsky equation (26.2) to a non-dimensional form 



In this way we trade in the "viscosity" v and the system size L for a single dimen- 
sionless system size parameter 




(26.3) 



u t + uu x + u xx + u 



xxxx 



= 0, 



xe [0,Lv _1/2 ] - [0,27i-L]. 



(26.4) 



L = L/(2n Vv) 



(26.5) 
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which plays the role of a "Reynolds number" for the Kuramoto-Sivashinsky sys- 
tem. Sometimes L is used as the system parameter, with v fixed to 1 , and at other 
times v is varied with L fixed to either 1 or 2n. Physically, varying L is the right 
thing to do if one is interested in taking L large, and studying 'spatio-temporal 
chaos.' In what follows we shall state results of all calculations in units of dimen- 
sionless system size L. The time units also have to be rescaled; for example, if 
T p * is a period of a periodic solution of (26.2) with a given v and L = In, then the 
corresponding solution of the non-dimensionalized (26.4) has period 

T p = T p */v. (26.6) 

The term (u 2 ) x makes this a nonlinear system. This is one of the simplest 
conceivable nonlinear PDE, playing the role in the theory of spatially extended 
systems a bit like the role that the j? nonlinearity plays in the dynamics of iterated 
mappings. The time evolution of a typical solution of the Kuramoto-Sivashinsky 
system is illustrated by figure 26.1 (b). 

Spatial periodicity u(x, t) = u(x + L, t) makes it convenient to work in the 
Fourier space, 



u( x , t) = J] a k (t)e ikx/L , (26.7) 



with the 1 -dimensional PDE (26.2) replaced by an infinite set of ODEs for the 
complex Fourier coefficients a k {t): 



a k 



2 4 ~ 

Vjt(a) = (q k - q k ) a k - i— 2_j a m a k -m , qk = k/L . (26.8) 



section 3.3 
remark 26.1 



As do = 0, ao is a conserved quantity, in our calculations fixed to at) - 0 by 
the vanishing mean (u) condition (26.3) for the front velocity. The velocity 
field u(x, f) is real, so a k = a\, and we can replace the sum by an m > 0 sum. 
This is the infinite set of ordinary differential equations promised in this chapter's 
introduction. 

Due to the hyperviscous damping u xxxx , long time solutions of Kuramoto- 
Sivashinsky equation are smooth, a k drop off fast with k, and truncations of (26.8) 
to ,/V terms, 16 < 7Y < 128, yield highly accurate solutions for system sizes con- 
sidered here. Robustness of the Fourier representation of KS as a function of 
the number of modes kept in truncations, a subtle issue. Adding an extra mode 
to a truncation introduces a small perturbation. However, this can (and often 
will) throw the dynamics into a different asymptotic state. A chaotic attractor 
for jV = 15 can collapse into an attractive period-3 cycle for Af = 16, and so on. 
If we compute, for example, the Lyapunov exponent Ay for a strange attractor of 
the system (26.8), there is no reason to expect % to smoothly converge to a limit 
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Figure 26.1: (a) Kuramoto-Sivashinsky dynam- 
ics visualized as the Bunsen burner flame flutter, 
with u = u(x, t) the "velocity of the flame front" at 
position x and time t. (b) A typical "turbulent" so- 
lution of the Kuramoto-Sivashinsky equation, sys- 
tem size L = 20ny/2 a 88.86. The color (gray 
scale) indicates the value of u at a given position 



and instant in time. The x coordinate is scaled 
with the most unstable wavelength 2ny/2, which 
is approximately also the mean wavelength of the 
turbulent flow. The dynamics is typical of a large 
system, in this case approximately 10 mean wave- 
lengths wide, (fromref. [1]) 




(a) x/(2ttV2) 

(b) 

value A, as N — > oo, because of the lack of structural stability both as a function 
of truncation N, and the system size L. However, later in this chapter we explore 
both equilibria and short periodic orbits, which are robust under mode truncations 
and small system parameter L changes. 

Spatial representations of PDEs (such as figure 26. 1 (b) and the 3D snapshots 
of velocity and vorticity fields in Navier-Stokes) offer little insight into detailed 
dynamics of low-Re flows. Much more illuminating are the state space represen- 
tations. 



Example 26.1 Kuramoto-Sivashinsky antisymmetric subspace: The Fourier co- 
efficients a k are in general complex numbers. We can isolate the antisymmetric sub- 
space u(x, t) - -u(-x, t) by considering the case ofa k pure imaginary a k — > ia k , where 
a k - -a- k are real, with the evolution equations 

a k = q\ (l - qfj a k - y ^ a m a k - m . (26.9) 

m=-oo 

By picking this subspace we eliminate the continuous translational symmetry from our 
considerations; that is not an option for an experimentalist, but will do for our purposes. 
In the antisymmetric subspace the translational invariance of the full system reduces 
to the invariance under discrete translation by half a spatial period L. In the Fourier 
representation (26.9) this corresponds to invariance under 

aim — » aim ,a2m+i — * ~aim+\ ■ (26.10) 



PDEs - 4apr2013 



ChaosBook.org versionI4.4.1, Apr 21 2013 



CHAPTER 26. TURBULENCE? 



533 



Figure 26.2: Spatiotemporally periodic solution 
u 0 (x, t), with period T 0 = 30.01 18 . The antisymmetric 
subspace, u(x,t) = —u{—x,t), so we plot x e [0, L/2]. 
System size L = 2.89109, N = 16 Fourier modes trun- 
cation. (From ref. [12]) 




-ft/T 



x/tt 



26.2 Infinite-dimensional flows: Numerics 



The computer is not a mere mathematical excrescence, 
useful for technological ends. Rather, I believe that it 
is a meta-development that might very well change what 
mathematics is considered to be. 



— P. J. Davis [ ] 



The trivial solution u(x, t) = 0 is an equilibrium point of (26.2), but that is basically 
all we know as far as useful analytical solutions are concerned. To develop some 
intuition about the dynamics we turn to numerical simulations. 

How are solutions such as figure 26. 1 (b) computed? The salient feature of 
such partial differential equations is a theorem saying that for state space con- 
tracting flows, the asymptotic dynamics is describable by a finite set of 'inertial 
manifold' ordinary differential equations. How you solve the equation (26.2) nu- 
merically is up to you. Here are some options: 

Discrete mesh: You can divide the x interval into a sufficiently fine discrete grid of 
N points, replace space derivatives in (26.2) by approximate discrete derivatives, 
and integrate a finite set of first order differential equations for the discretized 
spatial components Uj(t) = u(jL/N, t), by any integration routine you trust. 

Fourier modes: You can integrate numerically the Fourier modes (26.8), trun- 
cating the ladder of equations to a finite number of modes N, i.e., set = 0 for 
k > N. In the applied mathematics literature more sophisticated variants of such 

exercis 

truncations are called Gdlerkin truncations, or Galerkin projections. You need to 
worry about 'stiffness' of the equations and the stability of your integrator. For the 
parameter values explored in this chapter, truncations ,/V in range 16 to 64 yield 
sufficient accuracy. 

Pseudo-spectral methods: You can mix the two methods, exploiting the speed 
of Fast Fourier Transforms. 

Example 26.2 Kuramoto-Sivashinsky simulation, antisymmetric subspace: To 

get started, we set v = 0.029910, L-2n in the Kuramoto-Sivashinsky equation (26.2), 
or, equivalently v = 1, L = 36.33052 in the non-dimensionalized (26.9). Consider the 
antisymmetric subspace (26.9), so the non-dimensionalized system size is L = L/2n = 
2.89109. Truncate (26.9) to 0 < k < 16, and integrate an arbitrary initial condition. Let 
the transient behavior settle down. 
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Figure 26.3: Projections of a typical 16- 
dimensional trajectory onto different 3- 
dimensional subspaces, coordinates (a) [a\, 02,03), 
(b) {a u a 2 ,a A }. System size L = 2.89109, N = 16 
Fourier modes truncation. (From ref. [12].) 




Why this L? For this system size L the dynamics appears to be chaotic, as 
far as can be determined numerically. Why N = 16? In practice one repeats the 
same calculation at different truncation cutoffs N, and makes sure that the inclusion of 
additional modes has no effect within the desired accuracy. For this system size N = 16 
suffices. 

Once a trajectory is computed in Fourier space, we can recover and plot the 
corresponding spatiotemporal pattern u(x, t) over the configuration space using (26.7), 
as in figure 26. 1 (b) and figure 26.2. Such patterns give us a qualitative picture of the 
flow, but no detailed dynamical information; for that, tracking the evolution in a high- 
dimensional state space, such as the space of Fourier modes, is much more informa- 
tive. 



26.3 Visualization 



The ultimate goal, however, must be a rational theory of 
statistical hydrodynamics where [• • ■] properties of turbu- 
lent flow can be mathematically deduced from the funda- 
mental equations of hydromechanics. 
— E. Hopf 



The problem with high-dimensional representations, such as truncations of 
the infinite tower of equations (26.8), is that the dynamics is difficult to visualize. 
The best we can do without much programming is to examine the trajectory's 
projections onto any three axes aj,oy, a&, as in figure 26.3. 

The question is: how is one to look at such a flow? It is not clear that restricting 
the dynamics to a Poincare section necessarily helps - after all, a section reduces 
a {d + l)-dimensional flow to a <i-dimensional map, and how much is gained by 
replacing a continuous flow in 16 dimensions by a set of points in 15 dimensions? 
The next example illustrates the utility of visualization of dynamics by means of 
Poincare sections. 



section 26.3 



Example 26.3 Kuramoto-Sivashinsky Poincare return maps: Consider the 
Kuramoto-Sivashinsky equation in the N Fourier modes representation. We pick (arbi- 
trarily) the hyperplane a\ = 0 as the Poincare section, and integrate (26.8) with a\ = 0, 
and an arbitrary initial point (a 2 ,...,a N ). When the flow crosses the ai = 0 hyper- 
plane in the same direction as initially, the initial point is mapped into (a' 2 ,...a' N ) - 
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Figure 26.4: The attractor of the Kuramoto-Sivashin- 
sky system (26.8), plotted as the cif, component of the 
cii =0 Poincare section return map. Here 10,000 
Poincare section returns of a typical trajectory are plot- 
ted. Also indicated are the periodic points 0, 1,01 and 
10. System size L = 2.89109, N = 16 Fourier modes 
truncation. (From ref. [12].) 




P(a 2 , . . . , a N ). This defines P, the Poincare return map (3.1) of the (N - X)-dimensional 
a\ = 0 hyperplane into itself. 

Figure 26.4 is a typical result. We have picked - again arbitrarily - a subspace 
such as a 6 (n + 1) vs. a 6 (n) in order to visualize the dynamics. While the topology of the 
attractor is still obscure, one thing is clear: even though the flow state space is infinite 
dimensional, the attractor is finite and thin, barely thicker than a line. 



The above example illustrates why a Poincare section gives a more informa- 
tive snapshot of the flow than the full flow portrait. While no fine structure is 
discernible in the full state space flow portraits of the Kuramoto-Sivashinsky dy- 
namics, figure 26.3, the Poincare return map figure 26.4 reveals the fractal struc- 
ture in the asymptotic attractor. 

In order to find a better representation of the dynamics, we now turn to its 
topological invariants. 



26.4 Equilibria of equilibria 

(Y. Lan and P. Cvitanovic) 

The set of equilibria and their stable / unstable manifolds form the coarsest topo- 
logical framework for organizing state space orbits. 

The equilibrium condition u t - 0 for the Kuramoto-Sivashinsky equation PDE 
(26.4) is the ODE 

1 2 

which can be analyzed as a dynamical system in its own right. Integrating once 
we get 

1 2 

—u +u x + u xxx = c, (26.11) 
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where c is an integration constant whose value strongly influences the nature of the 
solutions. Written as a 3 -dimensional dynamical system with spatial coordinate x 
playing the role of "time," this is a volume preserving flow 

u x — v , v x - w , w x — u - v - c , (26.12) 
with the "time" reversal symmetry, 

x — > —x, u — > —u, v —> v, w —> —w . 
From (26.12) we see that 

(u + W)x — u - c . 

If c < 0, u + w increases without bound with x — > oo, and every solution escapes 
to infinity. If c - 0, the origin (0, 0, 0) is the only bounded solution. 

For c > 0 there is much c-dependent interesting dynamics, with complicated 
fractal sets of bounded solutions. The sets of the solutions of the equilibrium 
condition (26.12) are themselves in turn organized by the equilibria of the equi- 
librium condition, and the connections between them. For c > 0 the equilibrium 
points of (26.12) are c + = ( sfc, 0, 0) and c_ = (- Vc, 0, 0). Linearization of the 
flow around c+ yields Floquet multipliers [2A , -A ± id] with 

A = — — sinh cf> , 6 - cosh (p , 
V3 

and (p fixed by sinh 30 = 3 V5c. Hence c + has a 1 -dimensional unstable manifold 
and a 2-dimensional stable manifold along which solutions spiral in. By the x — > 
"time reversal" symmetry, the invariant manifolds of c- have reversed stability 
properties. 

The non-wandering set of this dynamical system is quite pretty, and surpris- 
ingly hard to analyze. However, we do not need to explore the fractal set of the 
Kuramoto-Sivashinsky equilibria for infinite size system here; for a fixed system 
size L with periodic boundary condition, the only surviving equilibria are those 
with periodicity L. They satisfy the equilibrium condition for (26.8) 

il ~ ilk) a k + X amClk - m ~ 0 • (26. 13) 

m=-oo 

Periods of spatially periodic equilibria are multiples of L. Every timeL crosses an 
integer value L - n, Ti-cell states are generated through pitchfork bifurcations. In 
the full state space they form an invariant circle due to the translational invariance 
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Figure 26.5: The non-wandering set under study 
appears to consist of three patches: the left part 
(S l), the center part (Sc) and the right part (Sr), 
each centered around an unstable equilibrium: (a) 
central Ci equilibrium, (b) side R t equilibrium on 
the interval [0, L]. 
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of (26.4). In the antisymmetric subspace considered here, they corresponds to two 
points, half-period translates of each other of the form 



i(x, t) = —2 a kn sin(knx) , 



where at n e K.. 

For any fixed spatial period L the number of spatially periodic solutions is fi- 
nite up to a spatial translation. This observation can be heuristically motivated as 
follows. Finite dimensionality of the inertial manifold bounds the size of Fourier 
components of all solutions. On a finite-dimensional compact manifold, an ana- 
lytic function can only have a finite number of zeros. So, the equilibria, i.e., the 
zeros of a smooth velocity field on the inertial manifold, are finitely many. 

For a sufficiently small L the number of equilibria is small, mostly concen- 
trated on the low wave number end of the Fourier spectrum. These solutions may 
be obtained by solving the truncated versions of (26.13). 



Example 26.4 Some Kuramoto-Sivashinsky equilibria: 

See figure 26.5. 



26.5 Why does a flame front flutter? 



I understood every word. 
— Fritz Haake 

We start by considering the case where ciq is an equilibrium point (2.9). Ex- 
panding around the equilibrium point Oq, and using the fact that the matrix A = 
A{a q ) in (4.2) is constant, we can apply the simple formula (5.1) also to the Jaco- 
bian matrix of an equilibrium point of a PDE, 



section 18.2 



J'{a q ) 



A = A{a q ) . 
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Figure 26.6: Lyapunov exponents A- { k versus k for the 
least unstable spatio-temporally periodic orbit 1 of the 
Kuramoto-Sivashinsky system, compared with the Flo- 
quet exponents of the u(x, 0 = 0 stationary solution, 
A k = k 2 - vk 4 . The eigenvalue A- k for k > 8 falls be- 
low the numerical accuracy of integration and are not 
meaningful. The cycle 1 was computed using meth- 
ods of chapter 13. System size L = 2.89109, N = 16 
Fourier modes truncation. (From ref. [12]) 



Example 26.5 Stability matrix, antisymmetric subspace: The Kuramoto-Siva- 
shinsky flat flame front u(x, t) = 0 is an equilibrium point of (26.2). The stability matrix 
(4.3) follows from (26.8) 

A k j{a) = = (qj ~ qttSkj - qkak-j . (26. 14) 

For the u(x, t) = 0 equilibrium solution the stability matrix is diagonal, and - as in (4.31) 
- so is the Jacobian matrix J' kj (0) = d^e^ - ^' . 

For L < 1 , u(x, t) = 0 is the globally attractive stable equilibrium. As the 
system size L is increased, the "flame front" becomes increasingly unstable and 
turbulent, the dynamics goes through a rich sequence of bifurcations on which we 
shall not dwell here. 

The long wavelength perturbations of the flat-front equilibrium are linearly 
unstable, while all short wavelength perturbations are strongly contractive. The 
high k eigenvalues, corresponding to rapid variations of the flame front, decay so 
fast that the corresponding eigen-directions are physically irrelevant. To illustrate 
the rapid contraction in the non-leading eigen-directions we plot in figure26.6 the 
eigenvalues of the equilibrium in the unstable regime, for relatively small system 
size, and compare them with the Floquet multipliers of the least unstable cycle 
for the same system size. The equilibrium solution is very unstable, in 5 eigen- 
directions, the least unstable cycle only in one. Note that for k > 7 the rate of 
contraction is so strong that higher eigen-directions are numerically meaningless 
for either solution; even though the flow is infinite-dimensional, the attracting set 
must be rather thin. 

While in general for L sufficiently large one expects many coexisting attrac- 
tors in the state space, in numerical studies most random initial conditions settle 
converge to the same chaotic attractor. 

From (26.8) we see that the equilibrium u(x, t) = 0 has Fourier modes as 
the linear stability eigenvectors. For \k\ < L, the corresponding Fourier modes 
are unstable. The most unstable mode has k = L/ V2 and defines the scale of 
basic building blocks of the spatiotemporal dynamics of the Kuramoto-Sivashin- 
sky equation in large system size limit. 
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Consider now the case of initial sufficiently small that the bilinear G^a&_ m 
terms in (26.8) can be neglected. Then we have a set of decoupled linear equa- 
tions for ak whose solutions are exponentials, at most a finite number for which 
k 2 > vk 4 is growing with time, and infinitely many with vk 4 > k 2 decaying in 
time. The growth of the unstable long wavelengths (low \k\) excites the short 
wavelengths through the a m ak- m nonlinear term. The excitations thus transferred 
are dissipated by the strongly damped short wavelengths, and a "chaotic equilib- 
rium" can emerge. The very short wavelengths \k\ » 1/V^ remain small for all 
times, but the intermediate wavelengths of order \k\ ~ 1 / ^fv play an important role 
in maintaining the dynamical equilibrium. As the damping parameter decreases, 
the solutions increasingly take on shock front character poorly represented by the 
Fourier basis, and many higher harmonics may need to be kept in truncations of 
(26.8). 

Hence, while one may truncate the high modes in the expansion (26.8), care 
has to be exercised to ensure that no modes essential to the dynamics are chopped 
away. 

In other words, even though our starting point (26.2) is an infinite-dimensional 
dynamical system, the asymptotic dynamics unfolds on a finite-dimensional at- 
tracting manifold, and so we are back on the familiar territory of sect. 2.2: the 
theory of a finite number of ODEs applies to this infinite-dimensional PDE as 
well. 

We can now start to understand the remark on page 43 that for infinite di- 
mensional systems time reversibility is not an option: evolution forward in time 
strongly damps the higher Fourier modes. There is no turning back: if we re- 
verse the time, the infinity of high modes that contract strongly forward in time 
now explodes, instantly rendering evolution backward in time meaningless. As so 
much you are told about dynamics, this claim is also wrong, in a subtle way: if 
the initial u(x, 0) is in the non-wandering set (2.3), the trajectory is well defined 
both forward and backward in time. For practical puiposes, this subtlety is not of 
much use, as any time-reversed numerical trajectory in a finite-mode truncation 
will explode very quickly, unless special precautions are taken. 

When is an equilibrium important? There are two kinds of roles equilibria 
play: 

"Hole " in the natural measure. The more unstable eigen-directions it has (for 
example, the u = 0 solution), the more unlikely it is that an orbit will recur in its 
neighborhood. 

Unstable manifold of a "least unstable" equilibrium. Asymptotic dynamics 
spends a large fraction of time in neighborhoods of a few equilibria with only a 
few unstable eigen-directions. 



PDEs- 4apr2013 



ChaosBook.org versionI4.4.1, Apr 21 2013 



CHAPTER 26. TURBULENCE? 



540 



0.8r 



Figure 26.7: The Poincare return map of the 
Kuramoto-Sivashinsky system (26.8) figure 26.4, from 
the unstable manifold of the 1 fixed point to the (neigh- 
borhood of) the unstable manifold. Also indicated are 
the periodic points 0 and 01. 
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26.6 Intrinsic parametrization 

Both in the Rossler flow of example 3.3, and in the Kuramoto-Sivashinsky system 
of example 26.3 we have learned that the attractor is very thin, but otherwise the 
return maps that we found were disquieting - neither figure 3.3 nor figure 26.4 
appeared to be one-to-one maps. This apparent loss of invertibility is an artifact of 
projection of higher-dimensional return maps onto lower-dimensional subspaces. 
As the choice of lower-dimensional subspace is arbitrary, the resulting snapshots 
of return maps look rather arbitrary, too. Other projections might look even less 
suggestive. 

Such observations beg a question: Does there exist a 'natural', intrinsically 
optimal coordinate system in which we should plot of a return map? 

As we shall now argue (see also sect. 13.1), the answer is yes: The intrinsic 
coordinates are given by the stable/unstable manifolds, and a return map should 
be plotted as a map from the unstable manifold back onto the immediate neigh- 
borhood of the unstable manifold. 

Examination of numerical plots such as figure26.3 suggests that a more thought- 
ful approach would be to find a coordinate transformation y = h(x) to a 'center 
manifold', such that in the new, curvilinear coordinates large-scale dynamics takes 
place in (yi,y2) coordinates, with exponentially small dynamics in y$, y<\ ■ ■ ■. But 
- thinking is extra price - we do not know how to actually accomplish this, and we 
do not believe it can be accomplished globally. 

Both in the example of the Rossler flow and of the Kuramoto-Sivashinsky 
system we sketched the attractors by running a long chaotic trajectory, and noted 
that the attractors are very thin, but otherwise the return maps that we plotted were 
disquieting - neither figure 3.3 nor figure 26.4 appeared to be 1-to-l maps. In this 
section we show how to use such information to approximately locate cycles. 
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26.7 Energy budget 



Mathematical physics is three things: Gaussian integrals, 
integration by parts and ... (nobody remembers exactly 
what the third thing was, including Joel). 
— Joel Lebowitz, in a seminar 

In physical settings where the observation times are much longer than the dy- 
namical 'turnover' and Lyapunov times (statistical mechanics, quantum physics, 
turbulence) periodic orbit theory provides highly accurate predictions of measur- 
able long-time averages such as the dissipation and the turbulent drag. Physical 
predictions have to be independent of a particular choice of ODE representation 
of the PDE under consideration and invariant under all symmetries of the dynam- 
ics. In this section we discuss a set of such physical observables for the l-d KS 
invariant under reflections and translations. Here we shall show that they offer 
a visualization of solutions of dynamics in which the symmetries are explicitly 
quotiented out. 

The space average of a function a = a(x, t) = a(u(x, t)) periodic on the interval 
L is given by 



(a) - - CD dx a(x,t) , (26.15) 



We note that total derivatives vanish by the spatial periodicity on the L domain, 
and that by integration by parts 

(A) = 0, (f x g) = -(fg x ) (26.16) 

for any L-periodic functions /, g. In general (a) is time dependent. Its mean value 
is given by the time average 

ir* i r' i r 

a = lim - dr (a) = lim - — (£) dT dx a(x, t) . (26.17) 
t->°° t Jo t Jo LJ 

The mean value of a q = a{u q ) evaluated on equilibrium or relative equilibrium 

u(x, t) = u q (x - ct), is 

a q = (a) q =a q . (26.18) 

Evaluation of the infinite time average (26.17) on a function of a periodic orbit 
or relative periodic orbit u p (x, t) - u p (x + d p , t + T p ) requires only a single T p 
traversal, 



ap = T L 

1 p Jo 



dr(a). (26.19) 
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Equation (26.2) can be written as 

u t --V x , V(x,t) = ju 2 + u x + u xxx , (26.20) 

and E can be interpreted as the mean energy density (26.21). So, even though KS 
is a phenomenological small-amplitude equation, the time-dependent Z? norm of 
u (simplified using integration by parts, as in (26.16)), 



E= ^£dxV(x,t) = jSdx 1 ^- , (26.21) 



has a physical interpretation as the average 'energy' density of the flame front. 
This analogy to the mean kinetic energy density for the Navier-Stokes motivates 
what follows. 

The energy (26.21) is intrinsic to the flow, independent of the particular ODE 
basis set chosen to represent the PDE. As the Fourier amplitudes are eigenvec- 
tors of the translation operator, in the Fourier space the energy is a diagonalized 
quadratic norm, 



E = Y,E k , E k = \\a k \ 2 , (26.22) 
k=i 

and explicitly invariant term by term under translations and reflections. 

Take time derivative of the energy density (26.21), substitute (26.2) and inte- 
grate by parts, as in (26.16): 

E = (u t u) - - ((u 2 /2 + u x + u xxx } 

= (u x u 2 /2 + u 2 + u x Uxxxj ■ (26.23) 

The first term in (26.23) vanishes by integration by parts, 3(u x u 2 ^ = ({u 3 ) x ^ - 0 , 
and integrating the third term by parts yet again one gets that the energy variation 
in the Kuramoto-Sivashinsky equation (26.2) 

E = P-D, P = {u 2 ), D = {u xx 2 ) (26.24) 

balances the power P pumped in by anti-diffusion u xx against the energy dissipa- 
tion rate D by hyper- viscosity u xxxx . 

In figure 26.8 we plot the power input (u 2 ^ vs. dissipation (u xx 2 ^ for all 
L -22 equilibria and relative equilibria determined so far, several periodic orbits 
and relative periodic orbits, and for a typical "turbulent" evolution. The time 
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Figure 26.8: Power input (u x 2 \ vs. dissipation (u xx 2 \ 
for L = 22 equilibria and relative equilibria, for sev- 
eral periodic orbits and relative periodic orbits, and 
for a typical 'turbulent' state. Note that (u p , x ) 2 of the 
{T p ,d p ) = (32.8,10.96) relative periodic orbitwhich 
appears well embedded within the turbulent state, is 
close to the turbulent expectation (u x ) 2 . 

Figure 26.9: EQ { (red), EQ 2 (green), EQ 3 (blue), 
connections from EQ\ to A(L/A)EQi (green), from 
A(E/4)EQ[ to EQ t (yellow-green) and from EQ 3 to 
A(E/4)EQi (blue), along with a generic long-time 
"turbulent" evolution (grey) for L = 22. Three differ- * 
ent projections of the (E, (u x 2 \ , («h 2 )) ~ ( m a- 2 )) rep- 
resentation are shown. 



averaged energy density E computed on a typical orbit goes to a constant, so the 
mean values (26.17) of drive and dissipation exactly balance each other: 

E = lim- f dTE = P-D = Q. (26.25) 

In particular, the equilibria and relative equilibria fall onto the diagonal in fig- 
ure ?? (a), and so do time averages computed on periodic orbits and relative peri- 
odic orbits: 

- 1 C T " - 1 C T " - 

E p = — dxE{f) , P P = — drP(T) = D p . (26.26) 
1 p Jo 1 p Jo 

In the Fourier basis (26.22) the conservation of energy on average takes form 



0 = Z («* ~ $ Ek ' Ek{t) = 5^wi 2 ■ < 26 - 27 ) 

k=-oo 

The large k convergence of this series is insensitive to the system size L; i& have 
to decrease much faster than q^ 4 . Deviation of Ek from this bound for small k 
determines the active modes. This may be useful to bound the number of equilib- 
ria, with the upper bound given by zeros of a small number of long wavelength 
modes. 
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Resume 



Turbulence is the graveyard of theories 
— Hans W. Liepmann 



We have learned that an instanton is an analytic solution of Yang-Mills equa- 
tions of motion, but shouldn't a strongly nonlinear field theory dynamics be dom- 
inated by turbulent solutions? How are we to think about systems where every 
spatiotemporal solution is unstable? 

Here we think of turbulence in terms of recurrent spatiotemporal patterns. 
Pictorially, dynamics drives a given spatially extended system through a repertoire 
of unstable patterns; as we watch a turbulent system evolve, every so often we 
catch a glimpse of a familiar pattern: 



For a finite spatial resolution and a finite time, a pattern belonging to a finite al- 
phabet of admissible patterns is observed; the long term dynamics can be thought 
of as a walk through the space of such patterns. Recasting this image into mathe- 
matics is the subject of this book. 

The problem one faces with high-dimensional flows is that their topology is 
hard to visualize, and that even with a decent starting guess for a point on a peri- 
odic orbit, methods like the Newton-Raphson method are likely to fail. Methods 
that start with initial guesses for a number of points along the cycle, such as the 
multipoint shooting method of sect. 13.3, are more robust. The relaxation (or 
variational) methods take this strategy to its logical extreme, and start by a guess 
of not a few points along a periodic orbit, but a guess of the entire orbit. As 
these methods are intimately related to variational principles and path integrals, 
we postpone their introduction to chapter 29. 

At present the theory is in practice applicable only to systems with a low 
intrinsic dimension - the minimum number of coordinates necessary to capture its 
essential dynamics. If the system is very turbulent (a description of its long time 
dynamics requires a space of very high intrinsic dimension) we are out of luck. 



Remark 26.1 Model PDE systems. The theorem on finite dimensionality of inertial 
manifolds of state space contracting PDE flows is proven in ref. [11]. The Kuramoto- 
Sivashinsky equation was introduced in refs. [?, 3], Holmes, Lumley and Berkooz [13] 




other swirls 
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offer a delightful discussion of why this system deserves study as a staging ground for 
studying turbulence in full-fledged Navier-Stokes equation. How good a description of a 
flame front this equation is not a concern here; suffice it to say that such model amplitude 
equations for interfacial instabilities arise in a variety of contexts - see e.g. ref. [ 14] - 
and this one is perhaps the simplest physically interesting spatially extended nonlinear 
system. 

Physical interpretation of the average "energy" density of the flame front of sect. 26.7, 
in analogy with the mean kinetic energy density for the Navier-Stokes, comes from Greene 
and Kim paper [7], a recommended reading. For equilibria the L-independent bound on 
E is given by Michelson [ ]. The best current bound [16, 17] on the long-time limit of 
£ as a function of the system size L scales as E oc L 3/2 . 

The work described in this chapter was initiated by Putkaradze's 1996 term project 
(see ChaosBook . org/extras), and continued by Christiansen, Cvitanovic, Davidchack, 
Gibson, Halcrow, Lan, and Siminos [ 12, 18, 19, ?, ?, 1, 5, 20]. 



Exercises 



26. 1 . Galilean invariance of the Kuramoto-Sivashinsky equation. 



(a) Verify that the Kuramoto-Sivashinsky equation is 
Galilean invariant: if u(x, t) is a solution, then 
v + u(x + 2vt, t), with v an arbitrary constant ve- 
locity, i s also a solution. 

(b) Verify that mean 



i r 

(u) = — I dxu 
L Jl 



is conserved by the flow. 

(c) Argue that the choice ( 26.3) of the vanishing mean 
velocity, (u) = 0 leads to no loss of generality in 
calculations that follow. 



(d) 



[thinking is extra cost] Inspection of 
various "turbulent" solutions of Kuramoto-Siva- 
shinsky equation reveals subregions of "traveling 
waves" with locally nonzero (u). Is there a way 
to use Galilean invariance locally, even though we 
eliminated it by the (u) = 0 condition? 



as an example, a concentration 0 diffusing on R accord- 
ing to the diffusion equation 

d t </> = lv 2 0. 

(a) Interpret the partial differential equation as an infi- 
nite dimensional dynamical system. That is, write 
it as x = F{x) and find the velocity field. 

(b) Show by examining the norm 

||0|| 2 = f dxcp\x) 

that the vector field F is not continuous. 

(c) Try the norm 

||0| = sup |#x)|. 

xeR 

Is F continuous? 

(d) Argue that the semi-flow nature of the problem is 
not the cause of our difficulties. 

(e) Do you see a way of generalizing these results? 



26.2. Infinite dimensional dynamical systems are not 2 6.3. Kuramoto-Sivashinsky energy transfer rates, 
smooth. Many of the operations we consider natural 

for finite dimensional systems do not have smooth be- (a) Derive (26.21) from (26.20). Now that you have 

havior in infinite dimensional vector spaces. Consider, your integration by parts skills hone, also show 
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that 

(u.xxxU 2 ) = (u x 3 ) 
(uxxxxxu 2 ) = -5 (u x u xx 2 ) . (26.28) 

(b) Derive the power - dissipation rate relation 
(26.24). 

(c) Prove that for an equilibrium E is constant. 

(d) Derive formulas for P, D, E and j t (u x / in terms 
of space averages (■ ■ ■). You will note that higher 
derivatives of u appear. The guiding principle is to 
use integration by parts until the number of such 
derivatives is minimized. 

(e) Invent another such formula. 

26.4. Navier-Stokes energy transfer rates. 

The Millenium Prize tempts you to ponder the Navier- 
Stokes equations 

d,Vj + vjdjVj = -dtp + vdjjVj (26.29) 



in the utterly unphysical setting, a periodic 3D box of 
size [L x L x L]. The space average of a function 
a = a(x, t) = a{v(x, ?)) on the interval L is given by 

(a) = —^>dx 3 a(x, t) . (26.30) 

(a) Prove conservation of momentum 

^(Vi> = 0 (26.31) 
at 

(b) Prove power-dissipation rate relation 

\jM)-^ 2 \) ^ 

(c) Prove conservation of helicity. 

1 d 

<v ■ oj) = -v (cj ■ V x cd) (26.33) 

2 dt 

(d) While you are on the roll: derive another such 
formula. Pipe or plane Couette flow power- 
dissipation relation E — P — D would be partic- 
ularly useful. 
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Irrationally winding 



I don't care for islands, especially very small ones. 
— D.H. Lawrence 

(R. Artuso and P. Cvitanovic) 

This chapter is concerned with the mode locking problems for circle maps: 
besides its physical relevance it nicely illustrates the use of cycle expan- 
sions away from the dynamical setting, in the realm of renormalization 
theory at the transition to chaos. 

The physical significance of circle maps is connected with their ability to 
model the two-frequencies mode-locking route to chaos for dissipative systems. 
In the context of dissipative dynamical systems one of the most common and 
experimentally well explored routes to chaos is the two-frequency mode-locking 
route. Interaction of pairs of frequencies is of deep theoretical interest due to the 
generality of this phenomenon; as the energy input into a dissipative dynamical 
system (for example, a Couette flow) is increased, typically first one and then two 
of intrinsic modes of the system are excited. After two Hopf bifurcations (a fixed 
point with inward spiralling stability has become unstable and outward spirals to 
a limit cycle) a system lives on a two-torus. Such systems tend to mode-lock: 
the system adjusts its internal frequencies slightly so that they fall in step and 
minimize the internal dissipation. In such case the ratio of the two frequencies 
is a rational number. An irrational frequency ratio corresponds to a quasiperiodic 
motion - a curve that never quite repeats itself. If the mode-locked states overlap, 
chaos sets in. The likelihood that a mode-locking occurs depends on the strength 
of the coupling of the two frequencies. 

Our main concern in this chapter is to illustrate the "global" theory of circle 
maps, connected with universality properties of the whole irrational winding set. 
We shall see that critical global properties may be expressed via cycle expansions 
involving "local" renormalization critical exponents. The renormalization theory 
of critical circle maps demands rather tedious numerical computations, and our 
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Figure 27.1: Unperturbed circle map (k = 0 in (27.1)) 
with golden mean rotation number. 



f(x) 




X 



intuition is much facilitated by approximating circle maps by number-theoretic 
models. The models that arise in this way are by no means mathematically triv- 
ial, they turn out to be related to number-theoretic abysses such as the Riemann 
conjecture, already in the context of the "trivial" models. 



27.1 Mode locking 

The simplest way of modeling a nonlinearly perturbed rotation on a circle is by 
1 -dimensional circle maps x —> x 1 = f(x), restricted to the one dimensional torus, 
such as the sine map 

k 

x n +\ = f(xn) - x n + Q - — sin(27rx„) mod 1 . (27.1) 

2n 

f(x) is assumed to be continuous, have a continuous first derivative, and a con- 
tinuous second derivative at the inflection point (where the second derivative van- 
ishes). For the generic, physically relevant case (the only one considered here) the 
inflection is cubic. Here k parametrizes the strength of the nonlinear interaction, 
and Q. is the bare frequency. 

The state space of this map, the unit interval, can be thought of as the elemen- 
tary cell of the map 

x n +i = f(x„) = x n + O - ^- sin(27rx„) . (27.2) 

2tt 

where "is used in the same sense as in chapter 25. 
The winding number is defined as 



W(k,Q.) = lim(x n - xo)/n. 



(27.3) 



n — >oo 



and can be shown to be independent of the initial value %. 
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For k = 0, the map is a simple rotation (the shift map) see figure 27.1 

x n +\ - x n + Q mod 1 , (27.4) 
and the rotation number is given by the parameter Q. 
W(k = 0, Q) = Q . 

For given values of Q. and k the winding number can be either rational or irra- 
tional. For invertible maps and rational winding numbers W = P/Q the asymptotic 
iterates of the map converge to a unique attractor, a stable periodic orbit of period 

Q 

f Q {xi) = x i + p, ; = o,i,2,---,<2-i . 

This is a consequence of the independence of y$ previously mentioned. There is 
also an unstable cycle, repelling the trajectory. For any rational winding number, 
there is a finite interval of values of Q. values for which the iterates of the circle 
map are attracted to the P/Q cycle. This interval is called the P/Q mode-locked 
(or stability) interval, and its width is given by 

Ap /G = Q- 2 ^ = - d e p f Q . (27.5) 

where Qp^Q i^pfg) denote the biggest (smallest) value of Q. for which W(k, D.) = 
P/Q. Parametrizing mode lockings by the exponent p rather than the width A 
will be convenient for description of the distribution of the mode-locking widths, 
as the exponents p turn out to be of bounded variation. The stability of the P/ Q 
cycle is 

Ap/e = § = /'(*o)/'(*i) ■ • • f{xQ-x) 

For a stable cycle |Ap/g| lies between 0 (the superstable value, the "center" of the 
stability interval) and 1 (the ^pfg, ^pfn endpoints of (27.5)). For the shift map 
(27.4), the stability intervals are shrunk to points. As Q. is varied from 0 to 1, 
the iterates of a circle map either mode-lock, with the winding number given by 
a rational number P/Q e (0, 1), or do not mode-lock, in which case the winding 
number is irrational. A plot of the winding number W as a function of the shift 
parameter Q is a convenient visualization of the mode-locking structure of circle 
maps. It yields a monotonic "devil's staircase" of figure 27.2 whose self-similar 
structure we are to unravel. Circle maps with zero slope at the inflection point ^ 
(see figure 27.3) 

f'(x c ) = 0, f"(x c ) = 0 



exercise 27.1 
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Figure 27.2: The critical circle map (k = 1 in (27.1)) 
devil's staircase [3]; the winding number W as function 
of the parameter CI. 



1.0 



Figure 27.3: Critical circle map (k = 1 in (27.1)) with 
golden mean bare rotation number. 



0.8 - 




X 



(k - 1, x c - 0 in (27.1)) are called critical: they delineate the borderline of chaos 
in this scenario. As the nonlinearity parameter k increases, the mode-locked 
intervals become wider, and for the critical circle maps (k = 1) they fill out the 
whole interval. A critical map has a superstable P/Q cycle for any rational P/Q, 
as the stability of any cycle that includes the inflection point equals zero. If the 
map is non-invertible (k > 1), it is called supercritical; the bifurcation structure of 
this regime is extremely rich and beyond the scope of this exposition. 

The physically relevant transition to chaos is connected with the critical case, 
however the apparently simple "free" shift map limit is quite instructive: in essence 
it involves the problem of ordering rationals embedded in the unit interval on a hi- 
erarchical structure. From a physical point of view, the main problem is to identify 
a (number-theoretically) consistent hierarchy susceptible of experimental verifi- 
cation. We will now describe a few ways of organizing rationals along the unit 
interval: each has its own advantages as well as its drawbacks, when analyzed 
from both mathematical and physical perspective. 

27.1.1 Hierarchical partitions of the rationals 

Intuitively, the longer the cycle, the finer the tuning of the parameter Q. required to 
attain it; given finite time and resolution, we expect to be able to resolve cycles up 
to some maximal length Q. This is the physical motivation for partitioning mode 
lockings into sets of cycle length up to Q. In number theory such sets of rationals 
are called Farey series. They are denoted by Tq and defined as follows. The 
Farey series of order Q is the monotonically increasing sequence of all irreducible 
rationals between 0 and 1 whose denominators do not exceed Q. Thus Pi/Qj 
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belongs to Tq if 0 < P, < Qi < Q and (P/IGD = 1. For example 
1 1 1 2 1 3 2 3 4 1 



(1112132341 
5 " 15' 4' 3' 5' 2' 5' 3' 4' 5' T 



A Farey series is characterized by the property that if Pi-i/Qi-i and PilQi are 
consecutive terms of Tq, then 

PiQi-i ~ Pi-iQi = 1. 
The number of terms in the Farey series Fq is given by 



3>(0 - X 0(G) = -4" + O(GlnG). (27.6) 

n=l 



Here the Euler function 0(G) is the number of integers not exceeding and rel- 
atively prime to Q. For example, 0(1) = 1, 0(2) = 1, 0(3) = 2, ...,0(12) = 
4,0(13) - 12,... 

From a number-theorist's point of view, the continued fraction partitioning of 
the unit interval is the most venerable organization of rationals, preferred already 
by Gauss. The continued fraction partitioning is obtained by ordering rationals 
corresponding to continued fractions of increasing length. If we turn this ordering 
into a way of covering the complementary set to mode-lockings in a circle map, 
then the first level is obtained by deleting A[i], Apj, • • • , A[ fll ], • • • mode-lockings; 
their complement are the covering intervals . . . ,l ax , . . . which contain all 
windings, rational and irrational, whose continued fraction expansion starts with 
[ai,...] and is of length at least 2. The second level is obtained by deleting 
A[i,2], A [U] , •••.Ap.2], A [2 ,3], •••,A [ , vn] ,--- and so on. 

The nth level continued fraction partition «5>, = \a\ai • • - a n \ is defined as the 
monotonically increasing sequence of all rationals P/Gi between 0 and 1 whose 
continued fraction expansion is of length n: 



Pi 1 
— - [a\,a%, • • • , a n ] 



Qi " J 1 
a\ h 



1 

a 2 + ... — 



The object of interest, the set of the irrational winding numbers, is in this partition- 
ing labeled by <Sco = {ai£?2fl , 3 ■ ■ •}> Ok e Z + , i.e., the set of winding numbers with 
infinite continued fraction expansions. The continued fraction labeling is particu- 
larly appealing in the present context because of the close connection of the Gauss 
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shift to the renormalization transformation R, discussed below. The Gauss map 



1 

T(x) = - 
x 



x±0 



1 

X 

0 , x = 0 (27.7) 



([• • •] denotes the integer part) acts as a shift on the continued fraction representa- 
tion of numbers on the unit interval 



[ai,a 2 ,a 3 ,...] -> T(x) - [a 2 , a 3 , . . .] . (27.8) 



into the "mother" interval 4 2 a 3 ...- 

However natural the continued fractions partitioning might seem to a number 
theorist, it is problematic in practice, as it requires measuring infinity of mode- 
lockings even at the first step of the partitioning. Thus numerical and experimental 
use of continued fraction partitioning requires at least some understanding of the 
asymptotics of mode-lockings with large continued fraction entries. 

The Farey tree partitioning is a systematic bisection of rationals: it is based 
on the observation that roughly halfways between any two large stability intervals 
(such as 1/2 and 1/3) in the devil's staircase of figure 27. 2 there is the next largest 
stability interval (such as 2/5). The winding number of this interval is given by the 
Farey mediant (P+P')/(Q+Q') of the parent mode-lockings P/Q and P/Q'. This 
kind of cycle "gluing" is rather general and by no means restricted to circle maps; 
it can be attained whenever it is possible to arrange that the Qth iterate deviation 
caused by shifting a parameter from the correct value for the Q-cycle is exactly 
compensated by the <2'th iterate deviation from closing the £7-cycle; in this way 
the two near cycles can be glued together into an exact cycle of length Q+Q. The 
Farey tree is obtained by starting with the ends of the unit interval written as 0/1 
and 1/1, and then recursively bisecting intervals by means of Farey mediants. 

We define the «th Farey tree level T n as the monotonically increasing sequence 
of those continued fractions [a\ ,a 2 ,..., at] whose entries a\ > 1, i — 1, 2, . . . ,k — 
1, a^ > 2, add up to J^ =1 c?,- = n + 2. For example 

r 2 = {[4], [2,2], [1,1, 2], [1,3]} = (I'l'f'l)- (27 " 9) 

The number of terms in T n is 2". Each rational in T n -\ has two "daughters" in T n , 
given by 



[•••,«] 

[•••,£?- 1,2] [•••,« + !] 
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Figure 27.4: Farey tree: alternating binary or- 
dered labeling of all Farey denominators on the nth 
Farey tree level. 




Iteration of this rule places all rationals on a binary tree, labeling each by a unique 
binary label, figure 27.4. 

The smallest and the largest denominator in T n are respectively given by 

[n-2] = — !— , [1,1 1,2] = ^cxp" , (27.10) 

n-2 F n+2 

where the Fibonacci numbers F n are defined by F n+ \ - F n +F„_i; F$ = 0, F\ = 
1 , and p is the golden mean ratio 



1 + V5 

p - — - — - 1.61803... (27.11) 



Note the enormous spread in the cycle lengths on the same level of the Farey tree: 
n < Q < p n . The cycles whose length grows only as a power of the Farey tree level 
will cause strong non-hyperbolic effects in the evaluation of various averages. 

Having defined the partitioning schemes of interest here, we now briefly sum- 
marize the results of the circle-map renormalization theory. 



27.2 Local theory: "Golden mean" renormalization 

• 

^Br The way to pinpoint a point on the border of order is to recursively ad- 
just the parameters so that at the recurrence times t = n\,n%, n-$, • • • the trajectory 
passes through a region of contraction sufficiently strong to compensate for the 
accumulated expansion of the preceding }% steps, but not so strong as to force the 
trajectory into a stable attracting orbit. The renormalization operation R imple- 
ments this procedure by recursively magnifying the neighborhood of a point on 
the border in the dynamical space (by rescaling by a factor a), in the parameter 
space (by shifting the parameter origin onto the border and rescaling by a factor <5), 



irrational - 22sep2000 



ChaosBook.org version 14.4. 1, Apr 21 2013 



CHAPTER 27. IRRATIONALLY WINDING 



556 



and by replacing the initial map / by the nth iterate f restricted to the magnified 
neighborhood 



There are by now many examples of such renormalizations in which the new func- 
tion, framed in a smaller box, is a rescaling of the original function, i.e., the fix- 
point function of the renormalization operator R. The best known is the period 
doubling renormalization, with the recurrence times n- = 2'. The simplest circle 
map example is the golden mean renormalization, with recurrence times i\ = F, 
given by the Fibonacci numbers (27.10). Intuitively, in this context a metric self- 
similarity arises because iterates of critical maps are themselves critical, i.e., they 
also have cubic inflection points with vanishing derivatives. 

The renormalization operator appropriate to circle maps acts as a generaliza- 
tion of the Gauss shift (27.38); it maps a circle map (represented as a pair of 
functions (g,f), of winding number [a,b,c, . . .] into a rescaled map of winding 
number [b,c, ...]: 



Acting on a map with winding number [a, a, a,. . .],Ra returns a map with the same 
winding number [a, a, . . .], so the fixed point of Ra has a quadratic irrational wind- 
ing number W = [a,a,a, . . .]. This fixed point has a single expanding eigenvalue 
5 a . Similarly, the renormalization transformation Ra ...R ai R ai = R aia2 ...a p has 
a fixed point of winding number W p = [a\,a2, . ■ . ,a n ,a\,U2, . ■ .], with a single 
expanding eigenvalue 8 p . 

For short repeating blocks, 8 can be estimated numerically by comparing suc- 
cessive continued fraction approximants to W. Consider the P r /Q r rational ap- 
proximation to a quadratic irrational winding number W p whose continued frac- 
tion expansion consists of r repeats of a block p. Let Q be the parameter for which 
the map (27.1) has a superstable cycle of rotation number P r IQ r = [p,p,. . . , p]. 
The 8 P can then be estimated by extrapolating from 



What this means is that the "devil's staircase" of figure 27.2 is self-similar under 
magnification by factor 5 p around any quadratic irrational W p . 

The fundamental result of the renormalization theory (and the reason why all 
this is so interesting) is that the ratios of successive P r / Q r mode-locked intervals 
converge to universal limits. The simplest example of (27.13) is the sequence of 
Fibonacci number continued fraction approximants to the golden mean winding 
number W = [1,1,1,...] =(V5- l)/2. 



f p {x) Rf p {x) = af"(x/a) 




(27.12) 



n,. - O r+1 oc 5 p r . 



(27.13) 
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When global problems are considered, it is useful to have at least and idea on 
external scaling laws for mode-lockings. This is achieved, in a first analysis, by 
fixing the cycle length Q and describing the range of possible asymptotics. 

For a given cycle length Q, it is found that the narrowest interval shrinks with 
a power law 



Ai/ G <x Q- 3 (27.14) 



For fixed Q the widest interval is bounded by P/Q = F n -i/F n , the nth con- 
tinued fraction approximant to the golden mean. The intuitive reason is that the 
golden mean winding sits as far as possible from any short cycle mode-locking. 

The golden mean interval shrinks with a universal exponent 

Ap /e oc Q- 2 ^ (27.15) 

where P = F n -\, Q - F n and fi\ is related to the universal Shenker number 5\ 
(27. 13) and the golden mean (27. 1 1) by 



yui = = 1.08218... (27.16) 

21np 



The closeness of /u\ to 1 indicates that the golden mean approximant mode-lockings 
barely feel the fact that the map is critical (in the k=0 limit this exponent is /j = 1). 

To summarize: for critical maps the spectrum of exponents arising from the 
circle maps renormalization theory is bounded from above by the harmonic scal- 
ing, and from below by the geometric golden-mean scaling: 

3/2 >Hmin> 1.08218- ••. (27.17) 



27.3 Global theory: Thermodynamic averaging 

Consider the following average over mode-locking intervals (27.5): 



«(T) = Yj Z A P/Q- < 27 - 18) 
Q=l (P\Q)=l 

The sum is over all irreducible rationals P/Q, P < Q, and Ap/g is the width of the 
parameter interval for which the iterates of a critical circle map lock onto a cycle 
of length Q, with winding number P/Q. 
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The qualitative behavior of (27.18) is easy to pin down. For sufficiently neg- 
ative r, the sum is convergent; in particular, for r = -1, O(-l) = 1, as for the 
critical circle maps the mode-lockings fill the entire £1 range [15]. However, as t 
increases, the contributions of the narrow (large Q) mode-locked intervals zV/q 
get blown up to and at some critical value of r the sum diverges. This oc- 

curs for t < 0, as 0(0) equals the number of all rationals and is clearly divergent. 

The sum (27.18) is infinite, but in practice the experimental or numerical 
mode-locked intervals are available only for small finite Q. Hence it is necessary 
to split up the sum into subsets S n = {i} of rational winding numbers Pi/Qi on 
the "level" n, and present the set of mode-lockings hierarchically, with resolution 
increasing with the level: 

Z„(t) = A r T - (27.19) 

ieS„ 

The original sum (27.18) can now be recovered as the z = 1 value of a "gener- 
ating" function Q.(z, r) = 2j„z"Z n (T). As z is anyway a formal parameter, and 
n is a rather arbitrary "level" in some ad hoc partitioning of rational numbers, 
we bravely introduce a still more general, P/ Q weighted generating function for 
(27.18): 

CO 

Q(q,T) = 2 2 e~ 9Vp/e <3 2TWe - (27.20) 
2=1 (P|0=l 

The sum (27. 18) corresponds to q - 0. Exponents vp/q will reflect the importance 
we assign to the P/Q mode-locking, i.e., the measure used in the averaging over 
all mode-lockings. Three choices of of the vp/q hierarchy that we consider here 
correspond respectively to the Farey series partitioning 



n(?,T) = 2<D(0H ^ Q 2T ^ , (27.21) 

Q=l (P|fi)=l 

the continued fraction partitioning 

oo 

Q( 9 , T ) = V e -<? " ^ Q 2 ™ 1 " 1 an] ' (27.22) 

n=\ [ai,..„a„] 

and the Farey tree partitioning 

oo 2" 

Q(g,T) = ^ 2-«" £ ef r/i ' , Qj/Pj e T n . (27.23) 



k=n i= 1 



We remark that we are investigating a set arising in the analysis of the parameter 
space of a dynamical system: there is no "natural measure" dictated by dynamics, 
and the choice of weights reflects only the choice of hierarchical presentation. 
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27.4 Hausdorff dimension of irrational windings 

A finite cover of the set irrational windings at the "nth level of resolution" is 
obtained by deleting the parameter values corresponding to the mode-lockings in 
the subset S n ; left behind is the set of complement covering intervals of widths 

t t = n™ Qr - ttffa . (27.24) 

Here ^p'"q ^ respectively the lower (upper) edges of the mode-locking 

intervals Ap r /Q r (Ap,/g ; ) bounding <?,- and i is a symbolic dynamics label, for ex- 
ample the entries of the continued fraction representation P/Q = \(\,ai, —,a n ] of 
one of the boundary mode-lockings, £i provide a finite cover for 

the irrational winding set, so one may consider the sum 

Z n (r) = £ (27.25) 

ieS„ 

The value of -r for which the n — > oo limit of the sum (27.25) is finite is the 
Hausdorff dimension Dh of the irrational winding set. Strictly speaking, this is 
the Hausdorff dimension only if the choice of covering intervals if is optimal; 
otherwise it provides an upper bound to Djj. As by construction the ^ intervals 
cover the set of irrational winding with no slack, we expect that this limit yields 
the Hausdorff dimension. This is supported by all numerical evidence, but a proof 
that would satisfy mathematicians is lacking. 

The physically relevant statement is that for critical circle maps Dh = 0.870 . . . 
is a (global) universal number. 



exercise 27.2 



27.4.1 The Hausdorff dimension in terms of cycles 

Estimating the n —> oo limit of (27.25) from finite numbers of covering intervals 
i\ is a rather unilluminating chore. Fortunately, there exist considerably more 
elegant ways of extracting Dh- We have noted that in the case of the "trivial" 
mode-locking problem (27.4), the covering intervals are generated by iterations 
of the Farey map (27.37) or the Gauss shift (27.38). The nth level sum (27.25) can 
be approximated by where 

£Ay,x) = 6(x-r l (y))\f'(y)\ T 

This amounts to approximating each cover width % by \df n /dx\ evaluated on the 
ith interval. We are thus led to the following determinant 

de«<l- z X T ) = exp-X2- T -i- 

V P r=\ ' J 
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Ki-*W/a£) 



(27.26) 



p k=0 



The sum (27.25) is dominated by the leading eigenvalue of £ T ; the Hausdorff 
dimension condition Zn(-Du) = 0(1) means that r = -Dh should be such that 
the leading eigenvalue is z = 1. The leading eigenvalue is determined by the 
k = 0 part of (27.26); putting all these pieces together, we obtain a pretty formula 
relating the Hausdorff dimension to the prime cycles of the map f(x): 



°=no 



l/IApl^) 



(27.27) 



For the Gauss shift (27.38) the stabilities of periodic cycles are available analytical- 
ly, as roots of quadratic equations: For example, the Xj fixed points (quadratic 
irrationals with x a = [a, a, a . . .] infinitely repeating continued fraction expansion) 
are given by 



-a + Va 2 +4 



Xn — 



A fl = 



(27.28) 



and the Xab - [a, b, a, b, a,b,.. .] 2-cycles are given by 



X-ab 
Kib 



-ab + y](ab) 2 + 4ab 



2b 



-2 



ab + 2+ yjab(ab + 4) 



(27.29) 



We happen to know beforehand that Dh = 1 (the irrationals take the full mea- 
sure on the unit interval, or, from another point of view, the Gauss map is not a 
repeller), so is the infinite product (27.27) merely a very convoluted way to com- 
pute the number 1? Possibly so, but once the meaning of (27.27) has been grasped, 
the corresponding formula for the critical circle maps follows immediately: 



0 = [~[(l-l/|5p| D ") 



(27.30) 



The importance of this formula relies on the fact that it expresses Dh in terms 
of universal quantities, thus providing a nice connection from local universal ex- 
ponents to global scaling quantities: actual computations using (27.30) are rather 
involved, as they require a heavy computational effort to extract Shenker's scaling 
dp for periodic continued fractions, and moreover dealing with an infinite alpha- 
bet requires control over tail summation if an accurate estimate is to be sought. In 
table 27. 1 we give a small selection of computed Shenker's scalings. 
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Table 27.1: Shenker's 6 P for a few periodic continued fractions, from ref. [ 1 ]. 



p 


s p 


[1111 ...] 


-2.833612 


[2 2 2 2...] 


-6.7992410 


[3 3 3 3...] 


-13.760499 


[4444...] 


-24.62160 


[5 55 5...] 


-40.38625 


[6 6 6 6...] 


-62.140 


[1 2 1 2...] 


17.66549 


[13 13 ...] 


31.62973 


[1 4 14...] 


50.80988 


[15 15 ...] 


76.01299 


[2 32 3...] 


91.29055 



27.5 Thermodynamics of Farey tree: Farey model 



We end this chapter by giving an example of a number theoretical model 
motivated by the mode-locking phenomenology. We will consider it by means of 
the thermodynamic formalism of chapter L, by looking at the free energy. 

Consider the Farey tree partition sum (27.23): the narrowest mode-locked 
interval (27. 15) at the nth level of the Farey tree partition sum (27.23) is the golden 
mean interval 



(27.31) 



It shrinks exponentially, and for t positive and large it dominates q(r) and bounds 
dq{T)ldT. 



In |fr | 
In 2 



1.502642... 



(27.32) 



However, for r large and negative, q(r) is dominated by the interval (27. 14) which 
shrinks only harmonically, and q(r) approaches 0 as 



<7(r) 3 In n 
t 7i In 2 



(27.33) 



So for finite n, q n (T) crosses the t axis at —t = D n , but in the n —> oo limit, the 
q(r) function exhibits a phase transition; q(r) = 0 for t < -Dn, but is a non-trivial 
function of t for -Dh < r. This non-analyticity is rather severe - to get a clearer 
picture, we illustrate it by a few number-theoretic models (the critical circle maps 
case is qualitatively the same). 

An approximation to the "trivial" Farey level thermodynamics is given by the 
"Farey model," in which the intervals fy/Q are replaced by Q~ 2 : 



2" 



Z n (r) = ^ G- T - 



(27.34) 
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Here Qj is the denominator of the ith Farey rational P,/<2,-. For example (see 
figure 27.4), 



Z 2 (l/2) - 4 + 5 + 5+4. 



By the annihilation property (27.38) of the Gauss shift on rationals, the nth Farey 
level sum Z„(-l) can be written as the integral 

Z n (-1) = J dx8{f\x)) = J] 1/|/;..,, A . (0)| , 

and in general 

Z„(t) = J dxC T (0,x), 

with the sum restricted to the Farey level a\ + . . . + a\ — n + 2. It is easily checked 
that f ax ak (0) = (-1) <2iL a y so the Farey model sum is a partition generated by 
the Gauss map preimages of x - 0, i.e., by rationals, rather than by the quadratic 
irrationals as in (27.26). The sums are generated by the same transfer operator, so 
the eigenvalue spectrum should be the same as for the periodic orbit expansion, but 
in this variant of the finite level sums we can can evaluate q{r) exactly for r = k/2, 
k a nonnegative integer. First, one observes that Z^(0) = 2". It is also easy to check 
that Z„(l/2) = >Zi Qi = 2- 3". More surprisingly, Z„(3/2) = £i Q 3 = 54 • l n ~ l . 
A few of these "sum rules" are listed in the table 27.2, they are consequence of 
the fact that the denominators on a given level are Farey sums of denominators on 
preceding levels. 

A bound on D# can be obtained by approximating (27.34) by 
Z„(t) = n 2r + 2 n p 2nT . (27.35) 



exercise 27.3 



In this approximation we have replaced all fy/Q, except the widest interval £\/ n , 
by the narrowest interval ^F n _,/F„ (see (27.15)). The crossover from the harmonic 
dominated to the golden mean dominated behavior occurs at the r value for which 
the two terms in (27.35) contribute equally: 



/lnn\ - In 2 

D n = D + 0\ , D=-— = J2... (27.36) 

n j 2 In p 



For negative t the sum (27.35) is the lower bound on the sum (27.25) , so D is 
a lower bound on D#. 

From a general perspective the analysis of circle maps thermodynamics has 
revealed the fact that physically interesting dynamical systems often exhibit mix- 
tures of hyperbolic and marginal stabilities. In such systems there are orbits that 



irrational - 22sep2000 



ChaosBook.org version 14.4. 1, Apr 21 2013 



CHAPTER 27. IRRATIONALLY WINDING 



563 



T/2 


Z„(t/2)/Z„_,(t/2) 


0 


2 


1 


3 


2 


(5 + VT7)/2 


3 


7 


4 


(5 + VT7)/2 


5 


7 + 4V6 


6 


26.20249 . . . 



Table 27.2: Partition function sum rules for the Farey model. 

stay 'glued' arbitrarily close to stable regions for arbitrarily long times. This is a 
generic phenomenon for Hamiltonian systems, where elliptic islands of stability 
coexist with hyperbolic homoclinic tangles. Thus the considerations of chapter24 
are important also in the analysis of renormalization at the onset of chaos. 



Resume 

The mode locking problem, and the quasiperiodic transition to chaos offer an 
opportunity to use cycle expansions on hierarchical structures in parameter space: 
this is not just an application of the conventional thermodynamic formalism, but 
offers a clue on how to extend universality theory from local scalings to global 
quantities. 



Commentary 

Remark 27.1 The physics of circle maps. Mode-locking phenomenology is re- 
viewed in ref. [6], a more theoretically oriented discussion is contained in ref. [3]. While 
representative of dissipative systems we may also consider circle maps as a crude ap- 
proximation to Hamiltonian local dynamics: a typical island of stability in a Hamiltonian 
2-dimensional map is an infinite sequence of concentric KAM tori and chaotic regions. In 
the crudest approximation, the radius can here be treated as an external parameter £2, and 
the angular motion can be modeled by a map periodic in the angular variable [ 9, 12]. By 
losing all of the 'island-within-island' structure of real systems, circle map models skirt 
the problems of determining the symbolic dynamics for a realistic Hamiltonian system, 
but they do retain some of the essential features of such systems, such as the golden mean 
renormalization [5,9] and non-hyperbolicity in form of sequences of cycles accumulating 
toward the borders of stability. In particular, in such systems there are orbits that stay 
"glued" arbitrarily close to stable regions for arbitrarily long times. As this is a generic 
phenomenon in physically interesting dynamical systems, such as the Hamiltonian sys- 
tems with coexisting elliptic islands of stability and hyperbolic homoclinic tanglees, de- 
velopment of good computational techniques is here of utmost practical importance. 

Remark 27.2 Critical mode-locking set The fact that mode-lockings completely fill 
the unit interval at the critical point has been proposed in refs. [ 3, 14], The proof that the 
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set of irrational windings is of zero Lebesgue measure in given in ref. [15]. 

Remark 27.3 Counting noise for Farey series. The number of rationals in the Farey 
series of order Q is <p(Q), which is a highly irregular function of Q: incrementing Q by 1 
increases <S>(Q) by anything from 2 to Q terms. We refer to this fact as the "Euler noise." 

The Euler noise poses a serious obstacle for numerical calculations with the Farey 
series partitionings; it blocks smooth extrapolations to Q — » oo limits from finite Q data. 
While this in practice renders inaccurate most Farey-sequence partitioned averages, the 
finite Q Hausdorff dimension estimates exhibit (for reasons that we do not understand) 
surprising numerical stability, and the Farey series partitioning actually yields the best 
numerical value of the Hausdorff dimension (27.25) of any methods used so far; for ex- 
ample the computation in ref. [16] for critical sine map (27.1), based on 240 < Q < 250 
Farey series partitions, yields Dh — .87012 + .00001. The quoted error refers to the vari- 
ation of Dh over this range of Q; as the computation is not asymptotic, such numerical 
stability can underestimate the actual error by a large factor. 

Remark 27.4 Farey tree presentation function. The Farey tree rationals can be 
generated by backward iterates of 1/2 by the Farey presentation function [17]: 

fo(x) = x/(l - x) 0 < x < 1/2 

fi{x) = (l-x)/x 1/2 <x< 1 . 

The Gauss shift (27.7) corresponds to replacing the binary Farey presentation function 
branch /o in (27.37) by an infinity of branches 

fa(x) = f\ ° fo^ix) = - - a, 

x 

fab-c(x) = f c °-° fb ° fa(x) . 

A rationale = [a^ao, ... ,«a] is annihilated by the Ath iterate of the Gauss shift, f ai a T -atix) = 
0. The above maps look innocent enough, but note that what is being partitioned is not 
the dynamical space, but the parameter space. The flow described by (27.37) and by its 
non-trivial circle-map generalizations will turn out to be a renormalization group flow 
in the function space of dynamical systems, not an ordinary flow in the state space of a 
particular dynamical system. 

The Farey tree has a variety of interesting symmetries (such as "flipping heads and 
tails" relations obtained by reversing the order of the continued-fraction entries) with as 
yet unexploited implications for the renormalization theory: some of these are discussed 
in ref. [13]. 

An alternative labeling of Farey denominators has been introduced by Knauf [ 7] in 
context of number-theoretical modeling of ferromagnetic spin chains: it allows for a num- 
ber of elegant manipulations in thermodynamic averages connected to the Farey tree hi- 
erarchy. 

Remark 27.5 Circle map renormalization The idea underlying golden mean renor- 
malization goes back to Shenker [12], A renormalization group procedure was formu- 
lated in refs. [8, 10, 11], where moreover the uniqueness of the relevant eigenvalue is 



1 1 

<x< -, 

a— I a 



(27.38) 
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claimed. This statement has been confirmed by a computer-assisted proof [18], and in the 
following we will always assume it. There are a number of experimental evidences for 
local universality, see refs. [19, 20]. 

On the other side of the scaling tale, the power law scaling for harmonic fractions 
(discussed in refs. [2, 13]) is derived by methods akin to those used in describing inter- 
mittency [24]: l/Q cycles accumulate toward the edge of 0/1 mode-locked interval, and 
as the successive mode-locked intervals l/Q, 1 /(Q — 1 ) lie on a parabola, their differences 
are of order Q 3 . 

Remark 27.6 Farey series and the Riemann hypothesis The Farey series thermo- 
dynamics is of a number theoretical interest, because the Farey series provide uniform 
coverings of the unit interval with rationals, and because they are closely related to the 
deepest problems in number theory, such as the Riemann hypothesis [ 25, 26] . The dis- 
tribution of the Farey series rationals across the unit interval is surprisingly uniform - 
indeed, so uniform that in the pre-computer days it has motivated a compilation of an 
entire handbook of Farey series [27]. A quantitative measure of the non-uniformity of the 
distribution of Farey rationals is given by displacements of Farey rationals for P ,72/ e Tq 
from uniform spacing: 

6i = — — , 2= 1,2,---,$(Q) 

0.(0 q.; 

The Riemann hypothesis states that the zeros of the Riemann zeta function lie on the 
s = 1/2 + it line in the complex s plane, and would seem to have nothing to do with 
physicists' real mode-locking widths that we are interested in here. However, there is 
a real-line version of the Riemann hypothesis that lies very close to the mode-locking 
problem. According to the theorem of Franel and Landau [28, 25, 26], the Riemann 
hypothesis is equivalent to the statement that 

2 1*1 = o(Q^) 

Q,<Q 

for all e as Q — » oo. The mode-lockings Ap/g contain the necessary information for 
constructing the partition of the unit interval into the tj covers, and therefore implicitly 
contain the 6, information. The implications of this for the circle-map scaling theory have 
not been worked out, and is not known whether some conjecture about the thermodynam- 
ics of irrational windings is equivalent to (or harder than) the Riemann hypothesis, but the 
danger lurks. 

Remark 27.7 Farey tree partitioning. The Farey tree partitioning was introduced 
in refs. [29, 30, 13] and its thermodynamics is discussed in detail in refs. [16, 17]. The 
Farey tree hierarchy of rationals is rather new, and, as far as we are aware, not previously 
studied by number theorists. It is appealing both from the experimental and from the 
golden-mean renormalization point of view, but it has a serious drawback of lumping 
together mode-locking intervals of wildly different sizes on the same level of the Farey 
tree. 
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Remark 27.8 Local and global universality. Numerical evidences for global universal 
behavior have been presented in ref. [ ]. The question was reexamined in ref. [16], where 
it was pointed out how a high-precision numerical estimate is in practice very hard to 
obtain. It is not at all clear whether this is the optimal global quantity to test but at least 
the Hausdorff dimension has the virtue of being independent of how one partitions mode- 
lockings and should thus be the same for the variety of thermodynamic averages in the 
literature. 

The formula (27.30), linking local to global behavior, was proposed in ref. [ 1]. 

The derivation of (27.30) relies only on the following aspects of the "hyperbolicity 
conjecture" of refs. [13, 21, 22, 23]: 

1 . limits for Shenker <5's exist and are universal. This should follow from the renor- 
malization theory developed in refs. [8, 10, 1 1, 18], though a general proof is still 
lacking. 

2. S p grow exponentially with n p , the length of the continued fraction block p. 

3. Sp for p = a\a%...n with a large continued fraction entry n grows as a power 
of n. According to (27.14), lim^oo 5 P oc « 3 . In the calculation of ref. [I] the 
explicit values of the asymptotic exponents and prefactors were not used, only the 
assumption that the growth of 6 P with n is not slower than a power of n. 

Remark 27.9 Farey model. The Farey model (27.33) has been proposed in ref. [16]; 
though it might seem to have been pulled out of a hat, the Farey model is as sensible 
description of the distribution of rationals as the periodic orbit expansion ( 27.26). 

Remark 27.10 Symbolic dynamics for Hamiltonian rotational orbits. The rotational 
codes of ref. [40] are closely related to those for maps with a natural angle variable, for 
example for circle maps [37, 39] and cat maps [41]. Ref. [40] also offers a systematic rule 
for obtaining the symbolic codes of "islands around islands" rotational orbits [ ]. These 
correspond, for example, to orbits that rotate around orbits that rotate around the elliptic 
fixed point; thus they are defined by a sequence of rotation numbers. 

A different method for constructing symbolic codes for "islands around islands" was 
given in refs. [46, 44]; however in these cases the entire set of orbits in an island was 
assigned the same sequence and the motivation was to study the transport implications for 
chaotic orbits outside the islands [43, 45]. 



[ 



Exercises 

27.1. Mode-locked intervals. Check that when k + 0 the small k the width of Ao/i is an increasing function of k. 

interval Ap/g have a non-zero width (look for instance 
at simple fractions, and consider k small). Show that for 
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27.2. Bounds on Hausdorff dimension. By making use of 
the bounds (27.17) show that the Hausdorff dimension 
for critical mode lockings may be bounded by 

2/3 <D H < .9240 . . . 

27.3. Farey model sum rules. Verify the sum rules re- 
ported in table 27.2. An elegant way to get a number of 
sum rules for the Farey model is by taking into account 
an lexical ordering introduced by Contucci and Knauf, 
see ref. [31]. 

27.4. Metric entropy of the Gauss shift. Check that 
the Lyapunov exponent of the Gauss map ( 27.7) is given 
by 7T 2 /61n2. This result has been claimed to be rele- 
vant in the discussion of "mixmaster" cosmologies, see 
ref. [33]. 

27.5. Refined expansions. Show that the above estimates 
can be refined as follows: 

F(z,2) ~ f(2) + (l-z)log(l-z)-(l-z) 

and 

f( Z ,s) ~ as) + m-s)d-zy- 1 -s(s)a-z) 



for s € (1,2) (S(s) being expressed by a converging 
sum). You may use either more detailed estimate for 
({s, a) (via Euler summation formula) or keep on sub- 
tracting leading contributions [34]. 

27.6. j„ and a cr . Look at the integration region and how it 
scales by plotting it for increasing values of n. 

27.7. Estimates of the Riemann zeta function. Try to 

approximate numerically the Riemann zeta function for 
s — 2, 4, 6 using different acceleration algorithms: check 
your results withrefs. [35, 36]. 

27.8. Farey tree and continued fractions I. Consider the 
Farey tree presentation function / : [0, 1] i-» [0, 1], such 
that if / = [0, 1/2) and J = [1/2, l],f\, = x/(l-x) and 
f\ j = (1 - x)/x. Show that the corresponding induced 
map is the Gauss map g(x) = 1/x - [1/x]. 

27.9. Farey tree and continued fraction II. (Lethal weapon 
II). Build the simplest piecewise linear approxima- 
tion to the Farey tree presentation function (hint: sub- 
stitute first the righmost, hyperbolic branch with a lin- 
ear one): consider then the spectral determinant of the 
induced map g , and calculate the first two eigenvalues 
besides the probability conservation one. Compare the 
results with the rigorous bound deduced in ref. [ 17]. 
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Qunadry: all these cycles, but how many do I need?. Any physical system suffers 
background noise, any numerical prediction suffers computational roundoff noise, 
and any set of equations models nature up to a given accuracy, since degrees of 
freedom are always neglected. If the noise is weak, the short-time dynamics is not altered 

significantly: short periodic orbits of the deterministic flow still partition coarsely the state 
space. 
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Noise 



He who establishes his argument by noise and command 
shows that his reason is weak. 
— M. de Montaigne 

This chapter (which reader can safely skip on the first reading) is about noise, 
how it affects classical dynamics, and the ways it mimics quantum dynam- 
ics. 

Why - in a monograph on deterministic and quantum chaos - start discussing 
noise? First, in physical settings any dynamics takes place against a noisy back- 
ground, and whatever prediction we might have, we have to check its robustness to 
noise. Second, as we show in this chapter, to the leading order in noise strength, 
the semiclassical Hamilton-Jacobi formalism applies to weakly stochastic flows 
in toto. As classical noisy dynamics is more intuitive than quantum dynamics, 
understanding effects of noise helps demystify some of the formal machinery of 
semiclassical quantization. Surprisingly, symplectic structure emerges here not 
as a deep principle of mechanics, but an artifact of the leading approximation to 
quantum/noisy dynamics. Third, the variational principle derived here turns out 
to be a powerful tool for determining periodic orbits, see chapter29. And, last but 
not least, upon some reflection, the whole enterprize of replacing deterministic 
trajectories by deterministic evolution operators, chapters 16 to 20, seems fatally 
flowed; if we have given up infinite precision in specifying initial conditions, why 
do we alow ourselves the infinite precision in the specification of evolution laws, 
i.e., define the evolution operator by means of the Dirac delta function <5(j -/(*))? 
It will be comforting to learn that the deterministic evolution operators survive un- 
scathed, as the leading approximation to the noisy ones in the limit of weak noise. 

Another key result derived here is the evolution law (28.45) for the covariance 
matrix Q a of a linearly evolved Gaussian density, 

Qa+l = M a Q a M a T + A a . 
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To keep things simple we shall describe covariance evolution in the discrete time 
dynamics context, but the results apply both to the continuous and discrete time 
flows. The most important lesson, however, is that physicist's Brownian diffusion 
intuition -that the effect of the noise is to spread out the trajectory as y/i- is wrong: 
In nonlinear dynamics the noise is always local, determined by balancing local 
nonlinear dynamics against the memory of the noise past. 



We start by deriving the continuity equation for purely deterministic, noiseless 
flow, and then incorporate noise in stages: diffusion equation, Langevin equation, 
Fokker-Planck equation, stochastic path integrals, Hamilton- Jacobi formulation. 



The large body of accrued wisdom on the subject of flows called fluid dynamics 
is about physical flows of media with continuous densities. On the other hand, the 
flows in state spaces of dynamical systems frequently require more abstract tools. 
To sharpen our intuition about those, it is helpful to outline the more tangible fluid 
dynamical vision. 

Consider first the simplest property of a fluid flow called material invariant. 
A material invariant I(x) is a property attached to each point x that is preserved 
by the flow, I(x) = I(f(x)); for example, at point x{t) = f(x)) a green particle 
(more formally: a passive scalar) is embedded into the fluid. As I(x) is invariant, 
its total time derivative vanishes, t{x) = 0. Written in terms of partial derivatives 
this is the conservation equation for the material invariant 



Let the density of representative points be p(x, t). The manner in which the flow 
redistributes I(x) is governed by a partial differential equation whose form is rel- 
atively simple because the representative points are neither created nor destroyed. 
This conservation property is expressed in the integral statement 



where V is an arbitrary volume in the state space M, dV is its surface, h is its out- 
ward normal, and repeated indices are summed over throughout. The divergence 
theorem turns the surface integral into a volume integral, 



28.1 Deterministic transport 



(E.A. Spiegel and P. Cvitanovic) 



d t l + vdl = 0. 



(28.1) 



d t I dxpl - - 
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where dj is the partial derivative operator with respect to Since the integration 
is over an arbitrary volume, we conclude that 

d t (pl) + dtiplvd = 0 . (28.2) 

The choice 1=1 yields the continuity equation for the density: 

d t p + diipv,) = 0 . (28.3) 

Here we have used the language of fluid mechanics to ease the visualization, but, 
as we have seen in our previous derivation of the continuity equation (16.25), any 
deterministic state space flow satisfies the continuity equation in any dimension. 

Why -even though the dynamics is nonlinear- is this equation linear? As each 
deterministic orbit is distinct and intersects no other orbit, no 'particles' are cre- 
ated or destroyed, they are non-interacting, hence description in terms of linear 
evolution operators possible. 



28.2 Brownian diffusion 



Consider tracer molecules, let us say big, laggardly green molecules, embedded 
in a denser gas of light molecules. Assume that the density of tracer molecules p 
compared to the background gas density is low, so we can neglect green-green col- 
lisions. Each green molecule, jostled by frequent collisions with the background 
gas, executes its own Brownian motion. The molecules are neither created nor 
destroyed, so their number within an arbitrary volume V changes with time only 
by the current density y, flow through its surface dV (with h its outward normal): 

d t I dxp = - I dcrhijj. (28.4) 
Jv JdV 

The divergence theorem turns this into the conservation law for tracer density: 

dtp + diji = 0. (28.5) 

The tracer density p is denned as the average density of a 'material particle,' av- 
eraged over a subvolume large enough to contain many green (and still many 
more background) molecules, but small compared to the macroscopic observa- 
tional scales. What is p. If the density is constant, on the average as many 
molecules leave the material particle volume as they enter it, so a reasonable phe- 
nomenological assumption is that the average current density {not the individual 
particle current density pv,- in (28.3)) is driven by the density gradient 

ji = -D^- - (28.6) 

OXj 
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This is the Fick law, with the diffusion constant D a phenomenological parameter. 
Substituting this current into (28.5) yields the diffusion or heat equation, 

<Lp{x,t) = D^p{x,t) . (28.7) 
ot ox 1 

More generally, diffusion is described by a space- and time-dependent symmet- 
ric diffusion tensor A,y = Ay,-, with j, = -^Ajjdjp, leading to the anisotropic 
diffusion equation 

d !P (x, t) = X -di (Aijix) d jP (x, t)) . (28.8) 

For sake of streamlining the argument we have assumed above that diffusion in d 
dimensions is homogenous and iso tro pic, A(x) = 2 D 1. In practice, the diffusion 
tensor is almost always anisotropic: for example, physicist's Brownian diffusion 
is a flow in the 6-dimensional {configuration, velocity} phase space, with white 
noise probability distribution exp(-v 2 /2^r), modeling random force kicks ap- 
plied only to the 3 velocity variables v. In this case one thinks of diffusion coeffi- 
cient D = kgT/2 as temperature. 



28.2.1 Heat kernel 



Fourier transforming the heat equation (28.7), 



d_ 

df 



/die 
—p(k,t)e ikx , (28.9) 
2n 



integrating, 



Pix,t) = J d ±p(k,0)e^- D ^, 



and Fourier transforming back we obtain an exact solution of the heat equation in 
terms of an initial Dirac delta density distribution, p(x, 0) = 8{x - %y), 

p(x,t) = £ FP (x,t;x 0 ,0) - l —r 7 re J± ^, (28.10) 

(AnDt) 1 

in the spirit of the quantum free particle propagation of sect. 33.2.2. The average 
distance covered in time t obeys the diffusion formula 

((* - x 0 ) 2 ) t - J dxp(x, t)(x - x 0 ) 2 = 2dDt . (28.11) 

The classical Einstein formula describes 3-dimensional Brownian motion; here 
the diffusion takes place in the dynamical state space of dimension d. 
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28.2.2 Random walks 

So far we have considered the evolution of the density of tracer molecules. One 
can alternatively consider a J-dimensional random walk of an individual tracer 
molecule kicked by a stochastic term, 

dx 

— = ^(t). (28.12) 
dt 

A way to make sense of £(t) is to first construct the probability distribution for 
additive noise £ at short but finite time steps 8t, with t, 1+ i = t n + 8t, and the 
particle x n = x(t n ) at time t n executing a random walk, x n+ \ = x n + g(t n ) , where x 
is a cf-dimensional state vector, and x„j is its jth component at time n. The natural 
choice is that probability that the trajectory reaches x^\ is given by a normalized 
Gaussian 



J~fpi x n+iit n +i',x n ,t n ) — ) _ __ exp 



1 

V(27T(5T) rf detA 



1 T 1 

' ~ c- (mi T^n) 
LOT A 



, (28.13) 



f„ = x n+ \ - x n , characterized by zero mean and the diffusion tensor (co variance 
matrix), 

= 0 , (f/(f m ) f/ft,)) - StAij 5 nm , (28. 14) 

where (• • •) stands for ensemble average over many realizations of the noise, and 
the superfix T indicates a transpose. As the time discretization St is arbitrary, 
the diffusing cloud of noisy trajectories should be described by a distribution that 
keeps its form as 5t —> 0. Indeed, the semigroup property of a Gaussian kernel, 

£ FP (x,t;x",t") = J~dx£, FP (x, t; x, t') £, FP (x , t'; x" , t") , (28.15) 

ensures that the distribution keeps its form under successive diffusive steps. 
£ FP (x, t; xo, 0) describes the diffusion at any time, including the integer time in- 
crements {t n } = {8t, 26t, ■ ■ ■ ,u6t, ■ ■ •}, and thus provides a bridge between the 
continuous and discrete time formulations of noisy evolution. 



Example 28.1 Random walk in one dimension The white noise = x„+i - x„ 
for a 1 -dimensional diffusion process is a normally distributed random variable, with 
standard normal (i.e., Gaussian) probability distribution function, 



£ FP (x, t; x', t') 1 exp 



(x - x'f 



(28.16) 



\D(t-t') 

of mean 0, variance 2D(t-t'), and standard deviation yj2D(t - Fj, uncorrected in time: 

(x n +i - x„) = 0 , <(x m+ i - x m )(x n+ i - x,,)) = 2D 6t6,„„ . (28. 17) 

section 28.4 
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28.3 Noisy trajectories: Continuous time 



The connection between path integration and Brownian 
motion is so close that they are nearly indistinguishable. 
Unfortunately though, like a body and its mirror image, 
the sum over paths for Brownian motion is a theory hav- 
ing substance, while its path integral image exists mainly 
in the eye of the beholder. 
— L. S. Schulman 

(P. Cvitanovic and D. Lippolis) 

So far we have considered tracer molecule dynamics which is purely Brownian, 
with no deterministic "drift." Consider next a J-dimensional deterministic flow 
x - v(x) perturbed by a stochastic term f (f), 



at 



(28.18) 



where the deterministic velocity field v(x) is called 'drift' in the stochastic litera- 
ture, and g(t) is additive noise, uncorrected in time. We shall refer to equations 
of this type as Langevin equations. The more general case of a tensor A(x) which 
is a state space position dependent but time independent can be treated along the 
same lines. In this case the stochastic flow (28.18) is written as 



dx = v(x) dt + o-{x) d£(t) , = 1 6 nm , A 



cro" 



(28.19) 



cr(x) is called the 'diffusion matrix', and the noise is referred to as 'multiplicative'. 
Explicit time dependence in A(x, t) would take us into world of non-autonomous, 
externally driven flows, beyond the comfort zone of ChaosBook.org. 

As in (28.12), a way to make sense of (28.18) is to first construct the proba- 
bility distribution for additive noise £ at a short but finite time 8t. In time 5t the 
deterministic trajectory advances by v(Xn) St. As 6t is arbitrary, it is desirable that 
the diffusing cloud of noisy trajectories is given by a distribution that keeps its 
form as 8t —> 0. This holds if the noise is Brownian, i.e., the probability that the 
trajectory reaches x n+ \ is given by a normalized Gaussian (28.13), 



£ FP (x n+ i,5T;x n ,0) = -^exp 



1 T J_ 

2dT A„ 



(28.20) 



Here = 5x n - v(x„)6t, the deviation of the noisy trajectory from the deter- 
ministic one, can be viewed either in terms of velocities {x, v(x)} (continuous time 
formulation), or finite time maps \x n — > x n+ i,x n — > f ST (x„)} (discrete time formu- 
lation), 



5x n = x„+i - x n - x n 8t , f T (x n ) -x n - v(x n ) 5t , 



(28.21) 
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where 



{xq, X\, • • • , x n , • ■ • , Xk) = {x(0), x(6t), ■ ■ ■ , x(nSr), • • • , x(t)} 



(28.22) 



is a sequence of k + 1 points x n = x(t n ) along the noisy trajectory, separated by 
time increments 6t = t/k. 

The phenomenological Fick law current (28.6) is now a sum of two compo- 
nents, the material particle deterministic drift v(x) and the weak noise term 



ji = Vjp - D— , 

OXi 



Vip- -Aij(x)djp(x,i) 



(28.23) 



with the full, anisotropic and space-dependent version indicated in [• • •]. Substi- 
tuting this j into (28.5) yields the Fokker-Planck equation 



d t p + diipvi) = Dd 2 p, 



= ^d i {A ij (x)d j p(x,t)) 



(28.24) 



The left hand side, dp/dt = d t p + d • (pv), is deterministic, with the continuity 
equation (28.3) recovered in the weak noise limit D — > 0. The right hand side 
describes the diffusive transport in or out of the material particle volume. If the 
density is lower than in the immediate neighborhood, the local curvature is posi- 
tive, d p > 0, and the density grows. Conversely, for negative curvature diffusion 
lowers the local density, thus smoothing the variability of p. Where is the density 
going globally? 

If the system is bound, the probability density vanishes sufficiently fast outside 
the central region, p(x, t) —> 0 as \x\ — > oo, and the total probability is conserved 



dxp(x, t) — 1 . 



Any initial density p(x, 0) is smoothed by diffusion and with time tends to the 
natural measure, the invariant density 



p 0 (x) = lim p(x, t) , 

t~>oo 



(28.25) 



an eigenfunction p(x,t) = e 5 'po(^) of the time-independent Fokker-Planck equa- 
tion 



(diVi-Dd 2 + s a )p a = 0, 



(28.26) 



with vanishing eigenvalue sq = 0. Provided the noiseless classical flow is hyper- 
bolic, in the vanishing noise limit the leading eigenfunction of the Fokker-Planck 
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equation tends to natural measure (16.17) of the corresponding deterministic flow, 
the leading eigenvector of the Perron-Frobenius operator. 

If the system is open, there is a continuous outflow of probability from the 
region under study, the leading eigenvalue is contracting, $ < 0, and the density 
of the system tends to zero. In this case the leading eigenvalue $ of the time- 
independent Fokker-Planck equation (28.26) can be interpreted by saying that a 
finite density can be maintained by pumping back probability into the system at 
a constant rate y = -so. The value of y for which any initial probability density 
converges to a finite stationary equilibrium density is called the escape rate. In 
the noiseless limit this coincides with the deterministic escape rate (17.36). 

The distribution (28.13) describes how an initial density of particles concen- 
trated in a Dirac delta function at x n spreads in time 5t. In the Fokker-Planck 
description individual noisy Langevin trajectories (28.18) are replaced by the evo- 
lution of the density of noisy trajectories. The finite time Fokker-Planck evolution 
p(x, t) - ^L'pp o pj (x, 0) of an initial density p(xq, 0) is obtained by a sequence of 
consecutive short-time steps (28.13) 

£ FP (x k , t; xq, 0) = J [dx] exp Tj [x "^ ~ f^^A > ( 2 8- 27 ) 

where t - kSr, and the Gaussian normalization factor in (28.13) is absorbed into 
intermediate integrations by defining 

m ■ Of 

N„ = (2nST) d/2 [det A(x n )] 1/2 (anisotropic diffusion tensor A) 

- {ADStY 12 (isotropic diffusion, A(jc) - 2D 1) . (28.28) 

As D —> 0, the distribution tends to the noiseless, deterministic Dirac delta func- 
tion Perron-Frobenius operator (16.10). The stochastic flow (28.18) can now be 
understood as the continuous time, St — > 0 limit, with the velocity noise £(t) a 
Gaussian random variable of zero mean and covariance matrix 

(1/(0 ) = 0 , {hit) f/O) = Ay S(t - t') . (28.29) 

It is worth noting that the continuous time flow noise f(/) in (28.18) and (28.29) is 
dimensionally a velocity [ac]/[f], as £ Fp (x n+ i,dT; x n , 0) is a probability density for 
velocity while the discrete time noise in (28.13), (28.14) is dimensionally a 
length [x], as p(x, t) is a probability density for position x. The important point is 
that the same diffusion tensor A(x) describes the diffusion both in the configuration 
space and the velocity space. 
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The continuous time limit of (28.27), St = t/k — > 0, defines formally the 
Fokker-Planck evolution operator 

£ FP (x, t; xq, 0) = J [dx] exp j-^ [x(r) - v(*(t))] 2 Jt| (28.30) 



as a stochastic path (or Wiener) integral for a noisy flow, and the associated con- 
tinuous time Fokker-Planck (or forward Kolmogorov) equation (28.24) describes 
the time evolution of a density of noisy trajectories. We have introduced noise 
phenomenologically, and used the weak noise assumption in retaining only the 
first derivative of p in formulating the Fick law (28.6) and including noise addi- 
tively in (28.23). The St — > 0 limit and the proper definition of x(t) are delicate 
issues of no import for the applications studied here. A full theory of stochastic 
ODEs is much subtler, but this will do for our purposes. 

The exponent 

4D St 1 J 4D 

can be interpreted as a cost function which penalizes deviation of the noisy trajec- 
tory dx from its deterministic prediction v St, or, in the continuous time limit, the 
deviation of the noisy trajectory tangent x from the deterministic velocity field v. 
Its minimization is one of the most important tools of the optimal control theory, 
with velocity x(t) along a trial path varied with aim of minimizing its distance to 
the target v(jc(t)). 



28.4 Noisy maps: Discrete time 



(P. Cvitanovic and D. Lippolis) 



For pedagogical reasons we shall often find it convenient to consider a noisy dis- 
crete time dynamical system 

x n +i = /Ui) + %n , (28.32) 

where x is a <i-dimensional state vector, and Xnj is its 7th component at time n. 
In the Fokker-Planck description individual noisy trajectories are replaced by the 
evolution of the density of noisy trajectories, with the = x n +\ -f{x n ) probability 
distribution of zero mean and diffusion tensor, and the time increment in (28.14) 
set to St - 1 , 

(fa) = 0 , (fa Cj) = \j(Xn) Snm ■ (28.33) 
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As we shall show, in nonlinear dynamics the noise is never isotropic and/or ho- 
mogeneous. Even if the infinitesimal time step noise (28.13) covariance matrix 
in (28.19) were independent of the state space position x, this cannot be true of 
A(jc) for the discrete time flow (28.32) obtained by the Poincare section reduction 
method of sect. 3.1, as the return times (3.1) and the noise accumulated along the 
corresponding trajectory segments depend on the starting Poincare section point. 
Indeed, as we shall argue in sect. 28.5, in nonlinear dynamics all noise is local. 
As long as the noise distribution at x is autonomous (not explicitly dependent on 
time) the stochastic flow (28.32) can be written as x n+ \ = x n + cr{x n )i; n , where 
A = o~<x T , and cr{x) is the multiplicative noise diffusion matrix defined in (28.19). 



The action of discrete one-time step Fokker-Planck evolution operator on the 
density distribution p at time k, 



Pk+l(f) = [-C FP Pk](y) = J dx£ FP 



_ (y,x)pk(x) 

£pp (y, x) = * 4(y-/(*)) T ^-/M), (28.34) 
N(x) 

is centered on the deterministic step f(x) and smeared out diffusively by noise. 
Were diffusion uniform and isotropic, A(jc) = 2D 1, the Fokker-Planck evolution 
operator would be proportional to exp(-{y - /(;c)} 2 /2A), i.e., the penalty for stray- 
ing from the deterministic path is just a quadratic error function. The kth iterate 
of JL FP {xk\ xq) - L FP {xk, t; xq, 0) is a J-dimensional path integral over the k - 1 
intermediate noisy trajectory points, 

£ k FP (x k ;x 0 ) = J[j x ] e -3 2«fe + i-/fc) T )A(bfe + i-/te)) ) (28.35) 

where the Gaussian normalization factor in (28.34) is absorbed into intermediate 
integrations by defining 

[dx] = Y\ -jf , N n = y^det A(x n ) . (28.36) 

We shall also need to determine the effect of noise accumulated along the 
trajectory points preceding x. As the noise is additive forward in time, one cannot 
simply invert the Fokker-Planck evolution operator; instead, the past is described 
by the adjoint Fokker-Planck evolution operator, 

Pk-i(x) = [£ f FP p k ](x)= f[dy] e-^-f^ T i^-f^p k {y), (28.37) 

which transports a density concentrated around the point f(x) to a density con- 
centrated around the previous point x and adds noise to it. In the deterministic, 
vanishing noise limit this is the Koopman operator (G.l). 
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The Fokker-Planck evolution operator (28.34) is non-hermitian and non-unitary. 
For example, if the deterministic flow is contracting, the natural measure (the lead- 
ing right eigenvector of the evolution operator) will be concentrated and peaked, 
but then the corresponding left eigenvector has to be broad and flat, as backward 
in time the deterministic flow is expanding. We shall denote by f\ t the right eigen- 
vectors of -C FP , and by p a its left eigenvectors, i.e., the right eigenvectors of the 
adjoint operator Jj Fp . 



28.5 All nonlinear noise is local 

I ain't gonna work for Maggie's pa no more 
No, I ain't gonna work for Maggie's pa no more 
Well, he puts his cigar 
Out in your face just for kicks 

— Bob Dylan, Maggie 's Farm 

(P. Cvitanovic and D. Lippolis) 

Our main goal in this section is to convince the reader that the diffusive dynamics 
of nonlinear flows is fundamentally different from Brownian motion, with the flow 
inducing a local, history dependent noise. In order to accomplish this, we gener- 
alize here the notion of invariant deterministic recurrent solutions, such as fixed 
points and periodic orbits, to noisy flows. While a Langevin trajectory (28.32) 
can never return exactly to the initial point and thus cannot ever be periodic, in 
the Fokker-Planck formulation (28.35) a recurrent motion can be defined as one 
where a peaked distribution returns to the initial neighborhood after time n. Re- 
currence so defined not only coincides with the classical notion of a recurrent orbit 
in the vanishing noise limit, but it also enables us to derive exact formulas for how 
this local, history dependent noise is to be computed. 

As the function x n+ \ - f{x n ) is a nonlinear function, in general the path inte- 
gral (28.35) can only be evaluated numerically. In the vanishing noise limit the 
Gaussian kernel sharpens into the Dirac ^-function, and the Fokker-Planck evo- 
lution operator reduces to the deterministic Perron-Frobenius operator (16.10). 
For weak noise the Fokker-Planck evolution operator can be evaluated perturba- 
tively as an asymptotic series in powers of the diffusion constant, centered on the 
deterministic trajectory. Here we retain only the linear term in this series, which 
has a particulary simple dynamics given by a covariance matrix evolution formula 
(see (28.45) below) that we now derive. 

We shift local coordinates labeled at time 'd to the deterministic trajectory 
{. . . , x-i, xq, xi, X2, . . . , } centered coordinate frame x - x a + z a , Taylor expand 

f(x) = fa{Za) - x a +i + M a z a H , and approximate the noisy map (28.32) by its 

linearization, 

Z, + l = M aZa + & , M/yQc) = dfi/dxj , (28.38) 
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with the deterministic trajectory points at Za - z a +i - 0, and M a - M{x a ) the 
one time step Jacobian matrix. The corresponding linearized Fokker-Planck evo- 
lution operator (28.34) action on density p a {z a ) = p(x a +z a , a) is given in the local 
coordinates by 

J dZa-C a FP {Z a+ l,Za)Pa{Za) (28.39) 

by the linearization (28.38) centered on the deterministic trajectory 

£ a FP {Za + UZa) = i e -3(- + l-^) T ^fe +l -^). (28.40) 

The superscript 'a' in £ c Fp distinguishes the local, linearized Fokker-Planck evo- 
lution operator coordinate frame Za - x - x a centered on the deterministic trajec- 
tory point x a from the full global evolution operator (28.35), in global coordinate 
system x. 

The kernel of the linearized Fokker-Planck evolution operator (28.40) is a 
Gaussian. As a convolution of a Gaussian with a Gaussian is again a Gaussian, 
we investigate the action of the linearized Fokker-Planck evolution operator on a 
normalized, cigar-shaped Gaussian density distribution 



Pa(z) = -^e~^^ z , C a = (2nf 2 (det Q a ) l/2 , (28.41) 



and the action of the linearized adjoint Fokker-Planck evolution operator on den- 
sity 

Paiz) = -^"^i Z , C a = (2;r) rf/2 (det Q a ) 1/2 , (28.42) 

also centered on the deterministic trajectory, with strictly positive [dxd] covari- 
ance matrices Q, Q. Label V plays a double role, and {a + I, a} stands both for 
the {next, initial} space partition and for the times the trajectory lands in these 
partitions. The linearized Fokker-Planck evolution operator (28.40) maps the 
Gaussian p a (z a ) into the Gaussian 

Pa+l ( Za+l ) = r [&Je 4f(-i-^) T ^(- + i-^)+^ T ^^ (28.43) 
t'a J 

one time step later. Likewise, linearizing the adjoint Fokker-Planck evolution 
operator (28.37) around the x a trajectory point yields: 



- -\[{Za+l-M a Za) T {Za+l-M a Za) + Za+l T Za+l\ 

Pa(Za) = - — \[dz a+ i\e (28.44) 
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Completing the squares, integrating and substituting (28.41), respectively (28.42) 
we obtain the formula for covariance matrix evolution forward in time, 

Q a+ i = M a Q a M a T + A a . (28.45) 

In the adjoint case, the evolution of the Q is given by 

M a Q a M T a = Q a+1 + A a . (28.46) 

The two covariance matrices differ, as the adjoint evolution Q a is computed by 
going backwards along the trajectory. These covariance evolution rules are the 
basis of all that follows. 

Think of the initial covariance matrix (28.41) as an error matrix describing 
the precision of the initial state, a cigar-shaped probability distribution #(z a ). In 
one time step this density is deterministicaily advected and deformed into density 
with covariance MQM T , and then the noise A is added: the two kinds of inde- 
pendent uncertainties add up as sums of squares, hence the covariance evolution 
law (28.45), resulting in the Gaussian ellipsoid whose widths and orientation are 
given by the singular values and singular vectors (6.13) of the covariance matrix. 
After n time steps, the variance Qa is built up from the deterministicaily propa- 
gated M n a Q a -. n M n a T initial distribution, and the sum of noise kicks at intervening 
times, M^A a -kMa T , also propagated deterministicaily. 

The pleasant surprise is that the evaluation of this noise requires no Fokker- 
Planck PDE formalism. The width of a Gaussian packet centered on a trajectory 
is fully specified by a deterministic computation that is already a pre-computed 
byproduct of the periodic orbit computations; the deterministic orbit and its linear 
stability. We have attached label 'a' to \, = A(x a ) in (28.45) to account for 
the noise distributions that are inhomogeneous, state space dependent, but time 
independent multiplicative noise. 



28.6 Weak noise: Hamiltonian formulation 

All imperfection is easier to tolerate if served up in small 
doses. 

— Wislawa Szymborska 

(G. Vattay and P. Cvitanovic) 

In the spirit of the WKB approximation (to be fully developed in chapter 32), we 
shall now study the evolution of the probability distribution by rewriting it as 

p(x, t) = e ^ R{xA . (28.47) 
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The time evolution of R is given by 



d,R + vdR + {OR) 2 = Ddv + Dd 2 R . 



Consider now the weak noise limit and drop the terms proportional to D. The 
remaining equation 

d,R + H(x, dR) = 0 

is known as the Hamilton- Jacobi equation . The function R can be interpreted as 
the Hamilton's principal function, corresponding to the Hamiltonian 



H(x,p) = pv(x) + p 2 /2, 



with the Hamilton's equations of motion 
x = d p H = v + p 

p = -d x H = —A T p , (28.48) 
where A is the stability matrix (4.3) 

The noise Lagrangian is then 



U**) = *-p-H = ^-«eif. (28.49) 

We have come the full circle - the Lagrangian is the exponent of our assumed 
Gaussian distribution (28.31) for noise £ 2 = [x - v(x)] 2 . What is the meaning 
of this Hamiltonian, Lagrangian? Consider two points xq and x. Which noisy 
path is the most probable path that connects them in time t7 The probability of a 
given path f is given by the probability of the noise sequence g(t) which generates 
the path. This probability is proportional to the product of the noise probability 
functions (28.31) along the path, and the total probability for reaching x from xj 
in time t is given by the sum over all paths, or the stochastic path integral (Wiener 
integral) 



P(x,x 0 ,t) ~ J]Y\p(t{Tj),6Tj)= fyidfJ^L) e-^r, 

- iZ^-^Jf*^)' (28 ' 50) 
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where cjt, = t; - t,-, and the normalization constant is 




The most probable path is the one maximizing the integral inside the exponential. 
If we express the noise (28.18) as 



By the standard arguments, for a given x, x" and t the probability is maximized by 
a solution of Hamilton's equations (28.48) that connects the two points xq — > x' 
in time t. The solution is a bit boring: x = v, p = 0 , and lives in the initial, 
cf-dimensional state space, so not much is to be made of this surprising appearance 
of Hamiltonians. 



Resume 

When a deterministic trajectory is smeared out under the influence of Gaussian 
noise of strength D, the deterministic dynamics is recovered in the weak noise 
limit D —* 0. The effect of the noise can be taken into account by adding noise 
corrections to the classical trace formula. 

Symplectic structure emerges here not as a deep principle of mechanics, but 
an artifact of the leading approximation to quantum/noisy dynamics, not respected 
by higher order corrections. The same is true of semiclassical quantum dynamics; 
higher corrections do not respect canonical invariance. 



Remark 28.1 A brief history of noise. The theory of stochastic processes is a vast 
subject, starting with the Laplace 1810 memoir [42], spanning over centuries, and over 
disciplines ranging from pure mathematics to impure finance. The presentation given here 
is based on the Cvitanovic and Lippolis 2012 Maribor lectures [1]. The material reviewed 
is standard [2, 3, 43], but needed in order to set the notation for what is new here, the role 
that local Fokker-Planck operators play in defining stochastic neighborhoods of periodic 
orbits. We enjoyed reading van Kampen classic [2], especially his railings against those 
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who blunder carelessly into nonlinear landscapes. Having committed this careless chapter 
to print, we shall no doubt be cast to a special place on the long list of van Kampen's sin- 
ners (and not for the first time, either). A more specialized monograph like Risken's [ 3] 
will do just as well. Schulman's monograph [12] contains a very readable summary of 
Kac's [ 1 3] exposition of Wiener's integral over stochastic paths. The standard Langevin 
equation [40] is a stochastic equation for a Brownian particle, in which one replaces the 
Newton's equation for force by two counter-balancing forces: random accelerations £(f) 
which tend to smear out a particle trajectory, and a damping term which drives the ve- 
locity to zero. In this context D is Einstein diffusion constant, and (28.1 1) is the Einstein 
diffusion formula [41]. Here we denote by 'Langevin equation' a more general family 
of stochastic differential equations (28.18) with additive or multiplicative [47, 48] weak 
noise. Noisy discrete time dynamical systems are discussed in refs. [60, 61, 62]. 

In probabilist literature [58] the differential operator -V • (v(x)p(x, ?)) + D V 2 p(x, f) is 
called 'Fokker-Planck operator;' here we reserve the term 'Fokker-Planck evolution oper- 
ator' for the finite time, 'Green's function' integral operator (28.30), i.e., the stochastic 
path (Wiener) integral [53, 54, 3] for a noisy flow, with the associated continuous time 
Fokker-Planck [2, 3, 55] (or forward Kolmogorov) equation (28.24). 

The cost function (28.3 1) appears to have been first introduced by Wiener as the exact 
solution for a purely diffusive Wiener-Levy process in one dimension, see (28.16). On- 
sager and Machlup [19, 24] use it in their variational principle to study thermodynamic 
fluctuations in a neighborhood of single, linearly attractive equilibrium point (i.e., without 
any dynamics). It plays important role in the optimal control theory [63, 64]. Gaussians 
are often rediscovered, so Onsager-Machlup seminal paper, which studies the same at- 
tractive linear fixed point, is in literature often credited for being the first to introduce a 
variational method -the "principle of least dissipation"- based on the Lagrangian of form 
(28.49). They, in turn, credit Rayleigh [ 20] with introducing the least dissipation principle 
in hydrodynamics. Onsager-Machlup paper deals only with a finite set of linearly damped 
thermodynamic variables, and not with a nonlinear flow or unstable periodic orbits. 

Gaspard [23] derives a trace formula for the Fokker-Planck equation associated with 
ltd stochastic differential equations describing noisy time-continuous nonlinear dynamical 
systems. In the weak-noise limit, the trace formula provides estimations of the eigenval- 
ues of the Fokker-Planck operator on the basis of the Pollicott-Ruelle resonances of the 
noiseless deterministic system, which is assumed to be non-bifurcating. At first order 
in the noise amplitude, the effect of noise on a periodic orbit is given in terms of the 
period and the derivative of the period with respect to the pseudo-energy of the Onsager- 
Machlup-Freidlin-Wentzell scheme [24]. The dynamical 'action' Lagrangian in the ex- 
ponent of (28.30), and the associated symplectic Hamiltonian were first written down in 
1970's by Freidlin and Wentzell [24], whose formulation of the 'large deviation princi- 
ple' was inspired by the Feynman quantum path integral [ 49] . Feynman, in turn, followed 
Dirac [ ] who was the first to discover that in the short-time limit the quantum propa- 
gator (imaginary time, quantum sibling of the Wiener stochastic distribution (28.16)) is 
exact. Gaspard [23] thus refers to the 'pseudo-energy of the Onsager-Machlup-Freidlin- 
Wentzell scheme.' M. Roncadelli [51, 52] refers to the Fokker-Planck exponent in (28.30) 
as the 'Wiener-Onsager-Machlup Lagrangian,' constructs weak noise saddle-point expan- 
sion and writes transport equations for the higher order coefficients. In our exposition 
the setting is more general: we study fluctuations over a state space-varying velocity field 
v(x). 

Remark 28.2 Weak noise perturbation theory. DasBuch omits any discussion of the 
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Martin-Siggia-Rose [69] type weak noise corrections. For an overview of possible ways 
for improvement of diagrammatic summation in noisy field theories, see Chaotic Field 
Theory: a Sketch [70]. The details are in the three papers on trace formulas for stochastic 
evolution operators (see also ref. [5 1]): Weak noise perturbation theory [9], smooth con- 
jugation method [10], and local matrix representation approach [ 1 1]. Such corrections 
have not been evaluated before, probably because one is so unsure about nature of the 
noise itself that nth order correction is beyond the point. Doing continuous time flows 
requires the same kind of corrections, with diagrams standing for integrals rather than 
sums, though no one ever tried weakly stochastic flows in continuous time. 

Remark 28.3 Covariance evolution. In quantum mechanics the linearized evolu- 
tion operator corresponding to the linearized Fokker-Planck evolution operator ( 28.40) is 
known as the Van Vleck propagator, the basic block in the semi-classical periodic orbit 
quantization [2], see chapter 33. Q covariance matrix composition rule (28.45) or its con- 
tinuous time version is called 'covariance evolution' (for example, in ref. [ 65]), but it goes 
all the way back to Lyapunov's 1892 thesis [ 66]. In the Kalman filter literature [67, 68] it 
is called 'prediction' . 

Remark 28.4 Operator ordering. According to L. Arnold [43] review of the original 
literature, the derivations are much more delicate than what is presented here: the noise is 
colored rather than Dirac delta function in time. He refers only to the linear case as the 
'Langevin equation'. The 6t — > 0 limit and the proper definition of i(r) are delicate is- 
sues [44, 45, 43, 46] of no import for the applications of stochasticity studied here: Ito and 
Stratanovich operator ordering issues arise in the order beyond the leading approximation 
considered here. 
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Exercises 



28.1. Who ordered V^? Derive the Gaussian integral 

— — I dx e 2 « = yfa , a > 0 . 
V27r J-oo 

assuming only that you know to integrate the exponen- 
tial function e~ x . Hint, hint: x 2 is a radius-squared of 
something, n is related to the area or circumference of 
something. 

28.2. D-dimensional Gaussian integrals. Show that the 
Gaussian integral in D-dimensions is given by 

— !— f^-l^'W = |detM|M^D 
(2^) d '- J 

where M is a real positive definite [d x of] matrix, i.e., 
a matrix with strictly positive eigenvalues, x, J are D- 
dimensional vectors, and x T is the transpose of x. 

28.3. Convolution of Gaussians. Show that the Fourier 
transform of convolution 

lf*g](x) = J d d yf(x-y)g(y) 



of two Gaussians 

_I ,-t.j_. 

fix) = e 2 A > 
factorizes as 



where 



[/**](*) 
G(*) = 



(2 



#0) 



dkF{k)G(k)e' k x , (28.52) 



i-= f ^*g(jc)e-** = |detA 2 | 1/2 e^ T ^* 
7r) d J 



Hence 



[/**](■*) = -^-jldetA^etAil 1 / 2 f d"p e ^ T 

(27T) ri J 



(Ai+/ 



det Aidet A 2 



det(A, + A 2 ) 



1/2 



-5X T -(Ai+A 2 r'-x 



exerNoise - 17apr2013 



ChaosBook.org versionl4.4.1, Apr 21 2013 



REFERENCES 



590 



References 

[28.1] P. Cvitanovic and D. Lippolis. Knowing when to stop: How noise frees us 
from determinism. In M. Robnik and V. G. Romanovski, editors, Let's Face 
Chaos through Nonlinear Dynamics, pages 82-126, Melville, New York, 
2012. Am. Inst, of Phys. arXiv: 1206. 5506. 

[28.2] N. G. van Kampen, Stochastic Processes in Physics and Chemistry (North 
Holland, Amsterdam 1992). 

[28.3] H. Risken, The Fokker-Planck Equation: Methods of Solution and Appli- 
cations (Springer- Verlag, New York, 1996). 

[28.4] W. Dittrich and M. Reuter, Classical and Quantum Dynamics: From Clas- 
sical Paths to Path Integrals (Springer- Verlag, Berlin 1994). 

[28.5] E. M. Lifshitz and L. P. Pitaevskii, Physical Kinetics (Pergamon, London 
1981). 

[28.6] C. Itzykson and J.-M. Drouffe, Statistical field theory (Cambridge U. Press, 
1991). 

[28.7] V. Ambegaokar, Reasoning about luck; probability and its uses in physics 
(Cambridge Univ. Press, Cambridge 1996). 

[28.8] B. Sakita, Quantum theory of many variable systems and fields (World 
Scientific, Singapore 1985). 

[28.9] G. E. Uhlenbeck, G. W. Ford and E. W. Montroll, Lectures in Statistical 
Mechanics (Amer. Math. Soc, Providence R.I., 1963). 

[28.10] M. Kac, "Random walk and the theory of Brownian motion," (1946), 
reprinted in ref. [11]. 

[28.1 1] N. Wax, ed., Selected Papers on Noise and Stochastic Processes (Dover, 
New York 1954). 

[28.12] L. S. Schulman, Techniques and Applications of Path Integration (Wiley - 
Interscience, New York 1981). 

[28.13] M. Kac, Probability and Related Topics in Physical Sciences (Wiley- 
Interscience, New York 1959). 

[28.14] E. Nelson, Quantum Fluctuations (Princeton Univ. Press 1985). 

[28. 15] H. Kunita, Stochastic Flows and Stochastic Differential Equations (Cam- 
bridge Univ. Press, Cambridge 1990). 

[28.16] H. Haken and G. Mayer-Kress, Z.f. Physik B 43, 185 (1981). 

[28.17] M. Roncadelli, Phys. Rev. E 52, 4661 (1995). 

[28.18] G. Ryskin, Phys. Rev. E 56, 5123 (1997). 

[28.19] L. Onsager and S. Machlup, Phys. Rev. 91, 1505, 1512 (1953). 



refsNoise-24dec2012 



ChaosBook.org versionl4.4.1, Apr21 2013 



References 



591 



[28.20] Lord Rayleigh, Phil. Mag. 26, 776 (1913). 

[28.21] G. E. Uhlenbeck and L. S. Ornstein, Phys. Rev. 36, 823 (1930). 

[28.22] M. Jacobsen, Bernoulli J. of Math. Stat, and Prob. 2, 111 (1997). 

[28.23] P. Gaspard, "Trace Formula for Noisy Flows," J. Stat. Phys. 106, 57 
(2002). 

[28.24] M. I. Freidlin and A. D. Wentzel, Random Perturbations of Dynamical 
Systems, (Springer, New York 1998). 

[28.25] E. Gozzi, M. Reuter and W. D. Thacker, Phys. Rev. D 40, 3363 (1989). 

[28.26] E. Gozzi and M. Reuter, Phys. Lett. 233B, 383 (1989); 238B, 451 (1990); 
240B, 137 (1990). 

[28.27] E. Gozzi, M. Reuter and W. D. Thacker, Phys. Rev. D 46, 757 (1992). 

[28.28] E. Gozzi, M. Reuter and W. D. Thacker, Chaos, Solitons and Fractals 2, 
441 (1992). 

[28.29] R. Benzi, G. Paladin, G. Parisi and A. Vulpiani, J. Phys. A 18, 2157 
(1985). 

[28.30] R. Graham, Europhys. Lett. 5, 101 (1988). 

[28.31] R. Graham and T. Tel, Phys. Rev. A 35, 1382 (1987). 

[28.32] L. Arnold and V. Wihstutz, Lyapunov exponents, Lecture Notes in Math. 
1186 (Springer- Verlag, New York 1986). 

[28.33] M. V. Feigelman and A. M. Tsvelik, Sov. Phys. JETP 56, 823 (1982). 

[28.34] G. Parisi and N. Sourlas, Nucl. Phys. B206, 321 (1982). 

[28.35] F. Langouche et al., Functional integration and semiclassical expansion 
(Reidel, Dordrecht 1982). 

[28.36] O. Cepas and J. Kurchan, "Canonically invariant formulation of Langevin 
and Fokker-Planck equations," European Phys. J. B 2, 221 (1998); 
arXiv:cond-mat/9706296. 

[28.37] S. Tanase-Nicola and J. Kurchan, "Statistical-mechanical formulation of 
Lyapunov exponents;" arXiv:cond-mat/02 10380. 

[28.38] M. Abramowitz and I. A. Stegun, editors. Handbook of Mathematical 
Functions. Dover, New York, 1964. 

[28.39] H. Takada, Y. Kitaoka, and Y. Shimizua. Mathematical index and model 
in stabirometry. Forma, 16:17-46, 2001. 

[28.40] S. Chandrasekhar. Stochastic problems in physics and astronomy. Rev. 
Mod. Phys., 15:1, 1943. 



refsNoise-24dec2012 



ChaosBook.org versionl4.4.1, Apr21 2013 



References 



592 



[28.41] A. Einstein. Uber die von der molekularkinetischen Theorie der Warme 
geforderte Bewegung von in ruhenden Fliissigkeiten suspendierten Teilchen. 
Ann. Physik, 17:549, 1905. 

[28.42] P. S. Laplace. Memoire sur les integrates defmies et leur application aux 
probabilites, et specialement a la recherche du milieu qu'il faut choisir entre 
les resultats des observations. Mem. Acad. Sci. (I), XI, Section V, pages 
375-387, 1810. 

[28.43] L.Arnold. Stochastic Differential Equations: Theory and Applications. 
Wiley, New York, 1974. 

[28.44] K. Ito, Stochastic integral, Proc. Imp. Acad. Tokyo 20 (1944) 519-524. 

[28.45] R. L. Stratonovich, Conditional Markov Processes and Their Application 
to the Theory of Control, Elsevier, New York, 1968. 

[28.46] R. F. Fox, Gaussian stochastic processes in physics, Phys. Rep. 48 (1978) 
179-283. 

[28.47] C. Kuehn. Deterministic continutation of stochastic metastable equilibria 
via Lyapunov equations and ellipsoids, 201 1. arXiv : 1 106 . 3479. 

[28.48] J. L. Doob. The Brownian movement and stochastic equations. Ann. 
Math., 43:351-369, 1942. 

[28.49] R. Feynman, A. Hibbs, Quantum Mechanics and Path Integrals, 
McGraw-Hill Book Co., New York, 1965. 

[28.50] P. A. M. Dirac, The Lagrangian in quantum mechanics, Phys. Z. Sowje- 
tunion 3 (1933) 64-72. 

[28.51] M. Roncadelli, Small-fluctuation expansion of the transition probability 
for a diffusion process, Phys. Rev. E 52 (1995) 4661. 

[28.52] A. Defendi, M. Roncadelli, Semiclassical approximation as a small-noise 
expansion, J. Phys. A 28 (1995) L515-L520. 

[28.53] B. Shraiman, C. E. Wayne, and P. C. Martin. Scaling theory for noisy 
period-doubling transitions to chaos. Phys. Rev. Lett., 46:935-939, 1981. 

[28.54] P. C. Martin, E. D. Siggia, H. A. Rose, Statistical mechanics of classical 
systems, Phys. Rev. A 8 (1973) 423. 

[28.55] Bernt 0ksendal. Stochastic Differential Equations. Springer, New York, 
2003. 

[28.56] H. Dekker and N. G. Van Kampen. Eigenvalues of a diffusion process 
with a critical point. Phys. Lett. A, 73:374-376, 1979. 

[28.57] P. Gaspard, G. Nicolis, A. Provata, and S. Tasaki. Spectral signature of 
the pitchfork bifurcation: Liouville equation approach. Phys. Rev. E, 51:74, 
1995. 



refsNoise-24dec2012 



ChaosBook.org versionl4.4.1, Apr21 2013 



References 



593 



[28.58] N. Berglund and B. Gentz, Noise-Induced Phenomena in Slow-Fast Dy- 
namical Systems: A Sample-Paths Approach (Springer, Berlin 2005). 

[28.59] M. Stone and P. Goldbart, Mathematics for Physics: A Guided Tour for 
Graduate Students (Cambridge Univ. Press, Cambridge, 2009). 

[28.60] Y. Kifer, On small random perturbations of some smooth dynamical sys- 
tems, Math. USSR-Izv. 8 (1974) 1083-1107. 

[28.61] M. J. Feigenbaum, B. Hasslacher, Irrational decimations and path- 
integrals for external noise, Phys. Rev. Lett. 49 (1982) 605-609. 

[28.62] A. Boyarsky, On the significance of absolutely continuous invariant mea- 
sures, PhysicaD 11 (1984) 130-146. 

[28.63] L. S. Pontryagin, V. Boltyanskii, R. Gamkrelidge, E. Mishenko, Math- 
ematical Theory of Optimal Processes, Interscience Publishers, New York, 
1962. 

[28.64] R. Bellman, Dynamic Programming, Princeton Univ. Press, Princeton, 
1957. 

[28.65] M. K. Tippett and S. E. Cohn. Adjoints and low-rank covariance repre- 
sentation. Nonlinear Processes in Geophysics, 8:331-340, 2001. 

[28.66] A. M. Lyapunov. The general problem of the stability of motion. Int. J. 
Control, 55:531-534, 1992. 

[28.67] R. E. Kalman, A new approach to linear filtering and prediction problems, 
Trans. ASME - J. Basic Engineering 82 (1960) 35^15. 

[28.68] H. D. I. Abarbanel, D. R. Creveling, R. Farsian, M. Kostuk, Dynamical 
state and parameter estimation, SIAM J. Appl. Math. 8 (2009) 1341-1381. 

[28.69] P. C. Martin, E. D. Siggia, and H. A. Rose. Statistical mechanics of 
classical systems. Physical Review A, 8:423-437, 1973. 

[28.70] P. Cvitanovic, Chaotic field theory: a sketch, Physica A288, 61 (2000); 
arXiv:nlin. CD/000 1034. 



refsNoise-24dec2012 



ChaosBook.org versionl4.4.1, Apr21 2013 



Chapter 29 



Relaxation for cyclists 




ycles, i.e., solutions of the periodic orbit condition (13.1) 



r T {x)=f\x), 



T > 0 



(29.1) 



are prerequisite to chapters 18 and 19 evaluation of spectra of classical evo- 
lution operators.Chapter 13 offered an introductory, hands-on guide to ex- 
traction of periodic orbits by means of the Newton-Raphson method. Here 
we take a very different tack, drawing inspiration from variational principles of 
classical mechanics, and path integrals of quantum mechanics. 

In sect. 13.2.1 we converted orbits unstable forward in time into orbits stable 
backwards in time. Indeed, all methods for finding unstable cycles are based on 
the idea of constructing a new dynamical system such that (/) the position of the 
cycle is the same for the original system and the transformed one, (ii) the unstable 
cycle in the original system is a stable cycle of the transformed system. 

The Newton-Raphson method for determining a fixed point x* for a map xf = 
f{x) is an example. The method replaces iteration of f{x) by iteration of the 
Newton-Raphson map (13.5) 



A fixed point x* for a map f{x) is also a fixed point of g{x), indeed a superstable 
fixed point since dgj(x*)/dxj = 0. This makes the convergence to the fixed point 
super-exponential. 

We also learned in chapter 13 that methods that start with initial guesses for 
a number of points along a cycle are considerably more robust and safer than 




(29.2) 



594 



CHAPTER 29. RELAXATION FOR CYCLISTS 



595 



searches based on direct solution of the fixed-point condition (29.1). The relax- 
ation (or variational) methods that we shall now describe take this multipoint ap- 
proach to its logical extreme, and start by a guess of not a few points along a 
periodic orbit, but a guess of the entire orbit. 

The idea is to make an informed rough guess of what the desired periodic orbit 
looks like globally, and then use variational methods to drive the initial guess 
toward the exact solution. Sacrificing computer memory for robustness of the 
method, we replace a guess that a point is on the periodic orbit by a guess of 
the entire orbit. And, sacrificing speed for safety, in sect. 29.1 we replace the 
Newton-Raphson iteration by a fictitious time, flow that minimizes a cost function 
computed as deviation of the approximate flow from the true flow along a loop 
approximation to a periodic orbit. 

If you have some insight into the topology of the flow and its symbolic dy- 
namics, or have already found a set of short cycles, you might be able to con- 
struct an initial approximation to a longer cycle p as a sequence of ./V points 
{x^\ ■ ■ ■ , x^) with the periodic boundary condition %+j = x\. Suppose 
you have an iterative method for improving your guess; after k iterations the cost 
function 



or some other more cleverly constructed function (for classical mechanics - action) 
is a measure of the deviation of the km approximate cycle from the true cycle. This 
observation motivates variational approaches to determining cycles. 

We give here three examples of such methods, two for maps, and one for bil- 
liards. In sect. 29.1 we start out by converting a problem of finding an unstable 
fixed point of a map into a problem of constructing a differential flow for which 
the desired fixed point is an attracting equilibrium point. Solving differential equa- 
tions can be time intensive, so in sect. 29.2 we replace such flows by discrete iter- 
ations. In sect. 29.3 we show that for 2D-dimensional billiard flows variation of D 
coordinates (where D is the number of Hamiltonian degrees of freedom) suffices 
to determine cycles in the full 2D-dimensional phase space. 



The relaxation (or gradient) algorithm for finding cycles is based on the observa- 
tion that a trajectory of a map such as the Henon map (3.17), 



N 




(29.3) 



29.1 Fictitious time relaxation 



(O. Biham, C. Chandre and P. Cvitanovic) 



x M = 1 - axf + byi 



yt+i = Xi 



1 > 



(29.4) 
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Figure 29.1: "Potential" Vj(x) (29.7) for a typical v iM 
point along an initial guess trajectory. For cr, = +1 
the flow is toward the local maximum of V,(x), and for 
cr, = -1 toward the local minimum. A large devia- 
tion of Xj's is needed to destabilize a trajectory passing 
through such local extremum of V,(x), hence the basin 
of attraction is expected to be large. 




is a stationary solution of the relaxation dynamics denned by the flow 

Jii = Vu f = 1 n (29.5) 

dr 

for any vector field v; = v,-(x) which vanishes on the trajectory. Here r is a "ficti- 
tious time" variable, unrelated to the dynamical time (in this example, the discrete 
time of map iteration). As the simplest example, take v, to be the deviation of an 
approximate trajectory from the exact 2- step recurrence form of the Henon map 
(3.18) 

Vj = xi+\ - 1 + axj - bxi-i. (29.6) 

For fixed x i+ i there are two values of x,- satisfying v; = 0. These solutions 
are the two extremal points of a local "potential" function (no sum on i) 

Vi = T-VfW, V l ix) = x i (x i+ i-bx i . 1 -l) + ^. (29.7) 

Assuming that the two extremal points are real, one is a local minimum of Vj(x) 
and the other is a local maximum. Now here is the idea; replace (29.5) by 

= 0-,-Vf, i - \,...,n, (29.8) 

dr 

where cr,- = ± 1 . 

The modified flow will be in the direction of the extremal point given by the 
local maximum of V,(x) if cr, = +1 is chosen, or in the direction of the one corre- 
sponding to the local minimum if we take cr, = -1. This is not quite what happens 
in solving (29.8) - all x,- and V,(x) change at each integration step - but this is the 
observation that motivates the method. The differential equations (29.8) then drive 
an approximate initial guess toward the exact trajectory. A sketch of the landscape 
in which x,- converges towards the proper fixed point is given in figure 29.1. As 
the "potential" function (29.7) is not bounded for a large |x,|, the flow diverges for 
initial guesses which are too distant from the true trajectory. However, the basin 
of attraction of initial guesses that converge to a given cycle is nevertheless very 
large, with the spread in acceptable initial guesses for figure 29.1 of order 1, in 
contrast to the exponential precision required of initial guesses by the Newton- 
Raphson method. 
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Figure 29.2: The repeller for the Henon map at a 
1.8, b = 0.3 . 



Example 29.1 Henon map cycles. Our aim in this calculation is to find all periodic 
orbits of period n for the Henon map (29.4), in principle at most 2" orbits. We start by 
choosing an initial guess trajectory (x\,X2, • • • , x n ) and impose the periodic boundary 
condition x n+ \ = x\. The simplest and a rather crude choice of the initial condition 
in the Henon map example is x, = 0 for all i. In order to find a given orbit one sets 
cr, = -1 for all iterates i which are local minima of Vi(x), and cri - 1 for iterates which 
are local maxima. In practice one runs through a complete list of prime cycles, such 
as the table 15.1. The real issue for all searches for periodic orbits, this one included, 
is how large is the basin of attraction of the desired periodic orbit? There is no easy 
answer to this question, but empirically it turns out that for the Henon map such initial 
guess almost always converges to the desired trajectory as long as the initial \x\ is not 
too large compared to l/^a. Figure 29. 1 gives some indication of a typical basin of 
attraction of the method (see also figure 29.3). 

The calculation is carried out by solving the set ofn ordinary differential equa- 
tions (29.8) using a simple Runge-Kutta method with a relatively large step size (h = 
0.1) until |v| becomes smaller than a given value s (in a typical calculation s ~ 1CT 7 ). 
Empirically, in the case that an orbit corresponding to the desired itinerary does not ex- 
ist, the initial guess escapes to infinity since the "potential" V,(x) grows without bound. 

exercise 29.3 

Applied to the Henon map at the Henon's parameters choice a = 1.4, b = 0.3, 
the method has yielded all periodic orbits to periods as long as n = 28, as well as 
selected orbits up to period n = 1000. All prime cycles up to period 10 for the Henon 
map, a = 1.4 and b = 0.3, are listed in table 29.1. The number of unstable periodic 
orbits for periods n < 28 is given in table 29.2. Comparing this with the list of all 
possible 2-symbol alphabet prime cycles, table 15.1, we see that the pruning is quite 
extensive, with the number of periodic points of period n growing as e 0 - 4645 " = ( l .592)" 
rather than as 2". 

As another example we plot all unstable periodic points up to period n = 14 for 
a = 1.8, b = 0.3 in figure 29.2. Comparing this repelling set with the strange attractor 
for the Henon's parameters figure 3.6, we note the existence of gaps in the set, cut out 
by the preimages of the escaping regions. 

rsmsirK ^y.i 

In practice, the relaxation flow (29.8) finds (almost) all periodic orbits which 
exist and indicates which ones do not. For the Henon map the method enables us to 
calculate almost all unstable cycles of essentially any desired length and accuracy. 



The idea of the relaxation algorithm illustrated by the above Henon map ex- 
ample is that instead of searching for an unstable periodic orbit of a map, one 
searches for a stable attractor of a vector field. More generally, consider a d- 
dimensional map x 1 - f(x) with a hyperbolic fixed point x, . Any fixed point x, is 
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Table 29.1: All prime cycles up to period 10 for the Henon map, a — 1.4 and b = 0.3. 
The columns list the period n p , the itinerary (defined in remark 29.3), a periodic point 
(y p , x p ), and the cycle Lyapunov exponent A p = In \A p \/n p . While most of the cycles have 
A p ~ 0.5, several significantly do not. The 0 periodic point is very unstable, isolated and 
transient fixed point, with no other cycles returning close to it. At period 13 one finds a 
pair of cycles with exceptionally low Lyapunov exponents. The cycles are close for most 
of the trajectory, differing only in the one symbol corresponding to two periodic points 
straddle the (partition) fold of the attractor. As the system is not hyperbolic, there is no 
known lower bound on cycle Lyapunov exponents, and the Henon's strange "attractor" 
might some day turn out to be nothing but a transient on the way to a periodic attractor of 
some long period. 
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Table 29.2: The number of unstable periodic orbits of the Henon map for a — 1 .4, b — 0.3, 
of all periods n < 28. M„ is the number of prime cycles of length n, and N„ is the total 
number of periodic points of period « (including repeats of shorter prime cycles). 
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Figure 29.3: Typical trajectories of the vector field 
(29.9) for the stabilization of a hyperbolic fixed 
point of the Ikeda map (29.11) located at (x,y) ss 
(0.53275, 0.24689). The circle indicates the position 
of the fixed point. Note that the basin of attraction of 
this fixed point is large, larger than the entire Ikeda at- 
tractor. 




by construction an equilibrium point of the fictitious time flow 
dx 

— = f(x) - x. (29.9) 

ClT 

If all eigenvalues of the Jacobian matrix J(x*) = Df(x t ) have real parts smaller 
than unity, then x» is a stable equilibrium point of the flow. 

If some of the eigenvalues have real parts larger than unity, then one needs to 
modify the vector field so that the corresponding directions of the flow are turned 
into stable directions in a neighborhood of the fixed point. In the spirit of (29.8), 
modify the flow by 

^ = C(f(x)-x), (29.10) 
dr 

where C is a [dxd] invertible matrix. The aim is to turn x* into a stable equilib- 
rium point of the flow by an appropriate choice of C. It can be shown that a set 
of permutation / reflection matrices with one and only one non-vanishing entry 
±1 per row or column (for J-dimensional systems, there are dlT? such matrices) 
suffices to stabilize any fixed point. In practice, one chooses a particular matrix 
C, and the flow is integrated. For each choice of C, one or more hyperbolic fixed 
points of the map may turn into stable equilibria of the flow. 



Example 29.2 Ikeda map: We illustrate the method with the determination of the 
periodic orbits of the Ikeda map: 

x = 1 + a(x cos w - y sin w) 

y' — a(x sin w + y cos w) (29.11) 

c 

where w = b ; t , 

1 + x + y z 

with a = 0.9, b = 0.4, c = 6. The fixed point x« is located at (x,y) ~ (0.53275, 0.24689), 
with eigenvalues of the Jacobian matrix (A 1; A 2 ) ~ (-2.3897,-0.3389), so the flow is 
already stabilized with C = 1. Figure 29.3 depicts the flow of the vector field around the 
fixed point x t . 

In order to determine x„ one needs to integrate the vector field (29.9) forward 
in time (the convergence is exponential in time), using a fourth order Runge-Kutta or 
any other integration routine. 
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Figure 29.4: Typical trajectories of the vector 
field (29.10) for a hyperbolic fixed point (x, y) ss 
(-0.13529,-0.37559) of/ 3 , where / is the Ikeda 
map (29.11). The circle indicates the position of 
the fixed point. For the vector field corresponding 
to (a) C = 1, x, is a hyperbolic equilibrium point 
of the flow, while for (b) C = (l _°A x, is an at- 
tracting equilibrium point. 



(a) 





(b) 



In contrast, determination of the 3-cycles of the Ikeda map requires nontrivial 
C matrices, different from the identity. Consider for example the hyperbolic fixed point 
(x,y) * (-0.13529, -0.37559) of the third iterate f 3 of the Ikeda map. The flow of the 
vector field for C = 1, Figure 29.4 (a), indicates a hyperbolic equilibrium point, while for 
C = (o_°i) ir) e fl° w of the vector field, figure 29.4(b) indicates that x„ is an attracting 
equilibrium point, reached at exponential speed by integration forward in time. 



The generalization from searches for fixed points to searches for cycles is 
straightforward. In order to determine a prime cycle x = (xi,X2, • • • , x n ) of a 
cf-dimensional map x" = f(x), we modify the multipoint shooting method of 
sect. 13.3, and consider the rcd-dimensional vector field 

^ = C(f(x)-x), (29.12) 

(XT 

where f(x) = (f(x n ), f(x{), f(xz), . . . , f(x n -\)), and C is an invertible [ndxnd] 
matrix. For the Henon map, it is sufficient to consider a set of T diagonal matrices 
with eigenvalues ± 1 . Risking a bit of confusion, we denote by x, f(x) both the 
J-dimensional vectors in (29.10), and nJ-dimensional vectors in (29.12), as the 
structure of the equations is the same. 



29.2 Discrete iteration relaxation method 



(C. Chandre, F.K. Diakonos and P. Schmelcher) 

The problem with the Newton-Raphson iteration (29.2) is that it requires very 
precise initial guesses. For example, the nth iterate of a unimodal map has as 
many as 2" periodic points crammed into the unit interval, so determination of all 
cycles of length n requires that the initial guess for each one of them has to be 
accurate to roughly 2~ n . This is not much of a problem for 1 -dimensional maps, 
but making a good initial guess for where a cycle might lie in a J-dimensional 
state space can be a challenge. 

Emboldened by the success of the cyclist relaxation trick (29.8) of manually 
turning instability into stability by a sign change, we now (/) abandon the Newton- 
Raphson method altogether, (//) abandon the continuous fictitious time flow (29.9) 
with its time-consuming integration, replacing it by a map g with a larger basin 
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of attraction (not restricted to a linear neighborhood of the fixed point). The idea 
is to construct a very simple map g, a linear transformation of the original /, for 
which the fixed point is stable. We replace the Jacobian matrix prefactor in (29.2) 
(whose inversion can be time-consuming) by a constant matrix prefactor 



where At is a positive real number, and C is a [dxd] permutation and reflection 
matrix with one and only one non- vanishing entry ± 1 per row or column. A fixed 
point of / is also a fixed point of g. Since C is invertible, the inverse is also true. 

This construction is motivated by the observation that for small At — » dr the 
map (29.13) is the Euler method for integrating the modified flow (29.10), with 
the integration step At. 

The argument why a suitable choice of matrix C can lead to the stabilization 
of an unstable periodic orbit is similar to the one used to motivate the construction 
of the modified vector field in sect. 29.1. Indeed, the flow (29.8) is the simplest 
example of this method, with the infinitesimal fictitious time increment At — > dr, 
the infinitesimal coordinate correction (x - x") — > dxi, and the [nxn] diagonal 
matrix C — > <x ; - = ±1. 

For a given fixed point of f(x) we again chose a C such that the flow in the 
expanding directions of M(x*) is turned into a contracting flow. The aim is to 
stabilize x* by a suitable choice of C. In the case where the map has multiple fixed 
points, the set of fixed points is obtained by changing the matrix C (in general 
different for each unstable fixed point) and varying initial conditions for the map 
g. For example, for 2-dimensional dissipative maps it can be shown that the 3 



suffice to stabilize all kinds of possible hyperbolic fixed points. 

If At is chosen sufficiently small, the magnitude of the eigenvalues of the 
fixed point x* in the transformed system are smaller than one, and one has a stable 
fixed point. However, At should not be chosen too small: Since the convergence 
is geometrical with a ratio 1 - c?At (where the value of constant a depends on 
the stability of the fixed point in the original system), small At can slow down 
the speed of convergence. The critical value of At, which just suffices to make 
the fixed point stable, can be read off from the quadratic equations relating the 
stability coefficients of the original system and those of the transformed system. In 
practice, one can find the optimal At by iterating the dynamical system stabilized 
with a given C and At. In general, all starting points converge on the attractor 
provided At is small enough. If this is not the case, the trajectory either diverges 
(if At is far too large) or it oscillates in a small section of the state space (if At is 
close to its stabilizing value). 



x - g(x) = x + ArC(f(x) - x), 



(29.13) 



matrices 
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The search for the fixed points is now straightforward: A stalling point cho- 
sen in the global neighborhood of the fixed point iterated with the transformed 
dynamical system g converges to the fixed point due to its stability. Numerical 
investigations show that the domain of attraction of a stabilized fixed point is a 
rather extended connected area, by no means confined to a linear neighborhood. 
At times the basin of attraction encompasses the complete state space of the attrac- 
tor, so one can be sure to be within the attracting basin of a fixed point regardless 
of where on the on the attractor on picks the initial condition. 

The step size |g(x)-x| decreases exponentially when the trajectory approaches 
the fixed point. To get the coordinates of the fixed points with a high precision, 
one therefore needs a large number of iterations for the trajectory which is already 
in the linear neighborhood of the fixed point. To speed up the convergence of the 
final part of the approach to a fixed point we recommend a combination of the 
above approach with the Newton- Raphson method (29.2). 

The fixed points of the nth iterate f are periodic points of a cycle of period 
n. If we consider the map 

x' = g{ x ) = x + AtC(/"U) - x) , (29.14) 

the iterates of g converge to a fixed point provided that At is sufficiently small 
and C is a [dxd] constant matrix chosen such that it stabilizes the flow. As n 
grows, At has to be chosen smaller and smaller. In the case of the Ikeda map 
example 29.2 the method works well for n < 20. As in (29.12), the multipoint 
shooting method is the method of preference for determining longer cycles. Con- 
sider x - (x\,X2, . . . , x n ) and the «<i-dimensional map 

x' = f(x) = (f(x n ),f(xi), . . . ,/(x„_i)) . 

Determining cycles with period n for the J-dimensional / is equivalent to deter- 
mining fixed points of the multipoint dn -dimensional /. The idea is to construct a 
matrix C such that the fixed point of / becomes stable for the map: 

x' = x + AtC(/(x) - x), 

where C is now a [ndxnd] permutation/reflection matrix with only one non-zero 
matrix element ± 1 per row or column. For any given matrix C, a certain fraction 
of the cycles becomes stable and can be found by iterating the transformed map 
which is now a nd dimensional map. 

From a practical point of view, the main advantage of this method compared to 
the Newton-Raphson method is twofold: (/) the Jacobian matrix of the flow need 
not be computed, so there is no large matrix to invert, simplifying considerably 
the implementation, and (ii) empirical basins of attractions for individual C are 
much larger than for the Newton-Raphson method. The price is a reduction in the 
speed of convergence. 
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Table 29.3: All prime cycles up to 6 bounces for the 3-disk fundamental domain, center- 
to-center separation R = 6, disk radius a = 1 . The columns list the cycle itinerary, its 
expanding eigenvalue A p , and the length of the orbit (if the velocity =1 this is the same as 
its period or the action). Note that the two 6 cycles 001011 and 001101 are degenerate 
due to the time reversal symmetry, but are not related by any discrete spatial symmetry. 
(Computed by RE. Rosenqvist.) 



"0 
1 

01 

001 

011 

0001 

0011 

0111 

00001 

00011 

00101 

00111 

01011 

01111 

000001 

000011 

000101 

000111 

001011 

001101 

001111 

010111 

011111 



9.898979485^66 
-1. 177145519638x10' 
-1.240948019921xl0 2 
-1.240542557041xl0 3 

1.449545074956x10 3 
-1.229570686196xl0 4 

1.445997591902xl0 4 
-1.707901900894xl0 4 
-1.217338387051xl0 5 

1.432820951544xl0 5 

1.539257907420x10 s 
-1.704107155425x10 s 
-1.799019479426x10 s 

2.010247347433x10 s 
-1.205062923819xl0 6 

1.418521622814xl0 6 

1.5255974482 17x10 6 
-1.688624934257x10 s 
-1.796354939785x10 6 
-1.796354939785x10 6 

2.005733106218xl0 6 

2.119615015369xl0 6 
-2.366378254801xl0 6 



4.000000000000 
4.267949192431 
8.316529485168 
12.321746616182 
12.580807741032 
16.322276474382 
16.585242906081 
16.849071859224 
20.322330025739 
20.585689671758 
20.638238386018 
20.853571517227 
20.897369388186 
21.116994322373 
24.322335435738 
24.585734788507 
24.638760250323 
24.854025100071 
24.902167001066 
24.902167001066 
25.121488488111 
25.165628236279 
25.384945785676 
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29.3 Least action method 



(P. Dahlqvist) 

The methods of sects. 29.1 and 29.2 are somewhat ad hoc, as for general 
flows and iterated maps there is no fundamental principle to guide us in choosing 
the cost function, such as (29.3), to vary. 

For Hamiltonian dynamics, we are on much firmer ground; Maupertuis least 
action principle. You yawn your way through it in every mechanics course-but as 
we shall now see, it is a very hands-on numerical method for finding cycles. 

Indeed, the simplest and numerically most robust method for determining cy- 
cles of planar billiards is given by the principle of least action, or equivalently, 
by extremizing the length of an approximate orbit that visits a given sequence of 
disks. In contrast to the multipoint shooting method of sect. 13.3 which requires 
variation of 2n phase-space points, extremization of a cycle length requires varia- 
tion of only n bounce positions 

The problem is to find the extremum values of cycle length L(s) where s = 
(si, . . ., s n ), that is find the roots of d{L(s) = 0. Expand to first order 



diL(s 0 + 6s) - diL(s Q ) + ^ didjL(s 0 )Ssj 



j + 

j 



exercise 29.1 



and use Mjj(so) = djdjL(so) in the n -dimensional Newton-Raphson iteration 
scheme of sect. 13.2.2 



MO) 



djL(s) (29.15) 



The extremization is achieved by recursive implementation of the above algo- 
rithm, with proviso that if the dynamics is pruned, one also has to check that the 
final extremal length orbit does not penetrate a billiard wall. 

As an example, the short periods and stabilities of 3-disk cycles computed this 
way are listed table 29.3. 



exercise 29.2 
exercise 13.13 



Resume 



Unlike the Newton-Raphson method, variational methods are very robust. As each 
step around a cycle is short, they do not suffer from exponential instabilities, and 
with rather coarse initial guesses one can determine cycles of arbitrary length. 
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Commentary 



Remark 29.1 Relaxation method. The relaxation (or gradient) algorithm is one of the 
methods for solving extremal problems [ ] . The method described above was introduced 
by Biham and Wenzel [1], who have also generalized it (in the case of the Henon map) 
to determination of all 2" cycles of period n, real or complex [2]. The applicability and 
reliability of the method is discussed in detail by Grassberger, Kantz and Moening [ ], 
who give examples of the ways in which the method fails: (a) it might reach a limit 
cycle rather than a equilibrium saddle point (that can be remedied by the complex Biham- 
Wenzel algorithm [2]) (b) different symbol sequences can converge to the same cycle 
(i.e., more refined initial conditions might be needed). Furthermore, Hansen (ref. [7] 
and chapter 4. of ref. [22]) has pointed out that the method cannot find certain cycles 
for specific values of the Henon map parameters. In practice, the relaxation method for 
determining periodic orbits of maps appears to be effective almost always, but not always. 
It is much slower than the multipoint shooting method of sect. 13.3, but also much quicker 
to program, as it does not require evaluation of stability matrices and their inversion. If the 
complete set of cycles is required, the method has to be supplemented by other methods. 

Remark 29.2 Hybrid Newton-Raphson/relaxation methods. The method discussed 
in sect. 29.2 was introduced by Schmelcher et al [9]. The method was extended to flows 
by means of the Poincare surface of section technique in ref. [10]. It is also possible to 
combine the Newton-Raphson method and (29.13) in the construction of a transformed 
map [14]. In this approach, each step of the iteration scheme is a linear superposition of 
a step of the stability transformed system and a step of the Newton-Raphson algorithm. 
Far from the linear neighborhood the weight is dominantly on the globally acting stability 
transformation algorithm. Close to the fixed point, the steps of the iteration are dominated 
by the Newton-Raphson procedure. 

Remark 29.3 Relation to the Smale horseshoe symbolic dynamics. For a complete 
horseshoe Henon repeller (a sufficiently large), such as the one given in figure 29.2, the 
signs <x, e {1,-1} are in a 1-to-l correspondence with the Smale horsheshoe symbolic 
dynamics Sj e {0, 1): 



For arbitrary parameter values with a finite subshift symbolic dynamics or with arbitrar- 
ily complicated pruning, the relation of sign sequences {cr [, cr%, ■ ■ ■ , cr„} to the itineraries 
{s\, S2, ■ • ■, s n ] can be much subtler; this is discussed in ref. [5]. 

Remark 29.4 Ikeda map. Ikeda map (29.11) was introduced in ref. [12] is a model 
which exhibits complex dynamics observed in nonlinear optical ring cavities. 

Remark 29.5 Relaxation for continuous time flows. For a ^-dimensional flow 
x = v(x), the method described above can be extended by considering a Poincare sur- 
face of section. The Poincare section yields a map / with dimension d-1, and the above 
discrete iterative maps procedures can be carried out. A method that keeps the trial or- 
bit continuous throughout the calculation is the Newton descent, a variational method for 
finding periodic orbits of continuous time flows, is described in refs. [15, 16]. 
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Remark 29.6 Stability ordering. The parameter At in (29.13) is a key quantity 

here. It is related to the stability of the desired cycle in the transformed system: The 

more unstable a fixed point is, the smaller At has to be to stabilize it. With increasing 

cycle periods, the unstable eigenvalue of the Jacobian matrix increases and therefore At 

has to be reduced to achieve stabilization of all fixed points. In many cases the least 

unstable cycles of a given period n are of physically most important [ 11]. In this context j- 6 

At operates as a stability filter. It allows the selective stabilization of only those cycles 

which posses Lyapunov exponents smaller than a cut-off value. If one starts the search 

for cycles within a given period n with a value At w <9(10~'), and gradually lowers At 

one obtains the sequence of all unstable orbits of order n sorted with increasing values 

of their Lyapunov exponents. For the specific choice of C the relation between At and 

the stability coefficients of the fixed points of the original system is strictly monotonous. 

Transformed dynamical systems with other C's do not obey such a strict behavior but 

show a rough ordering of the sequence of Floquet multipliers of the fixed points stabilized 

in the course of decreasing values for At. As explained in sect. 20.6, stability ordered 

cycles are needed to order cycle expansions of dynamical quantities of chaotic systems 

for which a symbolic dynamics is not known. For such systems, an ordering of cycles 

with respect to their stability has been proposed [15, 16, ?], and shown to yield good 

results in practical applications. 

Remark 29.7 Action extremization method. The action extremization (sect. 29.3) 
as a numerical method for finding cycles has been introduced independently by many 
people. We have learned it from G. Russberg, and from M. Sieber's and F. Steiner's 
hyperbola billiard computations [17, 18]. The convergence rate is really impressive, for 
the Sinai billiard some 5000 cycles are computed within CPU seconds with rather bad 
initial guesses. 

Variational methods are the key ingredient of the Aubry-Mather theory of area-preserving 
twist maps (known in the condensed matter literature as the Frenkel-Kontorova models of 
1 -dimensional crystals), discrete-time Hamiltonian dynamical systems particularly suited 
to explorations of the K.A.M. theorem. Proofs of the Aubry-Mather theorem [ 20] on 
existence of quasi-periodic solutions are variational. It was quickly realized that the vari- 
ational methods can also yield reliable, high precision computations of long periodic or- 
bits of twist map models in 2 or more dimensions, needed for K.A.M. renormalization 
studies [19]. 

A fictitious time gradient flow similar to the one discussed here in sect. 29.1 was 
introduced by Anegent [21] for twist maps, and used by Gole [22] in his proof of the 
Aubry-Mather theorem. Mathematical bounds on the regions of stability of K.A.M. tori 
are notoriously restrictive compared to the numerical indications, and de la Llave, Fal- 
colini and Tompaidis [23, 24] have found the gradient flow formulation advantageous 
both in studies of the analyticity domains of the K.A.M. stability, as well as proving the 
Aubry-Mather theorem for extended systems (for a pedagogical introduction, see the lat- 
tice dynamics section of ref. [25]). 

All of the twist-maps work is based on extremizing the discrete dynamics version of 
the action S (in this context sometimes called a "generating function"). However, in their 
investigations in the complex plane, Falcolini and de la Llave [23] do find it useful to 
minimize instead S S , analogous to our cost function (29.3). 
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Exercises 

29.1. Evaluation of billiard cycles by minimization*. 

Given a symbol sequence, you can construct a guess tra- 
jectory by taking a point on the boundary of each disk 
in the sequence, and connecting them by straight lines. 
If this were a rubber band wrapped through 3 rings, it 
would shrink into the physical trajectory, which mini- 
mizes the action (in this case, the length) of the trajec- 
tory. 

Write a program to find the periodic orbits for your bil- 
liard simulator. Use the least action principle to ex- 
tremize the length of the periodic orbit, and reproduce 
the periods and stabilities of 3-disk cycles, table 29.3. 
(One such method is given in sect. 29.3.) After that 
check the accuracy of the computed orbits by iterating 
them forward with your simulator. What is your error 
\f"(x)-x\7 

29.2. Tracking cycles adiabatically*. Once a cycle has been 
found, orbits for different system parameters values may 



be obtained by varying slowly (adiabatically) the param- 
eters, and using the old orbit points as starting guesses 
in the Newton method. Try this method out on the 3- 
disk system. It works well for R : a sufficiently large. 
For smaller values, some orbits change rather quickly 
and require very small step sizes. In addition, for ra- 
tios below R : a — 2.04821419 . . . families of cycles are 
pruned, i.e. some of the minimal length trajectories are 
blocked by intervening disks. 



29.3. Cycles of the Henon map. Apply the method of 
sect. 29.1 to the Henon map at the Henon's parameters 
choice a — 1.4, b — 0.3, and compute all prime cycles 
for at least n < 6. Estimate the topological entropy, 
either from the definition ( 15. 1), or as the zero of a trun- 
cated topological zeta function (15.27). Do your cycles 
agree with the cycles listed in table 29.1? 



References 

[29.1] O. Biham and W. Wenzel, "Characterization of unstable periodic orbits in 
chaotic attractors and repellers," Phys. Rev. Lett. 63, 819 (1989). 

[29.2] O. Biham and W. Wenzel, Phys. Rev. A 42, 4639 (1990). 

[29.3] P. Grassberger and H. Kantz, "Generating partitions for the dissipative 
Henon map," Phys. Lett. A 113, 235 (1985). 

[29.4] H. Kantz and P. Grassberger, Physica 17D, 75 (1985). 

[29.5] P. Grassberger, H. Kantz, and U. Moenig. "On the symbolic dynamics of 
the Henon map," J. Phys. A 43, 5217 (1989). 

[29.6] M. Eisele, "Comparison of several generating partitions of the Hnon map," 
/. Phys. A 32, 1533 (1999). 

[29.7] K.T. Hansen, "Remarks on the symbolic dynamics for the Henon map," 
Phys. Lett. A 165, 100 (1992). 

[29.8] D. Sterling and J.D. Meiss, "Computing periodic orbits us- 
ing the anti-integrable limit," Phys. Lett. A 241, 46 (1998); 
arXiv:chao-dyn/9802®14. 



refsRelax-22jan2005 



ChaosBook.org versionl4.4.1, Apr21 2013 



References 



608 



[29.9] P. Schmelcher and F.K. Diakonos, Phys. Rev. Lett. 78, 4733 (1997); Phys. 
Rev. E 57, 2739 (1998). 

[29.10] D. Pingel, P. Schmelcher and F.K. Diakonos, O. Biham, Phys. Rev. E 64, 
026214 (2001). 

[29.11] F. K. Diakonos, P. Schmelcher, O. Biham, Phys. Rev. Lett. 81, 4349 
(1998). 

[29.12] K. Ikeda, Opt. Commun. 30, 257 (1979). 

[29.13] F. Stummel and K. Hainer, Praktische Mathematik (Teubner, Stuttgart 
1982). 

[29.14] R.L. Davidchack and Y.C. Lai, Phys. Rev. E 60, 6172 (1999). 

[29.15] P. Cvitanovic and Y. Lan, "Turbulent fields and their recurrences," in 
N. Antoniou, ed., Proceed, of 10. Intern. Workshop on Multiparticle Produc- 
tion: Correlations and Fluctuations in QCD (World Scientific, Singapore 
2003); arXiv:nlin. CD/0308006 

[29.16] Y. Lan and P. Cvitanovic, "Variational method for finding peri- 
odic orbits in a general flow," Phys. Rev. E 69 016217 (2004); 
arXiv : nlin . CD/0308008 

[29.17] M. Sieber and F. Steiner, " Quantum Chaos in the Hyperbola Billiard," 
Phys. Lett. A 148, 415 (1990). 

[29.18] M. Sieber, The Hyperbola Billiard: A Model for the Semiclassical Quan- 
tization of Chaotic Systems, Ph.D. thesis (Hamburg 1991); DESY report 91- 
030. 

[29.19] H. T. Kook and J. D. Meiss, "Periodic orbits for reversible symplectic 
mappings," Physica D 35, 65 (1989). 

[29.20] J.N. Mather, "Variational construction of orbits of twist difeomorphisms," 
J. Amer. Math. Soc. 4 207 (1991). 

[29.21] S. B. Angenent, "The periodic orbits of an area preserving twist-map," 
Comm. Math. Phys. 115, 353 (1988). 

[29.22] C. Gole, "A new proof of the Aubry-Mather's theorem," Math. Z. 210, 
441 (1992). 

[29.23] C. Falcolini and R. de la Llave, "Numerical calculation of domains of 
analyticity for perturbation theories in the presence of small divisors," J. 
Stat. Phys. 67, 645 (1992). 

[29.24] S. Tompaidis, "Numerical Study of Invariant Sets of a Quasi-periodic Per- 
turbation of a Symplectic Map," Experimental Mathematics 5, 211 (1996). 

[29.25] R. de la Llave, Variational methods for quasiperiodic solutions of partial 
differential equations, mp_arc 00-56. 



refsRelax-22jan2005 



ChaosBook.org versionl4.4.1, Apr21 2013 



References 



609 



[29.26] H. B. Keller, Numerical solution of bifurcation and nonlinear eigenvalue 
problems, in Applications of Bifurcation Theory, edited by P. H. Rabinowitz, 
pp. 359-384, New York, 1977, Academic Press. 

[29.27] H. B. Keller, Global homotopies and Newton methods, in Recent ad- 
vances in numerical analysis, edited by C. de Boor and G. H. Golub, pp. 
73-94, New York, 1979, Academic Press. 

[29.28] G. M. Shroff and H. B. Keller, Stabilization of unstable procedures: The 
recursive projection method, SIAM Journal on Numerical Analysis 30, 1099 
(1993). 

[29.29] E. L. Allgower and K. Georg, Numerically stable homotopy meth- 
ods without an extra dimension, Lect. Appl. Math. 26, 1 (1990), 
www . math . colostate . edu/ emeri ti/georg/homExtra . pdf . 



refsRelax-22jan2005 



ChaosBook.org versionl4.4.1, Apr21 2013 



Part V 

Quantum chaos 



610 



611 




unadry: 



Semiclassical Chaos 




Predrag Cvitanovic - Roberto Artuso - Per Dahlqvist - Ronnie Mainieri - Gregor 
Tanner - Gabor Vattay - Niall Whelan - Andreas Wirzba 



ChaosBook.org versionl4.4.1, Apr 21 2013 printed April 21, 2013 

ChaosBook.org comments to: predrag [snail] nbi.dk 



refsRelax-22jan2005 



ChaosBook.org versionl4.4.1, Apr21 2013 



Chapter 30 
Prologue 



Anyone who uses words "quantum" and "chaos" in the 
same sentence should be hung by his thumbs on a tree in 
the park behind the Niels Bohr Institute. 



— Joseph Ford 



(G. Vattay, G. Tanner and P. Cvitanovic) 




ou have read the first volume of this book. So far, so good - anyone can 



play a game of classical pinball, and a skilled neuroscientist can poke rat 



brains. We learned that information about chaotic dynamics can be ob- 
tained by calculating spectra of linear operators such as the evolution operator 
of sect. 17.2 or the associated partial differential equations such as the Liouville 
equation (16.36). The spectra of these operators can be expressed in terms of peri- 
odic orbits of the deterministic dynamics by means of periodic orbit expansions. 

But what happens quantum mechanically, i.e., if we scatter waves rather than 
point-like pinballs? Can we turn the problem round and study linear PDE's in 
terms of the underlying deterministic dynamics? And, is there a link between 
structures in the spectrum or the eigenfunctions of a PDE and the dynamical prop- 
erties of the underlying classical flow? The answer is yes, but . . . things are be- 
coming somewhat more complicated when studying 2nd or higher order linear 
PDE's. We can find classical dynamics associated with a linear PDE, just take ge- 
ometric optics as a familiar example. Propagation of light follows a second order 
wave equation but may in certain limits be well described in terms of geometric 
rays. A theory in terms of properties of the classical dynamics alone, referred 
to here as the semiclassical theory, will not be exact, in contrast to the classi- 
cal periodic orbit formulas obtained so far. Waves exhibit new phenomena, such 
as interference, diffraction, and higher Ti corrections which will only be partially 
incorporated into the periodic orbit theory. 
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30.1 Quantum pinball 



In what follows, we will restrict the discussion to the non-relativistic Schrodinger 
equation. The approach will be very much in the spirit of the early days of quan- 
tum mechanics, before its wave character has been fully uncovered by Schrodinger 
in the mid 1920's. Indeed, were physicists of the period as familiar with classical 
chaos as we are today, this theory could have been developed 80 years ago. It was 
the discrete nature of the hydrogen spectrum which inspired the Bohr - de Broglie 
picture of the old quantum theory: one places a wave instead of a particle on a 
Keplerian orbit around the hydrogen nucleus. The quantization condition is that 
only those orbits contribute for which this wave is stationary; from this followed 
the Balmer spectrum and the Bohr-Sommerfeld quantization which eventually led 
to the more sophisticated theory of Heisenberg, Schrodinger and others. Today 
we are very aware of the fact that elliptic orbits are an idiosyncracy of the Kepler 
problem, and that chaos is the rule; so can the Bohr quantization be generalized 
to chaotic systems? 

The question was answered affirmatively by M. Gutzwiller, as late as 1971: a 
chaotic system can indeed be quantized by placing a wave on each of the infinity 
of unstable periodic orbits. Due to the instability of the orbits the wave does not 
stay localized but leaks into neighborhoods of other periodic orbits. Contributions 
of different periodic orbits interfere and the quantization condition can no longer 
be attributed to a single periodic orbit: A coherent summation over the infinity of 
periodic orbit contributions gives the desired spectrum. 

The pleasant surprise is that the zeros of the dynamical zeta function (1.10) 
derived in the context of classical chaotic dynamics, 



chapter 19 



also yield excellent estimates of quantum resonances, with the quantum amplitude 
associated with a given cycle approximated semiclassically by the weight 



t p = — i-_e^ P -n™ P /2 f (30.1) 

|A p |2 



whose magnitude is the square root of the classical weight (19.10) 
1 



' IA, 



t - J-Ap-sT p 

n ■ ■ i 



and the phase is given by the Bohr-Sommerfeld action integral S p , together with 
an additional topological phase m p , the number of caustics along the periodic 
trajectory, points where the naive semiclassical approximation fails. 
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In this approach, the quantal spectra of classically chaotic dynamical systems 
are determined from the zeros of dynamical zeta functions, denned by cycle ex- 
pansions of infinite products of form 

i/s=Y\o--h) = i -'Z J t f-Yt ck (30 - 2) 

P f k 
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with weight t p associated to every prime (non-repeating) periodic orbit (or cycle) 
P- 

The key observation is that the chaotic dynamics is often organized around a 
few fundamental cycles. These short cycles capture the skeletal topology of the 
motion in the sense that any long orbit can approximately be pieced together from 
the fundamental cycles. In chapter 20 it was shown that for this reason the cy- 
cle expansion (30.2) is a highly convergent expansion dominated by short cycles 
grouped into fundamental contributions, with longer cycles contributing rapidly 
decreasing curvature corrections. Computations with dynamical zeta functions 
are rather straightforward; typically one determines lengths and stabilities of a fi- 
nite number of shortest periodic orbits, substitutes them into (50.2), and estimates 
the zeros of l/£ from such polynomial approximations. 

From the vantage point of the dynamical systems theory, the trace formulas 
(both the exact Selberg and the semiclassical Gutzwiller trace formula) fit into 
a general framework of replacing phase space averages by sums over periodic 
orbits. For classical hyperbolic systems this is possible since the invariant density 
can be represented by sum over all periodic orbits, with weights related to their 
instability. The semiclassical periodic orbit sums differ from the classical ones 
only in phase factors and stability weights; such differences may be traced back 
to the fact that in quantum mechanics the amplitudes rather than the probabilities 
are added. 



The type of dynamics has a strong influence on the convergence of cycle ex- 
pansions and the properties of quantal spectra; this necessitates development of 
different approaches for different types of dynamical behavior such as, on one 
hand, the strongly hyperbolic and, on the other hand, the intermittent dynamics 
of chapters 20 and 24. For generic nonhyperbolic systems (which we shall not 
discuss here), with mixed phase space and marginally stable orbits, periodic orbit 
summations are hard to control, and it is still not clear that the periodic orbit sums 
should necessarily be the computational method of choice. 

Where is all this taking us? The goal of this part of the book is to demonstrate 
that the cycle expansions, developed so far in classical settings, are also a powerful 
tool for evaluation of quantum resonances of classically chaotic systems. 

First, we shall warm up playing our game of pinball, this time in a quan- 
tum version. Were the game of pinball a closed system, quantum mechanically 
one would determine its stationary eigenfunctions and eigenenergies. For open 
systems one seeks instead complex resonances, where the imaginary part of the 
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Figure 30.1: A typical collinear helium trajectory in 
the r\ - r 2 plane; the trajectory enters along the r\ axis 
and escapes to infinity along the r 2 axis. 




eigenenergy describes the rate at which the quantum wave function leaks out of 
the central scattering region. This will turn out to work well, except who truly 
wants to know accurately the resonances of a quantum pinball? 
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30.2 Quantization of helium 



Once we have derived the semiclassical (30. 1 ) weight associated with the periodic 
orbit p, we will finally be in position to accomplish something altogether remark- 
able. We are now able to put together all ingredients that make the game of pinball 
unpredictable, and compute a "chaotic" part of the helium spectrum to shocking 
accuracy. From the classical dynamics point of view, helium is an example of 
Poincare's dreaded and intractable 3-body problem. Undaunted, we forge ahead 
and consider the collinear helium, with zero total angular momentum, and the 

two electrons on the opposite sides of the nucleus. 
++ 



■0" 



We set the electron mass to 1, the nucleus mass to oo, the helium nucleus charge 
to 2, the electron charges to -1. The Hamiltonian is 

1 , 1 , 2 2 1 

H=-p\ + -pl + —— . (30.3) 

2 2 r\ r-i r\ + r2 

Due to the energy conservation, only three of the phase space coordinates (ji_,r>i,p\,p2) 
are independent. The dynamics can be visualized as a motion in the (tj, fy), r\ > 0 
quadrant, figure 30.1, or, better still, by a well chosen 2-dimensional Poincare 
section. 

The motion in the (ri , r-i) plane is topologically similar to the pinball motion 
in a 3-disk system, except that the motion is not free, but in the Coulomb poten- 
tial. The classical collinear helium is also a repeller; almost all of the classical 
trajectories escape. Miraculously, the symbolic dynamics for the survivors turns 
out to be binary, just as in the 3-disk game of pinball, so we know what cycles 
need to be computed for the cycle expansion (1.11). A set of shortest cycles up to 
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a given symbol string length then yields an estimate of the helium spectrum. This 
simple calculation yields surprisingly accurate eigenvalues; even though the cycle 
expansion was based on the semiclassical approximation (30. 1 ) which is expected 
to be good only in the classical large energy limit, the eigenenergies are good to 
1 % all the way down to the ground state. 

Before we can get to this point, we first have to recapitulate some basic notions 
of quantum mechanics; after having denned the main quantum objects of interest, 
the quantum propagator and the Green's function, we will relate the quantum 
propagation to the classical flow of the underlying dynamical system. We will then 
proceed to construct semiclassical approximations to the quantum propagator and 
the Green's function. A rederivation of classical Hamiltonian dynamics starting 
from the Hamilton- Jacobi equation will be offered along the way. The derivation 
of the Gutzwiller trace formula and the semiclassical zeta function as a sum and as 
a product over periodic orbits will be given in chapter 34. In subsequent chapters 
we buttress our case by applying and extending the theory: a cycle expansion 
calculation of scattering resonances in a 3-disk billiard in chapter35, the spectrum 
of helium in chapter 37, and the incorporation of diffraction effects in chapter 38. 



Commentary 



Remark 30.1 Guide to literature. A key prerequisite to developing any theory of 
"quantum chaos" is solid understanding of Hamiltonian mechanics. For that, Arnol'd 
monograph [5] is the essential reference. Ozorio de Almeida's monograph [9] offers a 
compact introduction to the aspects of Hamiltonian dynamics required for the quantization 
of integrable and nearly integrable systems, with emphasis on periodic orbits, normal 
forms, catastrophy theory and torus quantization. The book by Brack and Bhaduri [ 1] 
is an excellent introduction to the semiclassical methods. Gutzwiller's monograph [ 2] 
is an advanced introduction focusing on chaotic dynamics both in classical Hamiltonian 
settings and in the semiclassical quantization. This book is worth browsing through for 
its many insights and erudite comments on quantum and celestial mechanics even if one 
is not working on problems of quantum chaos. More suitable as a graduate course text is 
Reichl's exposition [3]. 

This book does not discuss the random matrix theory approach to chaos in quantal 
spectra; no randomness assumptions are made here, rather the goal is to milk the deter- 
ministic chaotic dynamics for its full worth. The book concentrates on the periodic orbit 
theory. For an introduction to "quantum chaos" that focuses on the random matrix theory 
the reader is referred to the excellent monograph by Haake [ 4], among others. 



Remark 30.2 The dates. Schrodinger's first wave mechanics paper [3] (hydrogen 
spectrum) was submitted 27 January 1926. Submission date for Madelung's 'quantum 
theory in hydrodynamical form' paper [2] was 25 October 1926. 
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Chapter 31 

Quantum mechanics 
the short short version 



w 



e start with a review of standard quantum mechanical concepts prereq- 
uisite to the derivation of the semiclassical trace formula. 



In coordinate representation, the time evolution of a quantum mechanical 
wave function is governed by the Schrodinger equation 



d ( ft d 

m-ifi(q,t)=H{q,-—Mq,t), (31.1) 



dt \ i dq 



where the Hamilton operator H(q, -itid q ) is obtained from the classical Hamilto- 
nian by substituting p — > -ifid q . Most of the Hamiltonians we shall consider here 
are of the separable form 

H(q,p) = T(p) + V(q) , T(p) = p 2 /2m , (31.2) 



describing dynamics of a particle in a D-dimensional potential V(q). For time- 
independent Hamiltonians we are interested in finding stationary solutions of the 
Schrodinger equation of the form 

ilj n {q,t) = e- iE " ,in Uq), (31-3) 



where E n are the eigenenergies of the time-independent Schrodinger equation 

HcPiq) = E4>(q) . (31.4) 
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For bound systems, the spectrum is discrete and the eigenfunctions form an 
orthonormal, 

f dq <f> n (q)<f>* m (q) = 6 nm , (31.5) 
and complete, 

£ = (31.6) 

n 

set of functions in a Hilbert space. Here and throughout the text, 

J" dq - J dq\dq%...dqn- (31-7) 

For simplicity, we will assume that the system is bound, although most of the 

results will be applicable to open systems, where one has complex resonances cha p (er 35 

instead of real energies, and the spectrum has continuous components. 

A given wave function can be expanded in the energy eigenbasis 

if,(q,t) = Yj C » e ~ iE " t/n( t'» ( rt> (3L8) 



where the expansion coefficient c n is given by the projection of the initial wave 
function \fj{q, 0) onto the nth eigenstate 



■-S 



dq<f>* n (q)Mq,0). (31.9) 



By substituting (31.9) into (31.8), we can cast the evolution of a wave function 
into a multiplicative form 



i, t) - J dq'K{q, q , f)\j/(q' , 0) , 



with the kernel 

K(q, q', 0 = 2 <f> n (q) e' iE " t/n ^(q') (31.10) 

n 

called the quantum evolution operator, or the propagator. Applied twice, first for 
time t\ and then for time ?2, it propagates the initial wave function from </ to q" , 
and then from q" to q 

K(q,q',ti+t 2 )= fdq"K(q,q",t 2 )K(q",q',t l ) (31.11) 
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forward in time (hence the name 'propagator'). In non-relativistic quantum me- 
chanics, the range of q" is infinite, so that the wave can propagate at any speed; 
in relativistic quantum mechanics, this is rectified by restricting the propagation 
to the forward light cone. 

Because the propagator is a linear combination of the eigenfunctions of the 
Schrodinger equation, it too satisfies this equation 



n /id 
ih—K{q,q',t) = H\q, l -—\K{q,q',t), (31.12) 



dt \ fidq 



and is thus a wave function defined for t > 0; from the completeness relation 
(31.6), we obtain the boundary condition at t = 0: 



Urn K(q,q',t)=6(q-q'). (31.13) 



The propagator thus represents the time-evolution of a wave packet starting out as 
a configuration space delta-function localized at the point cj at initial time t - 0. 

For time-independent Hamiltonians, the time dependence of the wave func- 
tions is known as soon as the eigenenergies En and eigenfunctions <p n have been 
determined. With time dependence taken care of, it makes sense to focus on the 
Green 's function, which is the Laplace transform of the propagator 

G(q,q ,E + ie) = — dte* *K(q,q,f)=>- — -. (31.14) 

ih Jo ^ E - E„ + te 

Here, e is a small positive number, ensuring the existence of the integral. The 
eigenenergies show up as poles in the Green's function with residues correspond- 
ing to the wave function amplitudes. If one is only interested in spectra, one may 
restrict oneself to the (formal) trace of the Green's function, 

\xG(q,q',E) = J ' dqG(q,q,E) - ^ ^J-^ , (31.15) 



where E is complex, with a positive imaginary part, and we have used the eigen- 
function orthonormality (31.5). This trace is formal, because the sum in (31.15) 
is often divergent. We shall return to this point in sects. 34. 1 . 1 and 34. 1 .2. 

A useful characterization of the set of eigenvalues is given in terms of the 
density of states, with a delta function peak at each eigenenergy, figure31.1 (a), 

d{E) = Y,m-E n ). (31.16) 
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Figure 31.1: Schematic picture of a) the density 
of states d(E), and b) the spectral staircase func- 
tion N(E). The dashed lines denote the mean den- 
sity of states d(E) and the average number of states 
N{E) discussed in more detail in sect. 34.1.1. 





Using the identity 



exercise 31.1 



8{E - E n ) = - lim -Im 



e->+o n E - E n + ie 



(31.17) 



we can express the density of states in terms of the trace of the Green's function. 
That is, 



d(E) = V 6(E - E n ) = - lim ^Im tr G(q, q',E + ie) 



(31.18) 



section 34.1.1 



As we shall see (after "some" work), a semiclassical formula for the right-hand- 
side of this relation yields the quantum spectrum in terms of periodic orbits. 

The density of states can be written as the derivative d(E) = dN(E)/dE of the 
spectral staircase function 



N(E) = ^ ®(£ - E n ) 



(31.19) 



which counts the number of eigenenergies below E, figure 3 1.1 (b). Here 0 is the 
Heaviside function 



0(jc) = 1 if x > 0; 0(jc) = 0 if x < 0 . 



(31.20) 



The spectral staircase is a useful quantity in many contexts, both experimental 
and theoretical. This completes our lightning review of quantum mechanics. 



Exercises 
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31.1. Dirac delta function, Lorentzian representation. 



Derive the representation (31.17) 
6(E - E„) 



lim — Im 

e->+o n E — E n + ie 



of a delta function as imaginary part of 1 jx. 

(Hint: read up on principal parts, positive and negative 
frequency part of the delta function, the Cauchy theorem 
in a good quantum mechanics textbook). 



31.2. Green's function. 

transform (31.14), 



Verify Green's function Laplace 



i r°° 

G(q,q',E + is) = — dt eT. E '-f,'K(q,q',t) 
in Jo 



S 



E - E„ + is 



argue that positive e is needed (hint: read a good quan- 
tum mechanics textbook). 
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Chapter 32 



WKB quantization 



T 



he wave function for a particle of energy E moving in a constant potential V 
is 

i{/ = AeZ pq (32.1) 



with a constant amplitude A, and constant wavelength A - 2n/k, k = p/fr, 
and p - ± yJ2m(E — V) is the momentum. Here we generalize this solution 
to the case where the potential varies slowly over many wavelengths. This 
semiclassical (or WKB) approximate solution of the Schrodinger equation fails at 
classical turning points, configuration space points where the particle momentum 
vanishes. In such neighborhoods, where the semiclassical approximation fails, 
one needs to solve locally the exact quantum problem, in order to compute con- 
nection coefficients which patch up semiclassical segments into an approximate 
global wave function. 

Two lessons follow. First, semiclassical methods can be very powerful - classi- 
cal mechanics computations yield surprisingly accurate estimates of quantal spec- 
tra, without solving the Schrodinger equation. Second, semiclassical quantization 
does depend on a purely wave-mechanical phenomena, the coherent addition of 
phases accrued by all fixed energy phase space trajectories that connect pairs of 
coordinate points, and the topological phase loss at every turning point, a topolog- 
ical property of the classical flow that plays no role in classical mechanics. 



32.1 WKB ansatz 



Consider a time-independent Schrodinger equation in 1 spatial dimension: 



H 2 

—ip"(q) + V{q)iff{q) = Eiftq) , (32.2) 
2m 
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Figure 32.1: A 1 -dimensional potential, location of 
the two turning points at fixed energy E. 




with potential V(q) growing sufficiently fast as q — » ±oo so that the classical 
particle motion is confined for any E. Define the local momentum p{q) and the 
local wavenumber k(q) by 



p(q) = ±^2m(E-V(q)), p(q) = hk(q) . 



(32.3) 



The variable wavenumber form of the Schrodinger equation 



if/' + k 2 {q)ift = 0 



(32.4) 



sugests that the wave function be written as iff - Ae^ s , A and S real functions of 
q. Substitution yields two equations, one for the real and other for the imaginary 
part: 



(S') 2 



A" 

2 + 2™ 

P + n- 
1 d 



S"A + 2S'A' = (S'A z ) = 0. 

A dq 



(32.5) 
(32.6) 



The Wentzel-Kramers-Brillouin (WKB) or semiclassical approximation consists 
of dropping the h 2 term in (32.5). Recalling that p = Tik, this amounts to assuming 
that k 2 » ^r- , which in turn implies that the phase of the wave function is changing 
much faster than its overall amplitude. So the WKB approximation can interpreted 
either as a short wavelength/high frequency approximation to a wave-mechanical 
problem, or as the semiclassical, fi «; 1 approximation to quantum mechanics. 

Setting fi = 0 and integrating (32.5) we obtain the phase increment of a wave 
function initially at q, at energy E 



S(q,q',E)= f dq"p(q"). 
Ja' 



(32.7) 



This integral over a particle trajectory of constant energy, called the action, will 
play a key role in all that follows. The integration of (32.6) is even easier 



A(q) = 



C 
\p{q)\* 



C = \p{q')\-^{q), 



(32.8) 
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where the integration constant C is fixed by the value of the wave function at the 
initial point </. The WKB (or semiclassical) ansatz. wave function is given by 

i/r sc (q, q',E) = S— e iSis^^ . (32 . 9 ) 
\p{q)\* 



In what follows we shall suppress dependence on the initial point and energy in 
such formulas, (q,^ ,E) — » (q). 

The WKB ansatz generalizes the free motion wave function (32.1), with the 
probability density |A(g)| 2 for finding a particle at q now inversely proportional 
to the velocity at that point, and the phase ^q p replaced by ^ J dq p(q), the in- 
tegrated action along the trajectory. This is fine, except at any turning point <$, 
figure 32.1, where all energy is potential, and 

p(q) 0 as q^qo, (32.10) 

so that the assumption that A 2 » ^- fails. What can one do in this case? 

For the task at hand, a simple physical picture, due to Maslov, does the job. 
In the q coordinate, the turning points are denned by the zero kinetic energy con- 
dition (see figure 32.1), and the motion appears singular. This is not so in the full 
phase space: the trajectory in a smooth confining 1 -dimensional potential is al- 
ways a smooth loop (see figure 32.2), with the "special" role of the turning points 
qi,qR seen to be an artifact of a particular choice of the (q, p) coordinate frame. 
Maslov proceeds from the initial point (</,//) to a point (qA,PA) preceding the 
turning point in the tp{q) representation, then switch to the momentum represen- 
tation 

^ qp iff(q), (32.11) 
continue from (q^, p&) to (qB,PB), switch back to the coordinate representation, 



l f,(q) = -L = f dpe^P^p), (32.12) 



and so on. 

The only rub is that one usually cannot evaluate these transforms exactly. But, 
as the WKB wave function (32.9) is approximate anyway, it suffices to estimate 
these transforms to the leading order in ft accuracy. This is accomplished by the 
method of stationary phase. 
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32.2 Method of stationary phase 



All "semiclassical" approximations are based on saddle point evaluations of inte- 
grals of the type 



dxA(x) e imx) , x, O(jc) € R , (32.13) 



where s is a real parameter, and O(x) is a real- valued function. In our applications 
s = l/h will always be assumed large. 

For large s, the phase oscillates rapidly and "averages to zero" everywhere 
except at the extremal points (& (xq) = 0. The method of approximating an integral 
by its values at extremal points is called the method of stationary phase. Consider 
first the case of a 1 -dimensional integral, and expand <&{xq + 8x) around xq to 
second order in Sx, 



dxA{x) e WK.*>y+kV'<.*>Vi*+...) . (32.14) 



Assume (for time being) that &'(xq) * 0, with either sign, sgn[0"] = <D"/|0"] = 
±1. If in the neighborhood of xq the amplitude A(x) varies slowly over many 
oscillations of the exponential function, we may retain the leading term in the 
Taylor expansion of the amplitude, and approximate the integral up to quadratic 
terms in the phase by 

r -*L 

The one integral that we know how to integrate is the Gaussian integral J dxe 2* = 
^2nb For for pure imaginary b = i a one gets instead the Fresnel integral formula 



exercise 32.1 



1 r°° 2 

——I dxe'ir* = Via - M^V'H (32.16) 

V27T J -co 
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we obtain 



/ * A(x 0 ) 



2n 



1/2 



(32.17) 



where ± corresponds to the positive/negative sign of s(&'(xo). 

32.3 WKB quantization 

We can now evaluate the Fourier transforms (32.11), (32.12) to the same order in 
Ti as the WKB wave function using the stationary phase method, 



C 



dq 



lljthJ \p{q)\2 
l2n% \p(q*)\? 



J dqe^ s " 



(32.18) 



where q* is given implicitly by the stationary phase condition 

0 = S'(q*)- p = p(q*) - p 

and the sign of S"(q*) = p'(q*) determines the phase of the Fresnel integral 
(32.16) 



c 



4[S( 9 *)-<?*i>]+§sgn[S"(<f)] 



\p(q*)p'(q*)\2 



(32.19) 



As we continue from (qA,p^) to (qs,PB), nothing problematic occurs - p(q*) is 
finite, and so is the acceleration //(<?*). Otherwise, the trajectory would take in- 
finitely long to get across. We recognize the exponent as the Legendre transform 

S(p) = S (q(p)) - q{p)p 

which can be used to expresses everything in terms of the p variable, 



, , d t dpdq(p) 

q = q(p), -i-q = i = -, — -, — = q (p)p (q ) ■ 

dq dq dp 



(32.20) 



As the classical trajectory crosses qi, the weight in (32.19), 



^-p 2 (q L ) = 2p(q L )p'(q L ) = -2mV'(q) , 
dq 



(32.21) 
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Figure 32.3: S P (E), the action of a periodic orbit p at 
energy E, equals the area in the phase space traced out 
by the 1-dof trajectory. 




is finite, and S"(q*) = p'(q*) < 0 for any point in the lower left quadrant, includ- 
ing (qA, Pa)- Hence, the phase loss in (32.19) is -?. To go back from the p to 
the q representation, just turn figure 32.2 quarter-turn anticlockwise. Everything 
is the same if you replace (q, p) — » (-p,q); so, without much ado we get the 
semiclassical wave function at the point (qs, Pb\ 



^sdq) = 



e t, 



US(p*)+qp*)- 



\q*(p*)\ 



<A.vc(P*) = 



c 



(32.22) 



The extra \p'(q*)\ l/2 weight in (32.19) is cancelled by the |</(p*)| 1/2 term, by the 
Legendre relation (32.20). 

The message is that going through a smooth potential turning point the WKB 
wave function phase slips by -|. This is equally true for the right and the left 
turning points, as can be seen by rotating figure 32.2 by 180°, and flipping co- 
ordinates (q, p) — > (—q,—p). While a turning point is not an invariant concept 
(for a sufficiently short trajectory segment, it can be undone by a 45' turn), for a 
complete period (q, p) = (c/, p') the total phase slip is always -2 • n/2, as a loop 
always has m -2 turning points. 

The WKB quantization condition follows by demanding that the wave function 
computed after a complete period be single-valued. With the normalization (32.8), 
we obtain 



\ff(q') = if/(q) 



p(q) 



,i(%§p(q)dq-n) 



The pref actor is 1 by the periodic orbit condition q = q", so the phase must be a 
multiple of 2n, 



- £p{q)dq = 2n(n + , 



(32.23) 



where m is the number of turning points along the trajectory - for this 1-dof prob- 
lem, m = 2. 



The action integral in (32.23) is the area (see figure 32.3) enclosed by the 
classical phase space loop of figure 32.2, and the quantization condition says that 
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eigen-energies correspond to loops whose action is an integer multiple of the unit 
quantum of action, Planck's constant fi. The extra topological phase, which, al- 
though it had been discovered many times in centuries past, had to wait for its 
most recent quantum chaotic (re)birth until the 1970's. Despite its derivation in a 
noninvariant coordinate frame, the final result involves only canonically invariant 
classical quantities, the periodic orbit action S , and the topological index m. 

32.3.1 Harmonic oscillator quantization 

Let us check the WKB quantization for one case (the only case?) whose quantum 
mechanics we fully understand: the harmonic oscillator 



The loop in figure 32.2 is now a circle in the (mcoq, p) plane, the action is its area 
S = 2nE/a), and the spectrum in the WKB approximation 



turns out to be the exact harmonic oscillator spectrum. The stationary phase condi- 
tion (32.18) keeps V(q) accurate to order q 2 , which in this case is the whole answer 
(but we were simply lucky, really). For many 1-dof problems the WKB spectrum 
turns out to be very accurate all the way down to the ground state. Surprisingly 
accurate, if one interprets dropping the W term in (32.5) as a short wavelength 
approximation. 



32.4 Beyond the quadratic saddle point 

We showed, with a bit of Fresnel/Maslov voodoo, that in a smoothly varying po- 
tential the phase of the WKB wave function slips by a 7r/2 for each turning point. 
This 7r/2 came from a yfi in the Fresnel integral (32. 16), one such factor for every 
time we switched representation from the configuration space to the momentum 
space, or back. Good, but what does this mean? 

The stationary phase approximation (32.14) fails whenever $"(*) = 0, or, in 
our the WKB ansatz (32.18), whenever the momentum p'{q) - S"(q) vanishes. 
In that case we have to go beyond the quadratic approximation (32.15) to the first 
nonvanishing term in the Taylor expansion of the exponent. If cp"(jc 0 ) + 0, then 




E n = ftco(n + 1 /2) 



(32.24) 




(32.25) 
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(1.6 




Figure 32.4: Airy function Ai(q). .0.6 1 _ 

•1-6-4-2 0 2 4 

Airy functions can be represented by integrals of the form 

1 r +0 ° v 3 

Ai{x) = —\ dye l(xy -- ] . (32.26) 

With a bit of Fresnel/Maslov voodoo we have shown that at each turning point 
a WKB wave function loses a bit of phase. Derivations of the WKB quantization 
condition given in standard quantum mechanics textbooks rely on expanding the 
potential close to the turning point 

V(q) = V(q 0 ) + (q~ qoW'iqo) + • • • , 

solving the Airy equation (with V'(qo) — > z after appropriate rescalings), 

if/' = Z^, (32.27) 

and matching the oscillatory and the exponentially decaying "forbidden" region 
wave function pieces by means of the WKB connection formulas. That requires 
staring at Airy functions (see (32.4)) and learning about their asymptotics - a chal- 
lenge that we will have to eventually overcome, in order to incorporate diffraction 
phenomena into semiclassical quantization. 

The physical origin of the topological phase is illustrated by the shape of the 
Airy function, figure 32.4. For a potential with a finite slope V'(q) the wave func- 
tion penetrates into the forbidden region, and accommodates a bit more of a sta- 
tionary wavelength then what one would expect from the classical trajectory alone. 
For infinite walls (i.e., billiards) a different argument applies: the wave function 
must vanish at the wall, and the phase slip due to a specular reflection is -n, rather 
than -7r/2. 



Resume 

The WKB ansatz wave function for 1 -degree of freedom problems fails at the 
turning points of the classical trajectory. While in the ^-representation the WKB 
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ansatz at a turning point is singular, along the p direction the classical trajectory in 
the same neighborhood is smooth, as for any smooth bound potential the classical 
motion is topologically a circle around the origin in the (q, p) space. The simplest 
way to deal with such singularities is as follows; follow the classical trajectory in 
g-space until the WKB approximation fails close to the turning point; then insert 
J dp\p)(p\ and follow the classical trajectory in the /?-space until you encounter 
the next p-space turning point; go back to the g-space representation, an so on. 
Each matching involves a Fresnel integral, yielding an extra e m ^ phase shift, for 
a total of e~ m phase shift for a full period of a semiclassical particle moving in a 
soft potential. The condition that the wave-function be single-valued then leads to 
the 1 -dimensional WKB quantization, and its lucky cousin, the Bohr-Sommerfeld 
quantization. 

Alternatively, one can linearize the potential around the turning point a, V(q) = 
V{a)+{q-a) V'{a)-\ — , and solve the quantum mechanical constant linear potential 
V(<l) - qF problem exactly, in terms of an Airy function. An approximate wave 
function is then patched together from an Airy function at each turning point, and 
the WKB ansatz wave-function segments in-between via the WKB connection 
formulas. The single- valuedness condition again yields the 1 -dimensional WKB 
quantization. This a bit more work than tracking the classical trajectory in the full 
phase space, but it gives us a better feeling for shapes of quantum eigenfunctions, 
and exemplifies the general strategy for dealing with other singularities, such as 
wedges, bifurcation points, creeping and tunneling: patch together the WKB seg- 
ments by means of exact QM solutions to local approximations to singular points. 



Commentary 



Remark 32.1 Airy function. The stationary phase approximation is all that is needed 
for the semiclassical approximation, with the proviso that D in ( 33.36) has no zero eigen- 
values. The zero eigenvalue case would require going beyond the Gaussian saddle-point 
approximation, which typically leads to approximations of the integrals in terms of Airy 
functions [9]. 



exercise 32.4 



Remark 32.2 Bohr-Sommerfeld quantization. Bohr-Sommerfeld quantization condi- 
tion was the key result of the old quantum theory, in which the electron trajectories were 
purely classical. They were lucky - the symmetries of the Kepler problem work out in 
such a way that the total topological index m = 4 amount effectively to numbering the 
energy levels starting with n — 1 . They were unlucky - because the hydrogen m = 4 
masked the topological index, they could never get the helium spectrum right - the semi- 
classical calculation had to wait for until 1980, when Leopold and Percival [?] added the 
topological indices. 
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I I 

Exercises 

from extremal points where the first non-zero term in a 
Taylor expansion of the phase is of third or higher order. 
Such situations occur, for example, at bifurcation points 
or in diffraction effects, (such as waves near sharp cor- 
ners, waves creeping around obstacles, etc.). In such 
calculations, one meets Airy functions integrals of the 
form 

Ai(x) = — \ dye i{xy -~ ] . (32.28) 

27T J_ M 

Calculate the Airy function Ai{x) using the stationary 
phase approximation. What happens when considering 
the limit x — > 0. Estimate for which value of x the 
stationary phase approximation breaks down. 
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32.1. WKB ansatz. ^l^r Try to show that no other 
ansatz other than (33.1) gives a meaningful definition of 
the momentum in the Ti — > 0 limit. 

32.2. Fresnel integral. Derive the Fresnel integral 

— — I dx e~fe = Via = |a| 1/2 e'^Pi . 
yhi J-co 

32.3. Sterling formula for n\. Compute an approximate 
value of n\ for large n using the stationary phase approx- 

poo 

imation. Hint: n\ — J Q dtt"e . 

*1 



32.4. Airy function for large arguments. Impor- 
tant contributions as stationary phase points may arise 
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Chapter 33 



Semiclassical evolution 



William Rowan Hamilton was born in 1805. At three 
he could read English; by four he began to read Latin, 
Greek and Hebrew, by ten he read Sanskrit, Persian, Ara- 
bic, Chaldee, Syrian and sundry Indian dialects. At age 
seventeen he began to think about optics, and worked out 
his great principle of "Characteristic Function." 
— Turnbull, Lives of Mathematicians 

(G. Vattay, G. Tanner and P. Cvitanovic) 



Semiclassical approximations to quantum mechanics are valid in the regime 
where the de Broglie wavelength A ~ fi/p of a particle with momentum 
p is much shorter than the length scales across which the potential of the 
system changes significantly. In the short wavelength approximation the particle 
is a point-like object bouncing off potential walls, the same way it does in the 
classical mechanics. The novelty of quantum mechanics is the interference of the 
point-like particle with other versions of itself traveling along different classical 
trajectories, a feat impossible in classical mechanics. The short wavelength - or 
semiclassical - formalism is developed by formally taking the limit fi —> 0 in 
quantum mechanics in such a way that quantum quantities go to their classical 
counterparts. 



33.1 Hamilton- Jacobi theory 

We saw in chapter 32 that for a 1-dof particle moving in a slowly varying poten- 
tial, it makes sense to generalize the free particle wave function (32.1) to a wave 
function 

Mq,t)=A(q,t)e iR ^ t)/n , (33.1) 
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with slowly varying (real) amplitude A{q, t) and rapidly varying (real) phase R(q, t). 
its phase and magnitude. The time evolution of the phase and the magnitude of if/ 
follows from the Schrodinger equation (31.1) 



id fi 2 d 2 

\in- + ——,-V(q)\^(q,t) = 0. (33.2) 



dt 2m dq 2 



Assume A + 0, and separate out the real and the imaginary parts. We get two 
equations: The real part governs the time evolution of the phase 



dR 1 (dR\ 2 fi 2 1 d 



dt 2m \dqj ^ 2m A dq 2 ' ^ ^ 



and the imaginary part the time evolution of the amplitude 



^ + 1^^ + 1,4— - 0 (33 4) 

dt m j-^ dqt dq,- 2m dq 2 



"momentum field" '-^(q, t) can be determined from the amplitude independent 



? + H l *)-0. (33-5) 



dt \ dq 



In classical mechanics this equation is known as the Hamilton-Jacobi equation. 
We will refer to this step (as well as all leading order in h approximations to 
follow) as the semiclassical approximation to wave mechanics, and from now on 
work only within this approximation. 



exercise 32.1 



exercise 33.6 
exercise 33.7 



exercise 33.8 



In this way a linear PDE for a complex wave function is converted into a set of 
coupled non-linear PDE's for real-valued functions R and A. The coupling term 
in (33.3) is, however, of order n 2 and thus small in the semiclassical limit fi — > 0. 

Now we generalize the Wentzel-Kramers-Brillouin (WKB) ansatz for 1-dof 
dynamics to the Van Vleck ansatz in arbitrary dimension: we assume the mag- 
nitude A(q, t) varies slowly compared to the phase R(q,t)/ti, so we drop the h- 
dependent term. In this approximation the phase R(q, t) and the corresponding 

dR, 

equation 



33.1.1 Hamilton's equations 



We now solve the nonlinear partial differential equation (33.5) in a way the 17 
year old Hamilton might have solved it. The main step is the step leading from 
the nonlinear PDE (33.9) to Hamilton's ODEs (33.10). If you already understand 
the Hamilton-Jacobi theory, you can safely skip this section. 
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Figure 33.1: (a) A phase R(q,f) plotted as a 
function of the position q for two infinitesimally 
close times, (b) The phase R(q, t) transported by 
a swarm of "particles"; The Hamilton's equations 
(33.10) construct R(q, t) by transporting go — * qifi 
and the slope of R(qo, to), that is po — » pit). 




(a) 



fast track: 

sect. 33.1.3, p. 638 



The wave equation (31.1) describes how the wave function i/r evolves with 
time, and if you think of i{r as an (infinite dimensional) vector, position q plays a 
role of an index. In one spatial dimension the phase R plotted as a function of the 
position q for two different times looks something like figure 33.1 (a): The phase 
R(q, to) deforms smoothly with time into the phase R(q, t) at time t. Hamilton's 
idea was to let a swarm of particles transport R and its slope dR/dq at q at initial 
time t = to to a corresponding R(q, t) and its slope at time t, figure 33.1 (b). For 
notational convenience, define 



dR 

Pi = Pi(q,t) := — , i = \,2,...,D. 
oqt 



(33.6) 



We saw earlier that (33.3) reduces in the semiclassical approximation to the Hamilton- 
Jacobi equation (33.5). To make life simple, we shall assume throughout this 
chapter that the Hamilton's function H(q, p) does not depend explicitly on time t, 
i.e., the energy is conserved. 

To start with, we also assume that the function R(q,t) is smooth and well 
defined for every q at the initial time t. This is true for sufficiently short times; 
as we will see later, R develops folds and becomes multi-valued as t progresses. 
Consider now the variation of the function R(q, t) with respect to independent 
infinitesimal variations of the time and space coordinates dt and dq, figure33.1 (a) 



, n dR , dR , 
dR = —dt + —dq . 

ot dq 



(33.7) 



Dividing through by dt and substituting (33.5) we obtain the total derivative of 
R(q, t) with respect to time along the as yet arbitrary direction q, that is, 



dR 

—(q, q, t) = -H(q, p) + q ■ p. 
dt 



(33.8) 



VanVleck - 8dec2010 



ChaosBook.org versionI4.4.1, Apr 21 2013 



CHAPTER 33. SEMICLASSICAL EVOLUTION 



636 



Note that the "momentum" p - dR/dq is a well defined function of q and t. 
In order to integrate R{q,t) with the help of (33.8) we also need to know how 
p = dR/dq changes along q. Varying p with respect to independent infinitesimal 
variations dt and dq and substituting the Hamilton- Jacobi equation (33.5) yields 



dR d 2 R , d 2 R , (dH 8Hdp\ , dp , 
d— = dt H -dq = - — — l- — — — \dt + —dq . 



dq dqdt dq 2 \dq dp dq J dq 

Note that H(q,p) depends on q also through p(q, t) - dR/dq, hence the ^ term 
in the above equation. Dividing again through by dt we get the time derivative of 
dR/dq, that is, 



dH ( . dH\ dp 



p(q,q,t) + —=lq-—\^-. (33.9) 
dq \ dp I dq 

Time variation of p depends not only on the yet unknown q, but also on the second 
derivatives of R with respect to q with yet unknown time dependence. However, if 
we choose q (which was arbitrary, so far) such that the right hand side of the above 
equation vanishes, we can calculate the function R(q, t) along a specific trajectory 
(q(t), p(t)) given by integrating the ordinary differential equations 

. dH(q,p) . dH(q,p) 

q = —x . P = 5 (33.10) 

dp dq 

with initial conditions 



dR 

q(t 0 ) = q , p(t 0 ) - p' - —(q'Jo). (33.11) 

dq 

We recognize (33.10) as Hamilton's equations of motion of classical mechanics. 
The miracle happens in the step leading from (33.5) to (33.9) - if you missed it, 
you have missed the point. Hamilton derived his equations contemplating optics 
- it took him three more years to realize that all of Newtonian dynamics can be 
profitably recast in this form. 

q is no longer an independent function, and the phase R(q, t) can now be com- 
puted by integrating equation (33.8) along the trajectory (q(t),p(t)) 



section 7.1 



R(q,t) = R(q',t 0 )+R(q,t;q',t 0 ) 



R{q 



,t;q',t 0 ) - f dT[q(r)-p(r)-H(q(T),p(T))] , (33.12) 
Jtn 



with the initial conditions (33.11). In this way the Hamilton- Jacobi partial differ- 
ential equation (33.3) is solved by integrating a set of ordinary differential equa- 
tions, Hamilton's equations. In order to determine R(q, t) for arbitrary q and t we 
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have to find a q' such that the trajectory starting in (</,// = d q R{q',to)) reaches 
q in time t and then compute R along this trajectory, see figure 33.1 (b). The 
integrand of (33.12) is known as the Lagrangian, 

L(q,q,t) = q-p-H(q,p,t). (33.13) 

A variational principle lurks here, but we shall not make much fuss about it as yet. 

Throughout this chapter we assume that the energy is conserved, and that the 
only time dependence of H(q, p) is through (q(r), p(r)), so the value of R(q, t; q", to) 
does not depend on to, but only on the elapsed time t - to. To simplify notation we 
will set to = 0 and write 

R(q,q',t) = R(q,t;q',0). 

The initial momentum of the particle must coincide with the initial momentum of 
the trajectory connecting c[ and q: 

p ' = w R{q ' ,0) = ~]kf R{q,q ' ,t) - (33 - 14) 

exercise 33.5 

The function R(q,q', t) is known as Hamilton's principal function. 



exercise 33.9 



To summarize: Hamilton's achievement was to trade in the Hamilton- Jacobi 
partial differential equation (33.5) describing the evolution of a wave front for a 
finite number of ordinary differential equations of motion, with the initial phase 
R(q, 0) incremented by the integral (33.12) evaluated along the phase space trajec- 
tory (q(T),p(T)). 



33.1.2 Action 

Before proceeding, we note in passing a few facts about Hamiltonian dynamics 
that will be needed for the construction of semiclassical Green's functions. If the 
energy is conserved, the J H{q,p)dr integral in (33.12) is simply Et. The first 
term, or the action 



S(q,q',E) 



f dTq(T)-p(T)= f'dq-p (33.15) 

Jo Jq' 



is integrated along a trajectory from </ to q with a fixed energy E. By (33.12) the 
action is a Legendre transform of Hamilton's principal function 

S(q,q',E) = R(q,q',t)+Et. (33.16) 
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The time of flight t along the trajectory connecting d — > q with fixed energy E is 
given by 

—S{q,q',E) = t. (33.17) 
oE 

The way to think about the formula (33. 16) for action is that the time of flight is a 
function of the energy, t - t(q, q 1 , E). The left hand side is explicitly a function of 
E; the right hand side is an implicit function of E through energy dependence of 
the flight time t. 

Going in the opposite direction, the energy of a trajectory E = E(q,q",t) 
connecting q' — > q with a given time of flight t is given by the derivative of 
Hamilton's principal function 

n 

-R(q,q',t) = -E, (33.18) 
ot 

and the second variations of R and 5 are related in the standard way of Legendre 
transforms: 

^R(q,q\t)^S(q,q',E) = -l. (33.19) 

A geometric visualization of what the phase evolution looks like is very helpful 
in understanding the origin of topological indices to be introduced in what fol- 
lows. Given an initial phase R(q, fo), the gradient d q R defines a D-dimensional 
Lagrangian manifold (q, p = d q R{q)) in the full 2d dimensional phase space 
(q, p). The defining property of this manifold is that any contractible loop y in 
it has zero action, 



section 33.1.4 




a fact that follows from the definition of p as a gradient, and the Stokes theorem. 
Hamilton's equations of motion preserve this property and map a Lagrangian man- 
ifold into a Lagrangian manifold at a later time, t 

Returning back to the main line of our argument: so far we have determined 
the wave function phase R(q, t). Next we show that the velocity field given by the 
Hamilton's equations together with the continuity equation determines the ampli- 
tude of the wave function. 



33.1.3 Density evolution 

To obtain the full solution of the Schrodinger equation (31.1), we also have to 
integrate (33.4). 

p(q, t) := A 2 = ip*ifj 
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plays the role of a density. To the leding order in h, the gradient of R may be 
interpreted as the semiclassical momentum density 



d OA dR 

\ff(q,t) {-m—)t]/{q,t) = -ihA—+p—. 

oq oq oq 



Evaluated along the trajectory (q(t),p(t)), the amplitude equation (33.4) is equiv- 
alent to the continuity equation (16.35) after multiplying (33.4) by 2A, that is 



% + ^-(pv ; ) = 0. (33.20) 
ot oqi 



Here, V; = cji = Pi/m denotes a velocity field, which is in turn determined by the 
gradient of R(q, t), or the Lagrangian manifold (q(t), p(t) - d q R(q, t)), 



1 d 

v = -—R(q,t). 
m oq 



As we already know how to solve the Hamilton-Jacobi equation (33.5), we can 
also solve for the density evolution as follows: 

The density p(q) can be visualized as the density of a configuration space 
flow q(t) of a swarm of hypothetical particles; the trajectories q(t) are solutions 
of Hamilton's equations with initial conditions given by (q(0) = <^,p(0) - p' - 
d q R(q',0)). 

If we take a small configuration space volume cPq around some point q at time 
t, then the number of particles in it is p(q, t)d°dq. They started initially in a small 
volume d D q' around the point c( of the configuration space. For the moment, we 
assume that there is only one solution, the case of several paths will be considered 
below. The number of particles at time t in the volume is the same as the number 
of particles in the initial volume at t = 0, 

p(q(t),t)d D q = p(q',0)d D q' , 

see figure 33.2. The ratio of the initial and the final volumes can be expressed as 



p(q(t), t) = 



dq' 
det-i- 

oq 



p(q',0). (33.21) 



section 16.2 



As we know how to compute trajectories (q(t), p{t)), we know how to compute 
this Jacobian and, by (33.21), the density p(q(t), t) at time t. 
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Figure 33.2: Density evolution of an initial surface 
(q',p' = d,jR(q',Q) into (q(t),p(t)) surface time t later, 
sketched in 1 dimension. While the number of trajec- 
tories and the phase space Liouville volume are con- 
served, the density of trajectories projected on the q 
coordinate varies; trajectories which started in dq' at 
time zero end up in the interval dq. 



♦1 



33.1.4 Semiclassical wave function 

Now we have all ingredients to write down the semiclassical wave function at 
time t. Consider first the case when our initial wave function can be written in 
terms of single- valued functions A(c[ , 0) and R(q' ' , 0). For sufficiently short times, 
R{q, t) will remain a single-valued function of q, and every cPq configuration 
space volume element keeps its orientation. The evolved wave function is in the 
semiclassical approximation then given by 



As the time progresses the Lagrangian manifold d q R(q, t) can develop folds, so 
for longer times the value of the phase R(q, t) is not necessarily unique; in gen- 
eral more than one trajectory will connect points q and q" with different phases 
R(q, q' , t) accumulated along these paths, see figure 33.3. 

We thus expect in general a collection of different trajectories from q" to q 
which we will index by j, with different phase increments Rj(q,q',t). The hy- 
pothetical particles of the density flow at a given configuration space point can 
move with different momenta p = d q Rj(q, t). This is not an ambiguity, since in 
the full (q, p) phase space each particle follows its own trajectory with a unique 
momentum. 

Whenever the Lagrangian manifold develops a fold, the density of the phase 
space trajectories in the fold projected on the configuration coordinates diverges. 
As illustrated in figure 33.3, when the Lagrangian manifold develops a fold at 
q = q\\ the volume element dq\ in the neighborhood of the folding point is pro- 
portional to yjdq' instead of dq'. The Jacobian dq' ' /dq diverges like 1 / ^q\ - q{t) 
when computed along the trajectory going trough the folding point at q . After 
the folding the orientation of the interval dd has changed when being mapped 
into dq2', in addition the function R, as well as its derivative which defines the 
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Figure 33.3: Folding of the Lagrangian surface 
(q,d g R(q,t)). 



Lagrangian manifold, becomes multi-valued. Distinct trajectories starting from 
different initial points c{ can now reach the same final point (That is, the point 
q' may have more than one pre-image.) The projection of a simple fold, or of an 
envelope of a family of phase space trajectories, is called a caustic; this expres- 
sion comes from the Greek word for "capable of burning," evoking the luminous 
patterns that one observes swirling across the bottom of a swimming pool. 

The folding also changes the orientation of the pieces of the Lagrangian man- 
ifold (q, d q R(q, t)) with respect to the initial manifold, so the eigenvalues of the 
Jacobian determinant change sign at each fold crossing. We can keep track of the 
signs by writing the Jacobian determinant as 



dq' 
det — 

dq 



-inmj(q,q' ,t) 



det 



dq 



where nij{q,q' , t) counts the number of sign changes of the Jacobian determinant 
on the way from q' to q along the trajectory indexed with j, see figure 33.3. We 
shall refer to the integer mj(q, q' , t) as the topological of the trajectory. So in 
general the semiclassical approximation to the wave function is thus a sum over 
possible trajectories that start at any inital 4 an d end in q in time t 



r _ ftrf 1/2 
iff sc (q,i)= dq'Y det^ J R > iq4 'A/hr-bmjfa V)/2^ ( ' 0 ) , (33.22) 
J t-r 1 dq : 3 



each contribution weighted by corresponding density, phase increment and the 
topological index. 

That the correct topological index is obtained by simply counting the number 
of eigenvalue sign changes and taking the square root is not obvious - the careful 
argument requires that quantum wave functions evaluated across the folds remain 
single valued. 
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33.2 Semiclassical propagator 



We saw in chapter 3 1 that the evolution of an initial wave function \p{q, 0) is com- 
pletely determined by the propagator (31.10). As K(q,q',t) itself satisfies the 
Schrodinger equation (31.12), we can treat it as a wave function parameterized 
by the configuration point </. In order to obtain a semiclassical approximation to 
the propagator we follow now the ideas developed in the last section. There is, 
however, one small complication: the initial condition (31.13) demands that the 
propagator at t - 0 is a 5-function at q - q 1 , that is, the amplitude is infinite at 
q' and the phase is not well defined. Our hypothetical cloud of particles is thus 
initially localized ntq-q' with any initial velocity. This is in contrast to the situ- 
ation in the previous section where we assumed that the particles at a given point q 
have well defined velocity (or a discrete set of velocities) given by q = dpH{q, p). 
We will now derive at a semiclassical expression for K(q, </, t) by considering the 
propagator for short times first, and extrapolating from there to arbitrary times t. 



33.2.1 Short time propagator 

For infinitesimally short times 8t away from the singular point t = 0 we assume 
that it is again possible to write the propagator in terms of a well defined phase 
and amplitude, that is 

K(q, q', St) = A(q, q' , St)e^ q ' q '•*> . 

As all particles start at q - q 1 , R(q, q', 5t) will be of the form (33. 12), that is 

R(q, q, St) = pqSt - H(q, p)St , (33.23) 

with q ~ (q-q')/St. For Hamiltonians of the form (31.2) we have q = p/m, which 
leads to 

m(q - q'j 1 
R(q,q',6t)= ^ > - V(q)6t . 

Here V can be evaluated any place along the trajectory from q to </, for example 
at the midway point V((q + q')/2). Inserting this into our ansatz for the propagator 
we obtain 

K sc {q, q, St) ~ A(q, q', 6t)e^ ( ^' )2 - V( ^ . (33.24) 

For infinitesimal times we can neglect the term V(q)6t, so K sc (q,q' ,5t) is a d- 
dimensional Gaussian with width o 2 - ihSt/m. This Gaussian is a finite width 
approximation to the Dirac delta function 

S{z) - lim 1 p' z1,1<t1 (33.25) 
V2tto- 2 
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if A - (m/2mHSt) D ^ 2 , with A(q,q',6t) fixed by the Dirac delta function normal- 
ization condition. The correctly normalized propagator for infinitesimal times 6t 
is therefore 

K ^-*H^V ek ^™- (33 - 26) 

The short time dynamics of the Lagrangian manifold (q, d q R) which corresponds 
to the quantum propagator can now be deduced from 03.23); one obtains 

dR m 

i.e., is the particles start for short times on a Lagrangian manifold which is a plane 
in phase space, see figure 33.4. Note, that for 8t — > 0, this plane is given by 
the condition q = </, that is, particles start on a plane parallel to the momentum 
axis. As we have already noted, all particles start at q = q" but with different 
velocities for t = 0. The initial surface (</,// = d q R(q',0)) is mapped into the 
surface (q(t), p(t)) some time t later. The slope of the Lagrangian plane for a short 
finite time is given as 

dp, _ d 2 R dp\ _ m § 
dqj dqjdq'i dqj 5t ' } 

The prefactor (m/ft) D/2 in (33.26) can therefore be interpreted as the determinant 
of the Jacobian of the transformation from final position coordinates q to initial 
momentum coordinates p' , that is 



/ a '\ l l 2 



where 



dp\ 
dqj 



t.q' 



,33.28, 



The subscript • • -\ t q , indicates that the partial derivatives are to be evaluated with 
t, q' fixed. 

The propagator in (33.27) has been obtained for short times. It is, however, 
already more or less in its final form. We only have to evolve our short time 
approximation of the propagator according to (33.22) 

K sc (q", q',t'+ 8t) = Y det ^- ^ e iRM",i,f)/n-inmj( q ", q ,t')/2 K{ , Sf) > 
^r 1 oq" j 1 



exercise 33.1 
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Figure 33.4: Evolution of the semiclassical propaga- 
tor. The configuration which corresponds to the initial 
conditions of the propagator is a Lagrangian manifold 
q = q', that is, a plane parallel to the p axis. The hy- 
pothetical particles are thus initially all placed at q' but 
take on all possible momenta p'. The Jacobian matrix 
C (33.29) relates an initial volume element in momen- 
tum space dp' to a final configuration space volume 
dq. 




and we included here already the possibility that the phase becomes multi- valued, 
that is, that there is more than one path from 4 to q" . The topological index mj = 
nij(q" ,q' ,t) is the number of singularities in the Jacobian along the trajectory j 
from q' to q" . We can write K sc {q" , q',t' + 8t) in closed form using the fact that 
R(q",q,t') + R{q,q',8t) - R{q",q',t' + St) and the multiplicativity of Jacobian 
determinants, that is 



det 



dq 
dq" 



det 



d£_ 

dq 



dp' 



-det , 

q',St da 



(33.29) 



q',t'+St 



The final form of the semiclassical or Van Vleck propagator, is thus 



K sc (q,q',t) = V 



(2mh) D ' 2 



det 



dp' 
dq 



1/2 



JRj(.q,q',t)/H-imjx/2 



(33.30) 



This Van Vleck propagator is the essential ingredient of the semiclassical quanti- 
zation to follow. 

The apparent simplicity of the semiclassical propagator is deceptive. The 
wave function is not evolved simply by multiplying by a complex number of mag- 
nitude -y/det dp' /dq and phase R(q,q',t); the more difficult task in general is to 
find the trajectories connecting qf and q in a given time t. 

In addition, we have to treat the approximate propagator (33.30) with some 
care. Unlike the full quantum propagator, which satisfies the group property 
(31.11) exactly, the semiclassical propagator performs this only approximately, 
that is 

K sc (q,q',t\ +t 2 ) ~ J dq" K sc (q,q" ,h)K sc (q" ,q ,t\) . (33.31) 

The connection can be made explicit by the stationary phase approximation, sect.32.2. 
Approximating the integral in (33.31) by integrating only over regions near points 
q" at which the phase is stationary, leads to the stationary phase condition 



dR(q,q",t 2 ) | dR{q",q',h) =Q 



dq'! 



dq- 



(33.32) 
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Classical trajectories contribute whenever the final momentum for a path from q" 
to q" and the initial momentum for a path from q" to q coincide. Unlike the 
classical evolution of sect. 17.2, the semiclassical evolution is not an evolution by 
linear operator multiplication, but evolution supplemented by a stationary phase 
condition p out = pj n that matches up the classical momenta at each evolution 
step. 



33.2.2 Free particle propagator 



To develop some intuition about the above formalism, consider the case of a free 
particle. For a free particle the potential energy vanishes, the kinetic energy is 
^q 2 , and the Hamilton's principal function (33.12) is 

m(q — q') 2 

R(q,q',t)= Kq 2t H ' . (33.33) 

The weig ht det ^ from (33.28) can be evaluated explicitly, and the Van Vleck 
propagator is 



identical to the short time propagator (33.26), with V(q) = 0. This case is rather 
exceptional: for a free particle the semiclassical propagator turns out to be the 
exact quantum propagator K(q,c[,t), as can be checked by substitution in the 
Schrodinger equation (33.2). The Feynman path integral formalism uses this fact 
to construct an exact quantum propagator by integrating the free particle propaga- 
tor (with V(q) treated as constant for short times) along all possible (not necessar- 
ily classical) paths from q 1 to q. 



33.3 Semiclassical Green's function 



remark 33.3 



exercise 33.10 
exercise 33.1 1 
exercise 33.12 



So far we have derived semiclassical formulas for the time evolution of wave func- 
tions, that is, we obtained approximate solutions to the time dependent Schrodinger 
equation (31.1). Even though we assumed in the calculation a time-independent 
Hamiltonian of the special form (31.2), the derivation would lead to the same final 
result (33.30) were one to consider more complicated or explicitly time dependent 
Hamiltonians. The propagator is thus important when we are interested in finite 
time quantum mechanical effects. For time-independent Hamiltonians, the time 
dependence of the propagator as well as of wave functions is, however, essentially 
given in terms of the energy eigen- spectrum of the system, as in (31.8). It is there- 
fore advantageous to switch from a time representation to an energy representa- 
tion, that is from the propagator (31.10) to the energy dependent Green's function 
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(31.14). A semiclassical approximation of the Green's function G sc (q,q' , E) is 
given by the Laplace transform (31.14) of the Van Vleck propagator K sc (q,q' , t): 

1 r°° 

G sc (q,q',E) = - dte iEt/n K sc (q,q' ,t) . (33.35) 
in Jo 



The expression as it stands is not very useful; in order to evaluate the integral, at 
least to the leading order in H, we need to turn to the method of stationary phase 
again. 



33.3.1 Stationary phase in higher dimensions 

exercise 32.1 

Generalizing the method of sect. 32.2 to d dimensions, consider stationary phase 
points fulfilling 



d 

— 3>(x) 

dxi 



= 0 Vi=l,...d. 

x=x 0 



An expansion of the phase up to second order involves now the symmetric matrix 
of second derivatives of ®(x), that is 



OXiOXj 



X=XQ 



After choosing a suitable coordinate system which diagonalizes D, we can ap- 
proximate the J-dimensional integral by d 1 -dimensional Fresnel integrals; the 
stationary phase estimate of (32.13) is then 

/ » ^ |det D(x 0 )r l/2 A(x 0 ) )->(*«) ^ 



where the sum runs over all stationary phase points xq of O(x) and m(xo) counts 

the number of negative eigenvalues of D(xn). 

b b v ' exercise 28.2 

exercise 33.2 

The stationary phase approximation is all that is needed for the semiclassical exerc j se 32 3 
approximation, with the proviso that D in (33.36) has no zero eigenvalues. 



33.3.2 Long trajectories 

When evaluating the integral (33.35) approximately we have to distinguish be- 
tween two types of contributions: those coming from stationary points of the phase 
and those coming from infinitesimally short times. The first type of contributions 
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can be obtained by the stationary phase approximation and will be treated in this 
section. The latter originate from the singular behavior of the propagator for t — > 0 
where the assumption that the amplitude changes slowly compared to the phase 
is not valid. The short time contributions therefore have to be treated separately, 
which we will do in sect. 33.3.3. 

The stationary phase points f of the integrand in (33.35) are given by the 
condition 



jR(q,q',f) + E = 0. 



(33.37) 



We recognize this condition as the solution of (33. 1 8), the time t* = t*(q, q', E) in 
which a particle of energy E starting out in c[ reaches q. Taking into account the 
second derivative of the phase evaluated at the stationary phase point, 



R(q, q', t) + Et = R(q, q' , f) + Ef + -(/ - ff^—R{q, q' , f) + 



1 



the stationary phase approximation of the integral corresponding to a classical 
trajectory j in the Van Vleck propagator sum (33.30) yields 



Gj(q,q',E) 



1 



detC 



d% 
dt 2 



1/2 



(33.38) 



where mj = mj(q, q' , E) now includes a possible additional phase arising from the 
time stationary phase integration (32.16), and Cj = Cj{q,q',f), Rj = Rj(q,q',f) 
are evaluated at the transit time f . We re-express the phase in terms of the energy 
dependent action (33.16) 



S(q,q',E) = R(q,q',t*) + Et* , with t* = t*(q,q',E), 



(33.39) 



the Legendre transform of Hamilton's principal function. Note that the partial 
derivative of the action (33.39) with respect to q\ 

dS(g,g',E) _ dR(g,q',t*) ( dR(q,q',t) \ dt_ 
dq { dqi \ dt* j dq t ' 

is equal to 



dS(q,q',E) dR(q,q',f) 



dqt 



dqi 



(33.40) 



due to the stationary phase condition (33.37), so the definition of momentum as a 
partial derivative with respect to q remains unaltered by the Legendre transform 
from time to energy domain. 



exercise 33.13 
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Next we will simplify the amplitude term in (33.38) and rewrite it as an ex- 
plicit function of the energy. Consider the [(D + l)x(D + 1)] matrix 



d 2 S 


a 2 s v 


( dp' 


dp' 


dq'dq 


dq'dE \ 


dq 


dE 


d 2 S 


d 2 S ~ 


* 


at 


dqdE 


dE 2 1 


V dq 


he 



D(q,q',E) = \ 0q 'f a %™ = * a f , (33.41) 



where S = S(q, q 1 , E) and we used (33.14-33.17) here to obtain the left hand side 
of (33.41). The minus signs follow from the definition of (33.15), which implies 
that that S (q, q',E) = -S (q', q, E). Note that D is nothing but the Jacobian matrix 
of the coordinate transformation (q, E) — > (//,f) for fixed q' . We can therefore 
use the multiplication rules of determinants of Jacobians, which are just ratios of 
volume elements, to obtain 



detD = (-1) D+1 Idet ~ sr I =(-l)" +1 (det 

d(q,E)J q , \ d(q,t) d(q,E)) ql 

U*\ =detr^ 



We use here the notation (det .) q , t for a Jacobian determinant with partial deriva- 
tives evaluated at t, </ fixed, and likewise for other subscripts. Using the relation 
(33.19) which relates the term % to d 2 t R we can write the determinant of D as 
a product of the Van Vleck determinant (33.28) and the amplitude factor arising 
from the stationary phase approximation. The amplitude in (33.38) can thus be 
interpreted as the determinant of a Jacobian of a coordinate transformation which 
includes time and energy as independent coordinates. This causes the increase in 
the dimensionality of the matrix D relative to the Van Vleck determinant (33.28). 

We can now write down the semiclassical approximation of the contribution 
of the jth trajectory to the Green's function (33.38) in explicitly energy dependent 
form: 

However, this is still not the most convenient form of the Green's function. 

The trajectory contributing to Gj(q,q',E) is constrained to a given energy 
E, and will therefore be on a phase space manifold of constant energy, that is 
H(q, p) = E. Writing this condition as a partial differential equation for S(q,q", E), 
that is 



H(q,—) = E, 
oq 
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one obtains 



^-H{q,p) = 0 = 

dq j 

—H(q',p') = 0 = 
dqi 



8H dpj 
dpj dq\ 
d 2 S , 



d 2 S 
dqjdq\ 



dqidq'j 



74 j ■ 



(33.43) 



that is the sub-matrix cfS /dqidq'j has (left- and right-) eigenvectors corresponding 
to an eigenvalue 0. Rotate the local coordinate system at the either end of the 
trajectory 

(qi,qi,q3, ••',#</)—» (q\\,q±i,q±2, ■ ■ • »<?j_(d-i)) 

so that one axis points along the trajectory and all others are perpendicular to it 
(quqi, q-h, • • • , 9d) -> (q, 0, 0, • • • , 0) . 

With such local coordinate systems at both ends, with the longitudinal coordinate 
axis q\\ pointing along the velocity vector of magnitude q, the stability matrix of 
S(q, q', E) has a column and a row of zeros as (33.43) takes form 



d 2 S 



d 2 S 



dq\\dq\ dq t dq. 



7 q'=0. 



The initial and final velocities are non-vanishing except for points \q\ = 0. These 
are the turning points (where all energy is potential), and we assume that neither q 
nor q' is a turning point (in our application - periodic orbits - we can always chose 
q = q' not a turning point). In the local coordinate system with one axis along 
the trajectory and all other perpendicular to it the determinant of (53.41) is of the 
form 



det D(q,q',E) - (-1) 



D+l 



det 



0 
0 

d 2 S 
dq\\BE 



0 

<f- s 

dq x dq' x 



d 2 S 
dEdqi 



(33.44) 



The corner entries can be evaluated using (33.17) 



d 2 S d_ _ 1 

dq\\dE dq\\ q ' 



d 2 S 



1 



dEdq[, q' 



As the q\\ axis points along the velocity direction, velocities q, c[ are by construc- 
tion almost always positive non-vanishing numbers. In this way the determinant 
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of the [(D + l)x(D + 1)] dimensional matrix D{q,c[ ,E) can be reduced to the 
determinant of a [(D - l)x(D - 1)] dimensional transverse matrix D ± (q,q',E) 



det D(q, q',E) = — det D ± (q, q' , E) 
qq' 

n ( ' v\ d 2 S(g,g',E) 

D ± {q,q ,E) ik = — . (33.45) 

oq ±i dq k 



Putting everything together we obtain the jth trajectory contribution to the semi- 
classical Green's function 



exercise 33.15 



Gj(q,q',E) = 



ihilmtyW-W 2 \ qq > 



1/2 



det D\ 



1/2 



i ? — !E m ■ 
gs j ; 2 m J 



(33.46) 



where the topological index mj = mj(q,q',E) now counts the number of changes 
of sign of det D 1 ^ along the trajectory j which connects q 1 to q at energy E. 



The endpoint velocities q, q 1 also depend on (q, q 1 ', E) and the trajectory j. 



33.3.3 Short trajectories 

The stationary phase method cannot be used when f is small, both because we 
cannot extend the integration in (32.16) to -oo, and because the amplitude of 
K(q, q' , t) is divergent. In this case we have to evaluate the integral involving the 
short time form of the exact quantum mechanical propagator (33.26) 

G 0 ( q ,q>,E) = I £ dt [^f /2 e ^-n^ . (33.47) 



By introducing a dimensionless variable r = t^j2m(E - V(q))/m\q - q'\, the 
integral can be rewritten as 



m / V2m(£-V) V' 1 f° dr A 
G 0 {q,q,E)- mKin . )DI2 y — - j ^ - 



^S 0 (q,q',E){T+\/T) 



where 5'o(^, q', E) = s/2m(E - V)\q - q'\ is the short distance form of the action. 
Using the integral representation of the Hankel function of first kind 



pea 

H + y {z) = --e-™ 12 e^ T+l/T) T' v ~ l dr 
n Jo 



VanVleck - 8dec2010 



ChaosBook.org versionI4.4.1, Apr 21 2013 



CHAPTER 33. SEMICLASSICAL EVOLUTION 



651 



we can write the short distance form of the Green's function as 

G 0 (q,q ,E) » -— I _ - ' \ H + ^(S Q {q, q' , E)fn) . (33.48) 

Hankel functions are stabdard, and their the short wavelength asymptotics is de- 
scribed in standard reference books. The short distance Green's function approx- 
imation is valid when So(q,q',E) < H. 



Resume 



The aim of the semiclassical or short- wavelength methods is to approximate a 
solution of the Schrodinger equation with a semiclassical wave function 



j 



iRj(q,t)/n 



accurate to the leading order in fi. Here the sum is over all classical trajectories 
that connect the initial point c[ to the final point q in time t. "Semi-" refers to fi, 
the quantum unit of phase in the exponent. The quantum mechanics enters only 
through this atomic scale, in units of which the variation of the phase across the 
classical potential is assumed to be large, "-classical" refers to the rest - both the 
amplitudes Aj(q, t) and the phases Rj(q, t) - which are determined by the classical 
Hamilton- Jacobi equations. 

In the semiclassical approximation the quantum time evolution operator is 
given by the semiclassical propagator 



K sc (q,q',t) = 



1 



{2mfi) D l 2 



det 



dq 



1/2 



where the topological index mj(q, q' , t) counts the number of the direction reversal 
along the jth classical trajectory that connects q" —> q in time t. Until very recently 
it was not possible to resolve quantum evolution on quantum time scales (such as 
one revolution of electron around a nucleus) - physical measurements are almost 
always done at time scales asymptotically large compared to the intrinsic quantum 
time scale. Formally this information is extracted by means of a Laplace trans- 
form of the propagator which yields the energy dependent semiclassical Green 's 
function 



G sc (q,q',E) = G 0 (q,q' ,E) + j{q,q ,E) 

j 

1 1 fir,' 

Gj(q,q',E) 



ih(2mh)' 



qq' dq± 



e n J j~2 m j 



(33.49) 



VanVleck - 8dec2010 



ChaosBook.org versionI4.4.1, Apr 21 2013 



CHAPTER 33. SEMICLASSICAL EVOLUTION 652 

where Go(q,q',E) is the contribution of short trajectories with So(q,q',E) < ft, 
while the sum is over the contributions of long trajectories (33.46) going from q' 
to q with fixed energy E, with Sj(q, q',E) » H. 



Commentary 

Remark 33.1 Limit h — > 0. The semiclassical limit "ti — > 0" discussed in sect. 33.1 
is a shorthand notation for the limit in which typical quantities like the actions R or S in 
semiclassical expressions for the propagator or the Green's function become large com- 
pared to Ti. In the world that we live in the quantity ft is a fixed physical constant whose 
value [8] is 1.054571596(82) 1(T 34 Js. 

Remark 33.2 Madelung's fluid dynamics. Already Schrodinger [ ] noted that 
p=p(q,t):=A 2 = iA*<A 

plays the role of a density, and that the gradient of R may be interpreted as a local semi- 
classical momentum, as the momentum density is 

i//(q,tY(-iti^-)i/f(q,t) = -ihA^-+p^-. 

oq oq dq 

A very different interpretation of (33.3-33.4) has been given by Madelung [2], and then 
built upon by Bohm [6] and others [3, 7]. Keeping the ft dependent term in (33.3), the 
ordinary differential equations driving the flow (33.10) have to be altered; if the Hamilto- 
nian can be written as kinetic plus potential term V(q) as in ( 3 1 .2), the tf term modifies 
the p equation of motion as 

Pi = - ir (V(q) + Q(q,t)) , (33.50) 
dqi 

where, for the example at hand, 



2m yfp dq 2 



interpreted by Bohm [ ] as the "quantum potential." Madelung observed that Hamilton's 
equation for the momentum (33.50) can be rewritten as 



dt \ dq) ' m dq, mp dqj 



+ |v-— |v; = —cry, (33.52) 



where cry = j£ dqjq- ' s ^ e "P ressure " stress tensor, v,- = pi/m, and p = A 2 as de- 
fined [3] in sect. 33.1.3. We recall that the Eulerian j t + -gj-j- is the ordinary derivative 
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of Lagrangian mechanics, that is 4 . For comparison, the Euler equation for classical 
hydrodynamics is 



where p6ij is the pressure tensor. 

The classical dynamics corresponding to quantum evolution is thus that of an "hypo- 
thetical fluid" experiencing H and p dependent stresses. The "hydrodynamic" interpreta- 
tion of quantum mechanics has, however, not been very fruitful in practice. 

Remark 33.3 Path integrals. Thesemiclassicalpropagator(33.30)canalsobederived 
from Feynman's path integral formalism. Dirac was the first to discover that in the short- 
time limit the quantum propagator (33.34) is exact. Feynman noted in 1946 that one can 
construct the exact propagator of the quantum Schrodinger equation by formally summing 
over all possible (and emphatically not classical) paths from q' to q . 

Gutzwiller started from the path integral to rederive Van Vleck's semiclassical ex- 
pression for the propagator; Van Vleck's original derivation is very much in the spirit 
of what has presented in this chapter. He did, however, not consider the possibility of 
the formation of caustics or folds of Lagrangian manifolds and thus did not include the 
topological phases in his semiclassical expression for the propagator. Some 40 years later 
Gutzwiller [4] added the topological indices when deriving the semiclassical propagator 
from Feynman's path integral by stationary phase conditions. 

Remark 33.4 Applications of the semiclassical Green's function. The semiclassical 
Green's function is the starting point of the semiclassical approximation in many appli- 
cations. The generic semiclassical strategy is to express physical quantities (for example 
scattering amplitudes and cross section in scattering theory, oscillator strength in spec- 
troscopy, and conductance in mesoscopic physics) in terms of the exact Green's function 
and then replace it with the semiclassical formula. 

Remark 33.5 The quasiclassical approximation The quasiclassical approximation 
was introduced by Maslov [ ]. The term 'quasiclassical' is more appropriate than semi- 
classical since the Maslov type description leads to a pure classical evolution operator in 
a natural way. Following mostly ref. [8], we give a summary of the quasiclassical approx- 
imation, which was worked out by Maslov [ 7] in this form. One additional advantage of 
this description is that the wave function evolves along one single classical trajectory and 
we do not have to compute sums over increasing numbers of classical trajectories as in 
computations involving Van Vleck formula [27]. 



VanVleck - 8dec2010 ChaosBook.org versionl4.4.1, Apr 21 2013 




EXERCISES 



654 



Exercises 



33.1. Dirac delta function, Gaussian representation. 

Consider the Gaussian distribution function 



Sail) 



1 



„-z 2 /2o 



V27TCT 2 



Show that in cr — » 0 limit this is the Dirac delta function 
dx 5{x) = 1 if 0 e M , zero otherwise . 



33.2. Stationary phase approximation in higher dimensions. 

All semiclassical approximations are based on saddle 
point evaluations of integrals of type 



33.6. 1 -dimensional harmonic oscillator. Take a 1- 
dimensional harmonic oscillator U(q) = Ikq 2 . Take a 
WKB wave function of form A(q, t) = a(t) and R(q, t) = 
r(t) + b(t)q + c(t)q 2 , where r(t), a(t), b(t) and c(f) are 
time dependent coefficients. Derive ordinary differen- 
tial equations by using (33.3) and (33.4) and solve them, 
(continuation: exercise 33.9.) 

33.7. 1 -dimensional linear potential. Take a 1 -dimensional 
linear potential U{q) = -Fq. Take a WKB wave func- 
tion of form A(q, t) = a(t) and R(q, t) - r(t) + b(t)q + 
c(t)q 2 , where r(t), a(t), b(t) and c(f) are time dependent 
coefficients. Derive and solve the ordinary differential 
equations from (33.3) and (33.4). 



= j d D xA(x)e i 



33.8. 



<D(jt)/fi 



(33.53) 



for small values of Ti. Obtain the stationary phase esti- 33.9. 
mate 



D/2 



D-dimensional quadratic potentials. Generalize the 
above method to general D-dimensional quadratic po- 
tentials. 

Time evolution of R. (continuation of exer- 

cise 33.6) Calculate the time evolution of R(q,0) = 
a + bq + cq 2 for a 1 -dimensional harmonic oscillator us- 
ing (33.12) and (33.14). 



-y/detD 2 ®^,,) 33.10. D-dimensional free particle propagator. Verify the 

where D 2 <5(x„) denotes the second derivative matrix. 
33.3. Schrodinger equation in the Madelung form. Ver- 



results in sect. 33.2.2; show explicitly that (33.34), the 
semiclassical Van Vleck propagator in D dimensions, 
solves the Schrodinger's equation. 



ify the decomposition of Schrodinger equation into real 33. 1 1 . 
and imaginary parts, eqs. (33.3) and (33.4). 



33.4. Transport equations. 

function in the asymptotic form 



Write the wave- 



Mqj) = ei R ™ + i st Y J (iK) n A n (x,i). 



Propagator, charged particle in constant magnetic 
field. Calculate the semiclassical propagator for a 
charged particle in constant magnetic field in 3 dimen- 
sions. Verify that the semiclassical expression coincides 
with the exact solution. 

33.12. 1-dimensional harmonic oscillator propagator. 

Calculate the semiclassical propagator for a 1- 
dimensional harmonic oscillator and verify that it is 
identical to the exact quantum propagator. 



Derive the transport equations for the A„ by substituting ^3 J3 
this into the Schrodinger equation and then collecting 
terms by orders of H. Note that equation for A„ only 
requires knowledge of A „_i and/?. 

33.5. Easy examples of the Hamilton's principal function. 33.14. 
Calculate R(q, q', t) for 

a) a D-dimensional free particle 

b) a 3-dimensional particle in constant magnetic field 

c) a 1-dimensional harmonic oscillator. 33.15. 

(continuation: exercise 33.13.) 



Free particle action. Calculate the energy dependent 
action for a free particle, a charged particle in a constant 
magnetic field and for the harmonic oscillator. 



Zero length orbits. Derive the classical 

trace (18.1) rigorously and either add the t — > 0 -1- zero 
length contribution to the trace formula, or show that it 
vanishes. Send us a reprint of Phys. Rev. Lett, with the 
correct derivation. 

Free particle semiclassical Green's functions. Cal- 
culate the semiclassical Green's functions for the sys- 
tems of exercise 33.13. 
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Chapter 34 



Semiclassical quantization 



(G. Vattay, G. Tanner and P. Cvitanovic) 



r "V Te derive here the Gutzwiller trace formula and the semiclassical zeta func- 
»/%/ tion, the central results of the semiclassical quantization of classically 
* * chaotic systems. In chapter 35 we will rederive these formulas for the 
case of scattering in open systems. Quintessential wave mechanics effects such as 
creeping, diffraction and tunneling will be taken up in chapter38. 



34.1 Trace formula 

Our next task is to evaluate the Green's function trace (31.15) in the semiclassical 
approximation. The trace 



receives contributions from "long" classical trajectories labeled by j which start 
and end in q after finite time, and the "zero length" trajectories whose lengths 
approach zero as c( — > q. 

First, we work out the contributions coming from the finite time returning 
classical orbits, i.e., trajectories that originate and end at a given configuration 
point q. As we are identifying q with q 1 ', taking of a trace involves (still another!) 
stationary phase condition in the cj — > q limit, 




OS j(q,q',E) 
dqt 



q'=q 



+ 



OS j(q,q',E) 



q'=q 



= 0, 
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Figure 34.1: A returning trajectory in the configura- 
tion space. The orbit is periodic in the full phase space 
only if the initial and the final momenta of a returning 
trajectory coincide as well. 




Figure 34.2: A romanticized sketch of S P (E) = 
S (q, q,E) = <j> p{q, E)dq landscape orbit. Unstable 
periodic orbits traverse isolated ridges and saddles of 
the mountainous landscape of the action S (q\,q ± ,E). 
Along a periodic orbit S P (E) is constant; in the trans- 
verse directions it generically changes quadratically. 



meaning that the initial and final momenta (33.40) of contributing trajectories 
should coincide 



Pi{q, q, E) - p'j(q, q, E) - 0 , q e jth periodic orbit , (34. 1) 

so the trace receives contributions only from those long classical trajectories which 
are periodic in the full phase space. 

For a periodic orbit the natural coordinate system is the intrinsic one, with q 
axis pointing in the q direction along the orbit, and q ± , the rest of the coordinates 
transverse to q. The y'fh periodic orbit contribution to the trace of the semiclassical 
Green's function in the intrinsic coordinates is 

where the integration in q\ goes from 0 to Lj, the geometric length of small tube 
around the orbit in the configuration space. As always, in the stationary phase ap- 
proximation we worry only about the fast variations in the phase Sj(q\\,q ± ,E), 
and assume that the density varies smoothly and is well approximated by its 
value D J ± (q\\,0, E) on the classical trajectory, q± = 0 . The topological index 
nij(q\\,q ± , E) is an integer which does not depend on the initial point q\ and not 
change in the infinitesimal neighborhood of an isolated periodic orbit, so we set 
ntj(E) = mj(q\\,q ± ,E). 

The transverse integration is again earned out by the stationary phase method, 
with the phase stationary on the periodic orbit, q ± = 0. The result of the transverse 
integration can depend only on the parallel coordinate 



detD ±J (4||,0,£) 



detD' xj (q h 0,E) 



1/2 
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where the new determinant in the denominator, det D 



-L/ 



, (d 2 S(q,q',E) d 2 S(q,q',E) d 2 S(q,q',E) d 2 S{q,q',E)" 
det — 1 , 1 — 1- 



dqudq 



dq'dq 



dq±idq' 



dq' n dq' A 



is the determinant of the second derivative matrix coming from the stationary 
phase integral in transverse directions. 

The ratio det D ±; /det D' ± j is here to enforce the periodic boundary condition 
for the semiclassical Green's function evaluated on a periodic orbit. It can be given 
a meaning in terms of the monodromy matrix of the periodic orbit by following 
observations 



det D ± 



detD' ± = 



dp' ± 




d(ql,p'J 


dq± 




d(q±,q' ± ) 



dp A 



dp'± dp ± 



dp' ± 



dq± dq ± dq' ± dq' ± 



d(p±-p' ± ,q± -q'J, 



d(q±,q'±) 



Defining the 2(D - l)-dimensional transverse vector x ± = (q±,p±) in the full 
phase space we can express the ratio 



detD^ 


d(p± - p' ± ,q± ~q' ± ) 




d(x ± - x'J 


detD ± 


d(q' ± ,p' ± ) 




dx' ± 



det (M - 1) , 



(34.2) 



in terms of the monodromy matrix M for a surface of section transverse to the 
orbit within the constant energy E = H(q, p) shell. 

The classical periodic orbit action Sj(E) = £ p(q\\,E)dq\\ is an integral around 
a loop defined by the periodic orbit, and does not depend on the starting point q 
along the orbit, see figure 34.2. The eigenvalues of the monodromy matrix are 
also independent of where Mj is evaluated along the orbit, so det (1 - Mj) can also 
be taken out of the q\\ integral 



|det(l -Mj)\V 2 



Here we have assumed that Mj has no marginal eigenvalues. The determinant 
of the monodromy matrix, the action S P (E) - £ p(q\\, E)dq\\ and the topological 
index are all classical invariants of the periodic orbit. The integral in the parallel 
direction we now do exactly. 

First, we take into account the fact that any repeat of a periodic orbit is also a 
periodic orbit. The action and the topological index are additive along the trajec- 
tory, so for rth repeat they simply get multiplied by r. The monodromy matrix of 
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the rth repeat of a prime cycle p is (by the chain rule for derivatives) M p , where 
M p is the prime cycle monodromy matrix. Let us denote the time period of the 
prime cycle p, the single, shortest traversal of a periodic orbit by T p . The remain- 
ing integral can be carried out by change of variables dt = dq\/q(t) 



r L "dq l= r T » 
Jo q(t) Jo 



dt = T p 



Note that the spatial integral corresponds to a single traversal. If you do not see 
why this is so, rethink the derivation of the classical trace formula (18.23) - that 
derivation takes only three pages of text. Regrettably, in the quantum case we do 
not know of an honest derivation that takes less than 30 pages. The final result, 
the Gutzwiller trace formula 

1 °° 1 

tvG sc (E) = trGo(E) + -Yt p Y ———— _ e K*S,-f m,) , (34.3) 
toy j~i Idet (1 -M r p )\ 1 ' 2 

an expression for the trace of the semiclassical Green's function in terms of peri- 
odic orbits, is beautiful in its simplicity and elegance. 

The topological index m p (E) counts the number of changes of sign of the ma- 
trix of second derivatives evaluated along the prime periodic orbit p. By now we 
have gone through so many stationary phase approximations that you have surely 
lost track of what the total m p (E) actually is. The rule is this: The topological 
index of a closed curve in a 2D phase space is the sum of the number of times 
the partial derivatives for each dual pair (g,-, p\), i — 1,2, ... ,D (no sum on i) 
change their signs as one goes once around the curve. 



34.1.1 Average density of states 

We still have to evaluate trGo(£"), the contribution coming from the infinitesimal 
trajectories. The real part of tiGo(E) is infinite in the q' —> q limit, so it makes 
no sense to write it down explicitly here. However, the imaginary part is finite, 
and plays an important role in the density of states formula, which we derive next. 

The semiclassical contribution to the density of states 01.15) is given by 
the imaginary part of the Gutzwiller trace formula (34.3) multiplied with -l/n. 
The contribution coming from the zero length trajectories is the imaginary part of 
(33.48) for q' — > q integrated over the configuration space 

d 0 (E) = -^-J d D qlmG 0 (q,q,E), 

The resulting formula has a pretty interpretation; it estimates the number of 
quantum states that can be accommodated up to the energy E by counting the 
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available quantum cells in the phase space. This number is given by the Weyl rule 
, as the ratio of the phase space volume bounded by energy E divided by fP, the 
volume of a quantum cell, 



N SC (E) = — d D pd D q Q(E - H(q, p)) . (34.4) 



where &(.x) is the Heaviside function (31.20). N SC (E) is an estimate of the spectral 
staircase (31.19), so its derivative yields the average density of states 

do(E) = —N SC {E) = — J d D pd D q5{E - H{q,p)) , (34.5) 

precisely the semiclassical result (34.6). For Hamiltonians of type p 2 /2m + 
V(q), the energy shell volume in (34.5) is a sphere of radius ^2m{E - V(q)). The 
surface of a cf-dimensional sphere of radius r is 7i c,/2 r d ~ 1 /r(<i/2), so the average 

exercise 34.2 

density of states is given by 

fi u 2 d n ul T{DI2) J V (q)<E 

and 

1 n D/2 C 
h u T(l +D/2) J V(q)<E 

Physically this means that at a fixed energy the phase space can support N SC (E) 
distinct eigenf unctions; anything finer than the quantum cell tP cannot be re- 
solved, so the quantum phase space is effectively finite dimensional. The average 
density of states is of a particularly simple form in one spatial dimension 



exercise 34.3 



T(E) 

do(E) = ^ > (34.8) 
2nn 



where T(E) is the period of the periodic orbit of fixed energy E. In two spatial 
dimensions the average density of states is 



*(£) = ^, (34.9) 



where J{{E) is the classically allowed area of configuration space for which V(q) < 
E. 



exercise 34.4 
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The semiclassical density of states is a sum of the average density of states and 
the oscillation of the density of states around the average, ds C {E) = do(E)+d osc (E), 
where 



1 V Y 1 c™(rS p (E)/h-rm p 7r/2) 
= J£ 2^' £ |det(l-Mp|^ (34 - 10) 



follows from the trace formula (34.3). 



34.1.2 Regularization of the trace 

The real part of the q 1 —> q zero length Green's function (33.48) is ultraviolet 
divergent in dimensions d > 1, and so is its formal trace 01-15). The short 
distance behavior of the real part of the Green's function can be extracted from 
the real part of (33.48) by using the Bessel function expansion for small z 



VV 1 §(ln(z/2) + y) forv = 0 



where y - 0.577... is the Euler constant. The real part of the Green's function for 
short distance is dominated by the singular part 



G sing (\q - q'\,E) 



—S^TQid - 2)/2)— Uj for d t 2 



2^(ln(2m(£ - V)\q - q'\HK) + y) fovd = 2 

The regularized Green's function 

G reg (q, q, E) = G(q, q , E) - G sing {\q - q'\, E) 

is obtained by subtracting the c[ — » q ultraviolet divergence. For the regularized 
Green's function the Gutzwiller trace formula is 



aG „ g m = -,W„< £ ) + - 2 T P £ |t , el( ,_ Mp| „ 2 - (34.1 1) 



Now you stand where Gutzwiller stood in 1990. You hold the trace formula in 
your hands. You have no clue how good is the h — > 0 approximation, how to 
take care of the sum over an infinity of periodic orbits, and whether the formula 
converges at all. 
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Figure 34.3: A sketch of how spectral determinants 
convert poles into zeros: The trace shows l/(E - E„) 
type singularities at the eigenenergies while the spec- 
tral determinant goes smoothly through zeroes. 




34.2 Semiclassical spectral determinant 

The problem with trace formulas is that they diverge where we need them, at 
the individual energy eigenvalues. What to do? Much of the quantum chaos 
literature responds to the challenge of wrestling the trace formulas by replacing 
the delta functions in the density of states (31.16) by Gaussians. But there is no 
need to do this - we can compute the eigenenergies without any further ado by 
remembering that the smart way to determine the eigenvalues of linear operators 
is by determining zeros of their spectral determinants. 

A sensible way to compute energy levels is to construct the spectral determin- 
ant whose zeroes yield the eigenenergies, det (H - E) sc - 0. A first guess might 
be that the spectral determinant is the Hadamard product of form 

det (H -E) = \\{E-E n ), 

n 

but this product is not well defined, since for fixed E we multiply larger and larger 
numbers (E - E n ). This problem is dealt with by regularization, discussed below 
in appendix 34.1.2. Here we offer an impressionistic sketch of regularization. 

The logarithmic derivative of det (H - E) is the (formal) trace of the Green's 
function 

n " 

This quantity, not surprisingly, is divergent again. The relation, however, opens a 
way to derive a convergent version of det(H - E) sc , by replacing the trace with 
the regularized trace 

~ In det (H - E) sc = tr G reg {E). 

The regularized trace still has 1 /(E -E n ) poles at the semiclassical eigenenergies, 
poles which can be generated only if det(// - E) sc has a zero at E - E,„ see 
figure 34.3. By integrating and exponentiating we obtain 

det(H-E) sc = exp(- f dE' trG reg (E')) 



traceSemid - 2mai200J 



ChaosBook.org versionl4.4. 1, Apr21 2013 



CHAPTER 34. SEMICLASSICAL QUANTIZATION 



663 



Now we can use (34.11) and integrate the terms coming from periodic orbits, 
using the relation (33.17) between the action and the period of a periodic orbit, 
dS P (E) = T p (E)dE, and the relation (31.19) between the density of states and the 
spectral staircase, dN sc {E) - do(E)dE. We obtain the semiclassical zeta function 



det (H 



E) sc = e inNAE) exp 



1 e ir(S p /%-m p 7T/2) 



r|det(l-Mp| 



1/2 



(34.12) 



chapter 20 



We already know from the study of classical evolution operator spectra of chap- 
ter 19 that this can be evaluated by means of cycle expansions. The beauty of this 
formula is that everything on the right side - the cycle action S p , the topological 
index m p and monodromy matrix M p determinant - is intrinsic, coordinate-choice 
independent property of the cycle p. 



34.3 One-dof systems 



It has been a long trek, a stationary phase upon stationary phase. Let us check 
whether the result makes sense even in the simplest case, for quantum mechanics 
in one spatial dimension. 

In one dimension the average density of states follows from the 1-dof form of 
the oscillating density (34.10) and of the average density (34.8) 

T (E) T (E} 

d{E) = 2rfT + 2 ~Sr cos(rS p(E)/n ~ rm p (£)7r/2) - (34 - 13) 



The classical particle oscillates in a single potential well with period T p (E). There 
is no monodromy matrix to evaluate, as in one dimension there is only the parallel 
coordinate, and no transverse directions. The r repetition sum in 04. 13) can be 
rewritten by using the Fourier series expansion of a delta spike train 



5{x-n)= Y ei2nkx = 1 + ^2cos(27ribc). 



We obtain 



T (E} 

d(E) = ^ 6{S p {E)l2n% - m p (E)/4 - n). (34.14) 



This expression can be simplified by using the relation (33.17) between T p and 
S p , and the identity (16.7) 8{x - x*) - \f'{x)\8{f{x)), where x* is the only zero of 
the function f{x*) - 0 in the interval under consideration. We obtain 

d(E) = Y,5(E- E n ), 
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where the energies E„ are the zeroes of the arguments of delta functions in (34.14) 

S p(E n )/2nfi - n - m p (E)/4, 

where m p {E) = m p = 2 for smooth potential at both turning points, and nip(E) = 
m p = 4 for two billiard (infinite potential) walls. These are precisely the Bohr- 
Sommerfeld quantized energies E„, defined by the condition 

p{q,E n )dq = h(n- r ^y (34.15) 

In this way the trace formula recovers the well known 1-dof quantization rule. 
In one dimension, the average of states can be expressed from the quantization 
condition. At E = E n the exact number of states is n, while the average number 
of states is n— 1/2 since the staircase function N(E) has a unit jump in this point 

N SC (E) = n - 1/2 = S p {E)l2nh - m p (E)/4 - 1/2. (34.16) 

The 1-dof spectral determinant follows from (34.12) by dropping the mon- 
odromy matrix part and using (34. 1 6) 

det (#-£).«. = eX p{-^S p + jm p )exp^- J] j ■ (34.17) 

Summation yields a logarithm by f/r = - ln(l - 1) and we get 

i „ imp at i c v m P 

det(H-E) sc - e -2» s p + — + 2(i - e ^ P -i— ) 
= 2sm(s p (E)/n-m p (E)/4) . 

So in one dimension, where there is only one periodic orbit for a given energy E, 
nothing is gained by going from the trace formula to the spectral determinant. The 
spectral determinant is a real function for real energies, and its zeros are again the 
Bohr-Sommerfeld quantized eigenenergies (34.15). 



34.4 Two-dof systems 

For flows in two configuration dimensions the monodromy matrix M p has two 
eigenvalues A p and l/A p , as explained in sect. 7.3. Isolated periodic orbits can 
be elliptic or hyperbolic. Here we discuss only the hyperbolic case, when the 
eigenvalues are real and their absolute value is not equal to one. The determinant 
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appearing in the trace formulas can be written in terms of the expanding eigen- 
value as 



|det(l-Mp| 1/2 = |A;| 1/2 (l-l/A;), 



and its inverse can be expanded as a geometric series 



|det 



J y 1 



/2 A kr- 



With the 2-dof expression for the average density of states (34.9) the spectral 
determinant becomes 



det (H - E) sc = 



■ m3KE 

e 2» 2 exp 



oo oo 



,ir(S p /%-m p 7r/2) 

Li Li Li r|Agl/2A* r 



= **nn 

p k=0 



p r=\ k=0 
1 - 



e n J v 2 m v 



|A P I 1/2 A^. 



(34.18) 



Resume 



Spectral determinants and dynamical zeta functions arise both in classical and 
quantum mechanics because in both the dynamical evolution can be described by 
the action of linear evolution operators on infinite-dimensional vector spaces. In 
quantum mechanics the periodic orbit theory arose from studies of semi-conductors, 
and the unstable periodic orbits have been measured in experiments on the very 
paradigm of Bohr's atom, the hydrogen atom, this time in strong external fields. 

In practice, most "quantum chaos" calculations take the stationary phase ap- 
proximation to quantum mechanics (the Gutzwiller trace formula, possibly im- 
proved by including tunneling periodic trajectories, diffraction corrections, etc.) 
as the point of departure. Once the stationary phase approximation is made, what 
follows is classical in the sense that all quantities used in periodic orbit calcu- 
lations - actions, stabilities, geometrical phases - are classical quantities. The 
problem is then to understand and control the convergence of classical periodic 
orbit formulas. 

While various periodic orbit formulas are formally equivalent, practice shows 
that some are preferable to others. Three classes of periodic orbit formulas are 
frequently used: 

Trace formulas. The trace of the semiclassical Green's function 



f 



dqG sc (q,q,E) 
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is given by a sum over the periodic orbits (34. 11). While easiest to derive, in cal- 
culations the trace formulas are inconvenient for anything other than the leading 
eigenvalue estimates, as they tend to be divergent in the region of physical interest. 
In classical dynamics trace formulas hide under a variety of appellations such as 
the f-a or multifractal formalism; in quantum mechanics they are known as the 
Gutzwiller trace formulas. 

Zeros of Rue lie or dynamical zeta functions 

i/«*) = n< 1 -^ *p = TTm^^ 12 
p |y v 

yield, in combination with cycle expansions, the semiclassical estimates of quan- 
tum resonances. For hyperbolic systems the dynamical zeta functions have good 
convergence and are a useful tool for determination of classical and quantum me- 
chanical averages. 

Spectral determinants, Selberg-type zeta functions, Fredholm determinants, 
functional determinants are the natural objects for spectral calculations, with con- 
vergence better than for dynamical zeta functions, but with less transparent cycle 
expansions. The 2-dof semiclassical spectral determinant (34.18) 

" ( JS p /h-imn r /2\ 

p k=o y |/v pl iy p ' 

is a typical example. Most periodic orbit calculations are based on cycle expan- 
sions of such determinants. 

As we have assumed repeatedly during the derivation of the trace formula that 
the periodic orbits are isolated, and do not form families (as is the case for inte- 
grable systems or in KAM tori of systems with mixed phase space), the formulas 
discussed so far are valid only for hyperbolic and elliptic periodic orbits. 

For deterministic dynamical flows and number theory, spectral determinants 
and zeta functions are exact. The quantum-mechanical ones, derived by the Gutzwiller 
approach, are at best only the stationary phase approximations to the exact quan- 
tum spectral determinants, and for quantum mechanics an important conceptual 
problem arises already at the level of derivation of the semiclassical formulas; how 
accurate are they, and can the periodic orbit theory be systematically improved? 



Commentary 

Remark 34.1 Gutzwiller quantization of classically chaotic systems. The deriva- 
tion given here and in sects. 33.3 and 34.1 follows closely the excellent exposition [ ] by 
Martin Gutzwiller, the inventor of the trace formula. The derivation presented here is self 
contained, but refs. [3, 1] might also be of help to the student. 
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Remark 34.2 Zeta functions. For "zeta function" nomenclature, see remark 19.4 on 
page 392. 
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Exercises 



34. 1 . Monodromy matrix from second variations of the ac- 
tion. Show that 



D±j/D' xj = (1 - M) 



(34.19) 



34.2. Volume of d-dimensional sphere. Show that the 
volume of a cZ-dimensional sphere of radius r equals 
7r c,/2 r d /r(l + d/2). Show that T(l + d/2) = T(d/2)d/2. 

34.3. Average density of states in 1 dimension. Show that 
in one dimension the average density of states is given 



by (34.8) 



d(E) 



T(E) 
2nh 



where T(E) is the time period of the 1 -dimensional mo- 
tion and show that 



N(E) 



S(E) 
2nh 



(34.20) 



where S (E) = j> p(q, E) dq is the action of the orbit. 

34.4. Average density of states in 2 dimensions. Show that 
in 2 dimensions the average density of states is given by 
(34.9) 



d(E) 



mME) 
2nh 2 



where tR(E) is the classically allowed area of configura- 
tion space for which U(q) < E. 
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Chapter 35 



Quantum scattering 



Scattering is easier than gathering. 
— Irish proverb 

(A. Wirzba, P. Cvitanovic and N. Whelan) 

So far the trace formulas have been derived assuming that the system under 
consideration is bound. As we shall now see, we are in luck - the semiclas- 
sics of bound systems is all we need to understand the semiclassics for open, 
scattering systems as well. We start by a brief review of the quantum theory of 
elastic scattering of a point particle from a (repulsive) potential, and then develop 
the connection to the standard Gutzwiller theory for bound systems. We do this 
in two steps - first, a heuristic derivation which helps us understand in what sense 
density of states is "density," and then we sketch a general derivation of the cen- 
tral result of the spectral theory of quantum scattering, the Krein-Friedel-Lloyd 
formula. The end result is that we establish a connection between the scatter- 
ing resonances (both positions and widths) of an open quantum system and the 
poles of the trace of the Green's function, which we learned to analyze in earlier 
chapters. 



35.1 Density of states 

For a scattering problem the density of states (31.16) appear ill defined since for- 
mulas such as (34.6) involve integration over infinite spatial extent. What we will 
now show is that a quantity that makes sense physically is the difference of two 
densities - the first with the scatterer present and the second with the scatterer 
absent. 

In non-relativistic dynamics the relative motion can be separated from the 
center-of-mass motion. Therefore the elastic scattering of two particles can be 
treated as the scattering of one particle from a static potential V{q). We will study 
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the scattering of a point-particle of (reduced) mass m by a short-range potential 
V(q), excluding inter alia the Coulomb potential. (The Coulomb potential decays 
slowly as a function of q so that various asymptotic approximations which apply 
to general potentials fail for it.) Although we can choose the spatial coordinate 
frame freely, it is advisable to place its origin somewhere near the geometrical 
center of the potential. The scattering problem is solved, if a scattering solution 
to the time-independent Schrodinger equation (??) 



can be constructed. Here E is the energy, p - fk the initial momentum of the 
particle, and k the corresponding wave vector. 

When the argument r = \q\ of the wave function is large compared to the typ- 
ical size a of the scattering region, the Schrodinger equation effectively becomes 
a free particle equation because of the short-range nature of the potential. In the 
asymptotic domain r » a, the solution (p^{q) of (35.1) can be written as superpo- 
sition of ingoing and outgoing solutions of the free particle Schrodinger equation 
for fixed angular momentum: 

(p(q) = A$~\q) + B(fr + '(q) , (+ boundary conditions) , 

where in 1-dimensional problems (ff"\q), 4>^ + \q) are the "left," "right" moving 
plane waves, and in higher-dimensional scattering problems the "incoming," "out- 
going" radial waves, with the constant matrices A, B fixed by the boundary con- 
ditions. What are the boundary conditions? The scatterer can modify only the 
outgoing waves (see figure 35.1), since the incoming ones, by definition, have yet 
to encounter the scattering region. This defines the quantum mechanical scattering 
matrix, or the S matrix 



All scattering effects are incorporated in the deviation of S from the unit matrix, 
the transition matrix T 



For concreteness, we have specialized to two dimensions, although the final for- 
mula is true for arbitrary dimensions. The indices m and ni are the angular mo- 
menta quantum numbers for the incoming and outgoing state of the scattering 
wave function, labeling the S -matrix elements S mm '. More generally, given a set 
of quantum numbers 8, y, the S matrix is a collection Sp y of transition amplitudes 
6 —> y normalized such that \Sp y \ 2 is the probability of the 8 — > y transition. The 




(35.1) 



4>m(f) = <pm \r) + S mm xf>) n ;(r) . 



(35.2) 



S - 1 - /T . 



(35.3) 
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Figure 35.1: (a) Incoming spherical waves run- 
ning into an obstacle, (b) Superposition of outgo- 
ing spherical waves scattered from an obstacle. 



(a) 



(b) 



total probability that the ingoing state j3 ends up in some outgoing state must add 
up to unity 



(35.4) 



so the S matrix is unitary: S^S = SS^ = 1. 

We have already encountered a solution to the 2-dimensional problem; free 
particle propagation Green's function (33.48) is a radial solution, given in terms 
of the Hankel function 



G 0 (r,0,E) = ~H^\kr), 



where we have used Sofa 0, E)/H = kr for the action. The with angular mo- 
mentum eigenfunction is proportional to <p* (q) oc (kr), and given a potential 
V(q) we can in principle compute the infinity of matrix elements S mm i. We will 
not need much information about H^ilcf), other than that for large r its asymptotic 
form is 



oc e 



±ikr 



In general, the potential V(q) is not radially symmetric and (35.1) has to be 
solved numerically, by explicit integration, or by diagonalizing a large matrix in 
a specific basis. To simplify things a bit, we assume for the time being that a 
radially symmetric scatterer is centered at the origin; the final formula will be 
true for arbitrary asymmetric potentials. Then the solutions of the Schrodinger 
equation (??) are separable, <p, n (q) = <p(r)e' m6 , r = \q\, the scattering matrix cannot 
mix different angular momentum eigenstates, and S is diagonal in the radial basis 
(35.2) with matrix elements given by 



S m (k) 



- e 2i6 m (k)_ 



(35.5) 
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The matrix is unitary so in a diagonal basis all entries are pure phases. This means 
that an incoming state of the form H^^\kr)e' me gets scattered into an outgoing state 
of the form S m (k)H„\kr)e m9 , where H^\z) are incoming and outgoing Hankel 
functions respectively. We now embed the scatterer in a infinite cylindrical well 
of radius R, and will later take R — » oo. Angular momentum is still conserved so 
that each eigenstate of this (now bound) problem corresponds to some value of m. 
For large r » a each eigenstate is of the asymptotically free form 

<t>m{r) * e imd (s m (k)H^(kr)+Hi;\kr)) 

« • • • cos(kr + 8 m (k) - Xm) , (35.6) 

where • • • is a common prefactor, andxm - mn/l+nlA is an annoying phase factor 
from the asymptotic expansion of the Hankel functions that will play no role in 
what follows. 

The state (35.6) must satisfy the external boundary condition that it vanish at 
r - R. This implies the quantization condition 

k n R + 8 m (k n ) -Xm =n(n + 12) . 

We now ask for the difference in the eigenvalues of two consecutive states of 
fixed m. Since R is large, the density of states is high, and the phase 8 n (k) does 
not change much over such a small interval. Therefore, to leading order we can 
include the effect of the change of the phase on state n + 1 by Taylor expanding, is 

k n+l R + 8 m {k n ) + (k n+ i - k n )8' m {k n ) - Xm ~n + n{n + 12) . 

Taking the difference of the two equations we obtain Ak « n(R + 8 m {k)Y l . This 
is the eigenvalue spacing which we now interpret as the inverse of the density of 
states within m angular momentum sbuspace 

d m (k) *-(R + S' m (k)) . 
n 

The R term is essentially the 1 - d Weyl term (34.8), appropriate to 1 - d radial 
quantization. For large R, the dominant behavior is given by the size of the circular 
enclosure with a correction in terms of the derivative of the scattering phase shift, 
approximation accurate to order 1/7?. However, not all is well: the area under 
consideration tends to infinity. We regularize this by subtracting from the result 
from the free particle density of states cfa(k), for the same size container, but this 
time without any scatterer, figure 35.2. We also sum over all m values so that 

d(k)-d 0 (k) = -Y s' m (k) 

n 

m 




(35.7) 
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Figure 35.2: The "difference" of two bounded refer- %/b b 

ence systems, one with and one without the scattering — j 

system. ^ * 



The first line follows from the definition of the phase shifts (35.5) while the second 
line follows from the unitarity of S so that S~ l - S We can now take the limit 
R — > co since the R dependence has been cancelled away. 

This is essentially what we want to prove since for the left hand side we al- 
ready have the semiclassical theory for the trace of the difference of Green's func- 
tions, 



d{k) - d Q (k) = -— Im(tr(G0fc) - G 0 (k)) . (35.8) 
Ink 

There are a number of generalizations. This can be done in any number of 
dimensions. It is also more common to do this as a function of energy and not 
wave number k. However, as the asymptotic dynamics is free wave dynamics 
labeled by the wavenumber k, we have adapted k as the natural variable in the 
above discussion. 

Finally, we state without proof that the relation (35.7) applies even when there 
is no circular symmetry. The proof is more difficult since one cannot appeal to the 
phase shifts 6 m but must work directly with a non-diagonal S matrix. 



35.2 Quantum mechanical scattering matrix 

The results of the previous section indicate that there is a connection between the 
scattering matrix and the trace of the quantum Green's function (more formally 
between the difference of the Green's function with and without the scattering 
center.) We now show how this connection can be derived in a more rigorous 
manner. We will also work in terms of the energy E rather than the wavenumber 
k, since this is the more usual exposition. Suppose particles interact via forces of 
sufficiently short range, so that in the remote past they were in a free particle state 
labeled (3, and in the distant future they will likewise be free, in a state labeled y. 
In the Heisenberg picture the S -matrix is defined as S = CLD.\. in terms of the 
M0ller operators 

n ± = lim e ^/n e -iH 0 t/n ^ (35 9) 

t— *±oo 

where H is the full Hamiltonian, whereas Hq is the free Hamiltonian. In the 
interaction picture the S -matrix is given by 

S = Q\a = lim e iHot/h e -2iHt/n e iH 0 t/n 

t—*co 
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= T exp 



dtH'(t)\ , 



(35.10) 



where H' = V = H - Hq is the interaction Hamiltonian and T is the time-ordering 
operator. In stationary scattering theory the S matrix has the following spectral 
representation 



■ f 

Jo 



dE S (E)S(Hq - E) 



S(E) = Q + (E)QZ l (E), Q ± (E) = l + (H 0 -E±iey i V , (35.11) 



such that 



Tr 



S\E)—S(E) 
dE 



= Tr 



Hn- E - ie H - E - ie 



{e «-» -e) 



.(35.12) 



The manipulations leading to (35.12) are justified if the operators Q±(E) can be 
linked to trace-class operators. 



appendix K 



We can now use this result to derive the Krein-Lloyd formula which is the 
central result of this chapter. The Krein-Lloyd formula provides the connection 
between the trace of the Green's function and the poles of the scattering matrix, 
implicit in all of the trace formulas for open quantum systems which will be pre- 
sented in the subsequent chapters. 



35.3 Krein-Friedel-Lloyd formula 



The link between quantum mechanics and semiclassics for scattering problems is 
provided by the semiclassical limit of the Krein-Friedel-Lloyd sum for the spectral 
density which we now derive. This derivation builds on the results of the last 
section and extends the discussion of the opening section. 

In chapter 33 we linked the spectral density (see (31.16)) of a bounded system 



d(E) = ^(En-E) 



(35.13) 



via the identity 



6{E n -E) = 



1 1 
hm - Im 

e-»0 7r E - E n + ie 

lim - Im^^l 

e— »0 jt 'E-H + ie 



\E n ) 



- — : lim (E„ 

2ni £->0 



1 



1 



E-H-ie E-H + ie 



(35.14) 
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to the trace of the Green's function (34.1.1). Furthermore, in the semiclassical 
approximation, the trace of the Green's function is given by the Gutzwiller trace 
formula (34. 1 1) in terms of a smooth Weyl term and an oscillating contribution of 
periodic orbits. 

Therefore, the task of constructing the semiclassics of a scattering system is 
completed, if we can find a connection between the spectral density d(E) and the 
scattering matrix S . We will see that (35. 12) provides the clue. Note that the right 
hand side of (35.12) has nearly the structure of (35.14) when the latter is inserted 
into (35.13). The principal difference between these two types of equations is that 
the S matrix refers to outgoing scattering wave functions which are not normal- 
izable and which have a continuous spectrum, whereas the spectral density d(E) 
refers to a bound system with normalizable wave functions with a discrete spec- 
trum. Furthermore, the bound system is characterized by a hermitian operator, 
the Hamiltonian H, whereas the scattering system is characterized by a unitary 
operator, the S -matrix. How can we reconcile these completely different classes 
of wave functions, operators and spectra? The trick is to put our scattering system 
into a finite box as in the opening section. We choose a spherical conatiner with 
radius R and with its center at the center of our finite scattering system. Our scat- 
tering potential V(r) will be unaltered within the box, whereas at the box walls we 
will choose an infinitely high potential, with the Dirichlet boundary conditions at 
the outside of the box: 



In this way, for any finite value of the radius R of the box, we have mapped our 
scattering system into a bound system with a spectral density d(E; R) over dis- 
crete eigenenergies E n {R). It is therefore important that our scattering potential 
was chosen to be short-ranged to start with. (Which explains why the Coulomb 
potential requires special care.) The hope is that in the limit R — > oo we will 
recover the scattering system. But some care is required in implementing this. 
The smooth Weyl term d(E; R) belonging to our box with the enclosed potential V 
diverges for a spherical 2-dimensional box of radius R quadratically, as n$ l{4n) 
or as R 3 in the 3-dimensional case. This problem can easily be cured if the spec- 
tral density of an empty reference box of the same size (radius R) is subtracted 
(see figure 35.2). Then all the divergences linked to the increasing radius R in 
the limit R — > oo drop out of the difference. Furthermore, in the limit R — > oo 
the energy-eigenfunctions of the box are only normalizable as a delta distribution, 
similarly to a plane wave. So we seem to recover a continous spectrum. Still the 
problem remains that the wave functions do not discriminate between incoming 
and outgoing waves, whereas this symmetry, namely the hermiticity, is broken in 
the scattering problem. The last problem can be tackled if we replace the spec- 
tral density over discrete delta distributions by a smoothed spectral density with a 
small finite imaginary part n in the energy E: 



<K?)\r=R = 0 - 



(35.15) 




(35.16) 
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Note that d(E + irj;R) ± d(E - in;R) = -d(E + in;R). By the introduction of the 
positive finite imaginary part 77 the time-dependent behavior of the wave function 
has effectively been altered from an oscillating one to a decaying one and the 
hermiticity of the Hamiltonian is removed. Finally the limit 77 — > 0 can be carried 
out, respecting the order of the limiting procedures. First, the limit R — > 00 has 
to be performed for a finite value of 77, only then the limit n — » 0 is allowed. In 
practice, one can try to work with a finite value of R, but then it will turn out (see 
below) that the scattering system is only recovered if R^Jrj » 1 . 

Let us summarize the relation between the smoothed spectral densities d(E + 
irj; R) of the boxed potential and cf \E + in; R) of the empty reference system and 
the S matrix of the corresponding scattering system: 



This is the Krein-Friedel-Lloyd formula. It replaces the scattering problem by 
the difference of two bounded reference billiards of the same radius R which fi- 
nally will be taken to infinity. The first billiard contains the scattering region or 
potentials, whereas the other does not (see figure 35.2). Here d(E + in;R) and 
<r '{E + irj;R) are the smoothed spectral densities in the presence or in the ab- 
sence of the scatterers, respectively. In the semiclassical approximation, they are 
replaced by a Weyl term (34. 10) and an oscillating sum over periodic orbits. As in 
(34.2), the trace formula (35.17) can be integrated to give a relation between the 
smoothed staircase functions and the determinant of the S -matrix: 



Furthermore, in both versions of the Krein-Friedel-Lloyd formulas the energy ar- 
gument E + in can be replaced by the wavenumber argument k+irj . These expres- 
sions only make sense for wavenumbers on or above the real &-axis. In particular, 
if k is chosen to be real, rf must be greater than zero. Otherwise, the exact left 
hand sides (35.18) and (35.17) would give discontinuous staircase or even delta 
function sums, respectively, whereas the right hand sides are continuous to start 
with, since they can be expressed by continuous phase shifts. Thus the order of 
the two limits in (35.18) and (35.17) is essential. 

The necessity of the +irj prescription can also be understood by purely phe- 
nomenological considerations in the semiclassical approximation: Without the ir\ 
term there is no reason why one should be able to neglect spurious periodic or- 
bits which are there solely because of the introduction of the confining boundary. 
The subtraction of the second (empty) reference system removes those spurious 
periodic orbits which never encounter the scattering region - in addition to the re- 
moval of the divergent Weyl term contributions in the limit R — > 00. The periodic 
orbits that encounter both the scattering region and the external wall would still 




- — Tr— In S(E) 
2ni dE 



-i- -^-lndetS(£). (35.17) 



2m dE 




(35.18) 
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survive the first limit R — > oo, if they were not exponentially suppressed by the 
+irj term because of their 



behavior. As the length L(R) of a spurious periodic orbit grows linearly with the 
radius R. The bound Rrf » 1 is an essential precondition on the suppression of 
the unwanted spurious contributions of the container if the Krein-Friedel-Lloyd 
formulas (35.17) and (35.18) are evaluated at a finite value of R. 



Finally, the semiclassical approximation can also help us in the interpretation 
of the Weyl term contributions for scattering problems. In scattering problems the 
Weyl term appears with a negative sign. The reason is the subtraction of the empty 
container from the container with the potential. If the potential is a dispersing bil- 
liard system (or a finite collection of dispersing billiards), we expect an excluded 
volume (or the sum of excluded volumes) relative to the empty container. In other 
words, the Weyl term contribution of the empty container is larger than of the 
filled one and therefore a negative net contribution is left over. Second, if the 
scattering potential is a collection of a finite number of non-overlapping scatter- 
ing regions, the Krein-Friedel-Lloyd formulas show that the corresponding Weyl 
contributions are completely independent of the position of the single scatterers, 
as long as these do not overlap. 



35.4 Wigner time delay 

The term ^ bidets in the density formula (35.17) is dimensionally time. This 
suggests another, physically important interpretation of such formulas for scatter- 
ing systems, the Wigner delay, defined as 



and can be shown to equal the total delay of a wave packet in a scattering system. 
We now review this fact. 

A related quantity is the total scattering phase shift Q(k) defined as 



e iL(R) ^2m(E+irj) _ jUR)*. ^-Z-W 



exercise 35.1 



d(k) = — Argdet(S(*)) 
dk 

= -i— logdet(S(£) 




(35.19) 



det S(k) = e 



,+ie(k) 



so that d(k) = 4®{k). 
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The time delay may be both positive and negative, reflecting attractive re- 
spectively repulsive features of the scattering system. To elucidate the connection 
between the scattering determinant and the time delay we study a plane wave: 

The phase of a wave packet will have the form: 

<p = k ■ x— cot + 0 . 

Here the term in the parenthesis refers to the phase shift that will occur if scattering 
is present. The center of the wave packet will be determined by the principle of 
stationary phase: 

0 = d(p — dk ■ x - dco t + d® . 

Hence the packet is located at 



dco_ _d& 
dk dk 



The first term is just the group velocity times the given time t. Thus the packet is 
retarded by a length given by the derivative of the phase shift with respect to the 
wave vector k. The arrival of the wave packet at the position x will therefore be 
delayed. This time delay can similarly be found as 



dQ(aj) 
t(oj) = — . 

OOJ 



To show this we introduce the slowness of the phase s = k/a> for which s • v g = 1, 
where v ? is the group velocity to get 



dk - x - s- xdaj = — da) , 



since we may assume x is parallel to the group velocity (consistent with the 
above). Hence the arrival time becomes 

x d@(co) 
t = — + . 

v g OOJ 

If the scattering matrix is not diagonal, one interprets 



,dSij\ (d&ij 
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as the delay in the y'th scattering channel after an injection in the ith. The proba- 
bility for appearing in channel j goes as |5; ; | 2 and therefore the average delay for 
the incoming states in channel i is 



where we have used the derivative, d/dco, of the unitarity relation S • fs — 1 valid 
for real frequencies. This discussion can in particular be made for wave packets 
related to partial waves and superpositions of these like an incoming plane wave 
corresponding to free motion. The total Wigner delay therefore corresponds to the 
sum over all channel delays (35.19). 

Commentary 

Remark 35.1 Krein-Friedel-Lloyd formula. The third volume of Thirring [ I], sections 
3.6.14 (Levison Theorem) and 3.6.15 (the proof), or P. Scherer's thesis [ 15] (appendix) 
discusses the Levison Theorem. 

It helps to start with a toy example or simplified example instead of the general the- 
orem, namely for the radially symmetric potential in a symmetric cavity. Have a look at 
the book of K. Huang, chapter 10 (on the "second virial coefficient"), or Beth and Uh- 
lenbeck [5], or Friedel [7]. These results for the correction to the density of states are 
particular cases of the Krein formula [3]. The Krein-Friedel-Lloyd formula (35.17) was 
derived in refs. [3, 7, 8, 9], see also refs. [11, 14, 15, 17, 18]. The original papers are by 
Krein and Birman [3, 4] but beware, they are mathematicans. 

Also, have a look at pages 15-18 of Wirzba's talk on the Casimir effect [ ]. Page 
16 discusses the Beth-Uhlenbeck formula [5], the predecessor of the more general Krein 
formula for spherical cases. 

Remark 35.2 Weyl term for empty container. For a discussion of why the Weyl term 
contribution of the empty container is larger than of the filled one and therefore a negative 
net contribution is left over, see ref. [15]. 

Remark 35.3 Wigner time delay. Wigner time delay and the Wigner-Smith time 
delay matrix, are powerful concepts for a statistical description of scattering. The diagonal 
elements Q aa of the lifetime matrix Q = -iS~ l dS/du>, where S is the [2Nx2N] scattering 
matrix, are interpreted in terms of the time spent in the scattering region by a wave packet 
incident in one channel . As shown by S mith [ ] , they are the sum over all ouput channels 
(both in reflection and transmission) of At a t, = Re [(— i / S a b)(dS a bl dot)] weighted by the 
probability of emerging from that channel. The sum of the Q aa over all 2N channels 
is the Wigner time delay tw = £ fl Q aa , which is the trace of the lifetime matrix and is 
proportional to the density of states. 
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Exercises 



35.1. Spurious orbits under the Krein-Friedel-Lloyd con- 
traction. Draw examples for the three types of period 
orbits under the Krein-Friedel-Lloyd construction: (a) 
the genuine periodic orbits of the scattering region, (b) 
spurious periodic orbits which can be removed by the 
subtraction of the reference system, (c) spurious peri- 
odic orbits which cannot be removed by this subtraction. 
What is the role of the double limit 77 — * 0, container size 
b -> 00? 





R 



35.2. The one-disk scattering wave function. 

one-disk scattering wave function. 



Derive the 



35.3. Quantum two-disk scattering. 

classical spectral determinant 



Z(s) = [ 



pJJ 



1 - 



A 



J+21 



(Andreas Wirzba) 



Compute the quasi- 



for the two disk problem. Use the geometry 



The full quantum mechanical version of this problem 
can be solved by finding the zeros in k for the deter- 
minant of the matrix 



M„ 



(-1)" J m (ka) (um 
2 H^\ka) 



where J„ is the nth Bessel function and H„ is the Han- 
kel function of the first kind. Find the zeros of the de- 
terminant closest to the origin by solving det M(k) = 0. 
(Hints: note the structure M = / + A to approximate the 
determinant; or read Chaos 2, 79 (1992)) 

35.4. Pinball topological index. Upgrade your pinball sim- 
ulator so that it computes the topological index for each 
orbit it finds. 
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Chapter 36 



Chaotic multiscattering 



(A. Wirzba and P. Cvitanovic) 

We discuss here the semiclassics of scattering in open systems with a finite 
number of non-overlapping finite scattering regions. Why is this inter- 
esting at all? The semiclassics of scattering systems has five advantages 
compared to the bound-state problems such as the helium quantization discussed 
in chapter 37. 

• For bound-state problem the semiclassical approximation does not respect 
quantum-mechanical unitarity, and the semi-classical eigenenergies are not 
real. Here we construct a manifestly unitary semiclassical scattering matrix. 

• The Weyl-term contributions decouple from the multi-scattering system. 

• The close relation to the classical escape processes discussed in chapter 1 . 

• For scattering systems the derivation of cycle expansions is more direct and 
controlled than in the bound-state case: the semiclassical cycle expansion 
is the saddle point approximation to the cumulant expansion of the determi- 
nant of the exact quantum-mechanical multi-scattering matrix. 

• The region of convergence of the semiclassical spectral function is larger 
than is the case for the bound-state case. 

We start by a brief review of the elastic scattering of a point particle from finite 
collection of non-overlapping scattering regions in terms of the standard textbook 
scattering theory, and then develop the semiclassical scattering trace formulas and 
spectral determinants for scattering off N disks in a plane. 
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36.1 Quantum mechanical scattering matrix 

We now specialize to the elastic scattering of a point particle from finite collection 
of N non-overlapping reflecting disks in a 2-dimensional plane. As the point par- 
ticle moves freely between the static scatterers, the time-independent Schrodinger 
equation outside the scattering regions is the Helmholtz equation: 

(V;: + k 2 ^ if/(r) = 0 , f outside the scattering regions. (36.1) 

Here ij/{r) is the wave function of the point particle at spatial position f and E = 
h k /2m is its energy written in terms of its mass m and the wave vector & of the 
incident wave. For reflecting wall billiards the scattering problem is a boundary 
value problem with Dirichlet boundary conditions: 

t//(r) = 0 , f on the billiard perimeter (36.2) 

As usual for scattering problems, we expand the wave function tp{f ) in the 
(2-dimensional) angular momentum eigenfunctions basis 



m = £ tiff )e- im<S>k , (36.3) 

m=—co 

where k and cD^ are the length and angle of the wave vector, respectively. A plane 
wave in two dimensions expaned in the angular momentum basis is 

CO 

= e i*rcos(<iv-%) = ^ J m (kr)e im ^-^ , (36.4) 



where r and (D r denote the distance and angle of the spatial vector f as measured 
in the global 2-dimensional coordinate system. 

The mth angular component J m (kr)e" n ®' of a plane wave is split into a super- 
position of incoming and outgoing 2-dimensional spherical waves by decompos- 
ing the ordinary Bessel function J m (z) into the sum 

Jm{z) = \(H^{z) + H^{z)) (36.5) 

of the Hankel functions H^\z) and H m (z) of the first and second kind. For |z| » 1 
the Hankel functions behave asymptotically as: 



H%\z) - J— e-^-^-V incoming, 

V nz 

Hn\z) ~ J—e +ii ^ m -^ outgoing. (36.6) 

V nz. 
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Thus for r — > oo and & fixed, the mth angular component J m {kr)e r of the plane 
wave can be written as superposition of incoming and outgoing 2-dimensional 
spherical waves: 



-i(kr- 



■V + e i{kr - 



(36.7) 



In terms of the asymptotic (angular momentum) components $ m of the wave 
function ifr(r), the scattering matrix (35.3) is defined as 

1 00 

~ -=!= X [W^^-^ + S^^-^'-D] ^ . (36.8) 

The matrix element describes the scattering of an incoming wave with an- 
gular momentum m into an outgoing wave with angular momentum rrl . If there 
are no scatterers, then S = 1 and the asymptotic expression of the plane wave ^ 
in two dimensions is recovered from tA(r ). 



36.1.1 1-disk scattering matrix 

In general, S is nondiagonal and nonseparable. An exception is the 1-disk scat- 
terer. If the origin of the coordinate system is placed at the center of the disk, by 
(36.5) the mth angular component of the time-independent scattering wave func- 
tion is a superposition of incoming and outgoing 2-dimensional spherical waves 



fa = ^(H%\kr) + S mm Hil\kr))e im ^ 
— ^Jm(kr) — — T mm H m \kr)*J e 



imfS> r 



The vanishing (36.2) of the wave function on the disk perimeter 



0 = J m (ka) - -T mm H m '(ka) 



yields the 1-disk scattering matrix in analytic form: 



SLAV = i 



2J m (ka s ) 



1 Hg\ka s ) 



H%\ka s ) 
H%\ka s ) 



exercise 35.2 



Km' , (36.9) 



where a = a s is radius of the disk and the suffix s indicates that we are dealing 
with a disk whose label is s. We shall derive a semiclassical approximation to this 
1-disk S-matrix in sect. 36.3. 
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36.1.2 Multi-scattering matrix 

Consider next a scattering region consisting of N non-overlapping disks labeled 
s € {1,2, •• • ,N}, following the notational conventions of sect. 1 1.6. The strategy 
is to construct the full T-matrix (35.3) from the exact 1-disk scattering matrix 
(36.9) by a succession of coordinate rotations and translations such that at each 
step the coordinate system is centered at the origin of a disk. Then the T-matrix 
in S mm > = 8 mm ' - i T mm > can be split into a product over three kinds of matrices, 



7W(*)=£ £ cj(k)M-\k)™;x, m .(k). 



=1 h,U-- 



The outgoing spherical wave scattered by the disk s is obtained by shifting the 
global coordinates origin distance R s to the center of the disk s, and measuring 
the angle O s with respect to direction k of the outgoing spherical wave. As in 
(36.9), the matrix C s takes form 



c ; =— Jm ~f R ^ ^_ (3610) 

s ™ s Hf\ka s ) 



If we now describe the ingoing spherical wave in the disk / coordinate frame by 
the matrix D 4 



£>f /m , = -na sl J m ^i sl {kR s -)Ji sl {ka sl )e- im '^' , (36.1 1) 



and apply the Bessel function addition theorem 

oo 

J m (y + z) = ^ J m-f(y)J((z), 



we recover the T-matrix (36.9) for the single disk s = s' , M - 1 scattering. The 
Bessel function sum is a statement of the completness of the spherical wave basis; 
as we shift the origin from the disk s to the disk ^ by distance R s > , we have to 
reexpand all basis functions in the new coordinate frame. 

The labels m and m' refer to the angular momentum quantum numbers of the 
ingoing and outgoing waves in the global coordinate system, and 1 , l s > refer to the 
(angular momentum) basis fixed at the sth and th scatterer, respectively. Thus, 
C s and D s depend on the origin and orientation of the global coordinate system 
of the 2-dimensional plane as well as on the internal coordinates of the scatterers. 
As they can be made separable in the scatterer label s, they describe the single 
scatterer aspects of what, in general, is a multi-scattering problem. 
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Figure 36.1: Global and local coordinates for a gen 
eral 3-disk problem. 



The matrix M is called the multi-scattering matrix. If the scattering problem 
consists only of one scatterer, M is simply the unit matrix t = 6 SS Si s i s> . 
For scattering from more than one scatterer we separate out a "single traversal" 
matrix A which transports the scattered wave from a scattering region A% to the 
scattering region M s >, 



The matrix A ss ' reads: 

A".' =-(i-^)^L Jl '^ a ' ) H?\ (kR ss >)e^- 1 ^'-^. (36.13) 
a s , H) l) (ka s/ ) s 

's' 

Here, a s is the radius of the 5th disk. R s and ®. v are the distance and angle, 
respectively, of the ray from the origin in the 2-dimensional plane to the center of 
disk s as measured in the global coordinate system. Furthermore, Rs S > = R s > s is 
the separation between the centers of the sth and s' th disk and a s > s of the ray from 
the center of disk s to the center of disk s 1 as measured in the local (body-fixed) 
coordinate system of disk s (see figure 36.1). 

Expanded as a geometrical series about the unit matrix 1, the inverse matrix 
M _1 generates a multi-scattering series in powers of the single-traversal matrix A. 
All genuine multi-scattering dynamics is contained in the matrix A; by construc- 
tion A vanishes for a single-scatterer system. 



36.2 A/-scatterer spectral determinant 

In the following we limit ourselves to a study of the spectral properties of the S- 
matrix: resonances, time delays and phase shifts. The resonances are given by the 
poles of the S-matrix in the lower complex wave number (k) plane; more precisely, 
by the poles of the S on the second Riemann sheet of the complex energy plane. 
As the S-matrix is unitary, it is also natural to focus on its total phase shift rj(k) 
defined by detS = exp 2 "'®. The time-delay is proportional to the derivative of 
the phase shift with respect to the wave number k. 
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As we are only interested in spectral properties of the scattering problem, it 
suffices to study det S. This determinant is basis and coordinate-system indepen- 
dent, whereas the S-matrix itself depends on the global coordinate system and on 
the choice of basis for the point particle wave function. 

As the S-matrix is, in general, an infinite dimensional matrix, it is not clear 
whether the corresponding determinant exists at all. If T-matrix is trace-class, the 
determinant does exist. What does this mean? 



36.2.1 Trace-class operators 

An operator (an infinite-dimensional matrix) is called trace-class if and only if, 
for any choice of orthonormal basis, the sum of the diagonal matrix elements 
converges absolutely; it is called "Hilbert-Schmidt," if the sum of the absolute 
squared diagonal matrix elements converges. Once an operator is diagnosed as 
trace-class, we are allowed to manipulate it as we manipulate finite-dimensional 
matrices. We review the theory of trace-class operators in appendix K; here we 
will assume that the T-matrix (35.3) is trace-class, and draw the conlusions. 

If A is trace-class, the determinant det (1 - zA), as defined by the cumulant 
expansion, exists and is an entire function of z. Furthermore, the determinant is 
invariant under any unitary transformation. 

The cumulant expansion is the analytical continuation (as Taylor expansion in 
the book-keeping variable z) of the determinant 



det (1 - zA) = exp[tr ln(l - zA)] - exp 



z: 



n=\ 



That means 



det 



(l-zA):=J]z m Q m (A), (36.14) 



m=0 



where the cumulants Q m (A) satisfy the Plemelj -Smithies recursion formula (K.19), 
a generalization of Newton's formula to determinants of infinite-dimensional ma- 
trices, 

Go(A) - 1 

1 '" 

G»(A) = V G m _;(A)tr(A;') form>l, (36.15) 

in terms of cumulants of order n < m and traces of order n < m. Because of the 
trace-class property of A, all cumulants and traces exist separately. 
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For the general case ofN < oo non-overlapping scatterers, the T-matrix can be 
shown to be trace-class, so the determinant of the S-matrix is well defined. What 
does trace-class property mean for the corresponding matrices C , D v and A ss ? 
Manipulating the operators as though they were finite matrices, we can perform 
the following transformations: 



detS = det (l - iCM -1 D) 

= Det (l - /M _1 DC) - Det (m~'(M - /DC)) 
Det (M - /DC) 
Det(M) 



(36.16) 



In the first line of (36.16) the determinant is taken over small I (the angular mo- 
mentum with respect to the global system). In the remainder of (36.16) the deter- 
minant is evaluated over the multiple indices = (s, l s ). In order to signal this 
difference we use the following notation: det . . . and tr . . . refer to the \€) space, 
Det . . . and Tr . . . refer to the multiple index space. The matrices in the multiple 
index space are expanded in the complete basis {|L,)j = which refers for 

fixed index s to the origin of the sth scatterer and not any longer to the origin of 
the 2-dimensional plane. 

Let us explicitly extract the product of the determinants of the subsystems 
from the determinant of the total system (36.16): 



detS = 



Det (M - /DC) 

Det(M) 
Det (M- /DC) nf=idetS s 



DetM 



* N detS 1 



n? =1 

(" \ Det (M - /DC)/ n sLi det S s 

I detS s 

4 = 1 



DetM 



(36.17) 



The final step in the reformulation of the determinant of the S-matrix of the N- 
scatterer problem follows from the unitarity of the S-matrix. The unitarity of 
S^(&*) implies for the determinant 

det (S(k*) f ) = 1/det S(Jt) , (36.18) 



where this manipulation is allowed because the T-matrix is trace-class. The uni- 
tarity condition should apply for the S-matrix of the total system, S, as for the 
each of the single subsystems, S J , s = 1, • • • , N. In terms of the result of (36.17), 
this implies 



Det(M(ik)-/D(jk)C(jk)) . + 
v = Det(M(F) t ) 

IlLdetS' 
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since all determinants in (36.17) exist separately and since the determinants det S v 
respect unitarity by themselves. Thus, we finally have 



where all determinants exist separately. 

In summary: We assumed a scattering system of a finite number of non- 
overlapping scatterers which can be of different shape and size, but are all of 
finite extent. We assumed the trace-class character of the T-matrix belonging to 
the total system and of the single-traversal matrix A and finally unitarity of the 
S-matrices of the complete and all subsystems. 

What can one say about the point-particle scattering from a finite number of 
scatterers of arbitrary shape and size? As long as each of N < oo single scatterers 
has a finite spatial extent, i.e., can be covered by a finite disk, the total system 
has a finite spatial extent as well. Therefore, it too can be put insided a circular 
domain of finite radius b, e.g., inside a single disk. If the impact parameter of the 
point particle measured with respect to the origin of this disk is larger than the disk 
size (actually larger than (e/2) x b), then the T matrix elements of the A^-scatterer 
problem become very small. If the wave number k is kept fixed, the modulus of 
the diagonal matrix elements, \T mm \ with the angular momentum m > (e/2)kb, is 
bounded by the corresponding quantity of the covering disk. 

36.2.2 Quantum cycle expansions 

In formula (36. 19) the genuine multi-scattering terms are separated from the single- 
scattering ones. We focus on the multi-scattering terms, i.e., on the ratio of the 
determinants of the multi-scattering matrix M = 1 - A in (36.19), since they are 
the origin of the periodic orbit sums in the semiclassical reduction. The reso- 
nances of the multi-scattering system are given by the zeros of Det M(k) in the 
lower complex wave number plane. 

In order to set up the problem for the semiclassical reduction, we express the 
determinant of the multi-scattering matrix in terms of the traces of the powers 
of the matrix A, by means of the cumulant expansion (36.14). Because of the 
finite number N > 2 of scatterers tr(A") receives contributions corresponding to 
all periodic itineraries S1S2S3 • • • s„-is n of total symbol length n with an alphabet 
si e {1,2, ...,N}. of N symbols, 




(36.19) 



trA^A' 52 * 3 •••A 4 ' 



(36.20) 




's 1 =- 00 'i 2 =-° 0 



multscal - 25jul2006 



ChaosBook.org versionl4.4.1, Apr 21 2013 



CHAPTER 36. CHAOTIC MULTISCATTERING 



692 



Remember our notation that the trace tr(- • •) refers only to the |/) space. By con- 
struction A describes only scatterer-to-scatterer transitions, so the symbolic dy- 
namics has to respect the no-self-reflection pruning rule: for admissible itineraries 
the successive symbols have to be different. This rule is implemented by the factor 
1 -6"' in (36.13). 

The trace tr A" is the sum of all itineraries of length n, 



We will show for the Af-disk problem that these periodic itineraries correspond 
in the semiclassical limit, ka Sj » 1, to geometrical periodic orbits with the same 
symbolic dynamics. 

For periodic orbits with creeping sections the symbolic alphabet has to be 
extended, see sect. 36.3.1. Furthermore, depending on the geometry, there might 
be nontrivial pruning rules based on the so called ghost orbits, see sect.36.4.1. 

36.2.3 Symmetry reductions 

The determinants over the multi-scattering matrices run over the multiple index L 
of the multiple index space. This is the proper form for the symmetry reduction 
(in the multiple index space), e.g., if the scatterer configuration is characterized 
by a discrete symmetry group G, we have 



where the index a runs over all conjugate classes of the symmetry group G and 
D a is the ath representation of dimension da. The symmetry reduction on the 
exact quantum mechanical level is the same as for the classical evolution oper- 
ators spectral determinant factorization (21.17) of sect. 21.4.2. 



36.3 Semiclassical 1-disk scattering 

We start by focusing on the single-scatterer problem. In order to be concrete, we 
will consider the semiclassical reduction of the scattering of a single disk in plane. 

Instead of calculating the semiclassical approximation to the determinant of 
the one-disk system scattering matrix (36.9), we do so for 




(36.21) 



[S\S7-S„ 



DetM = ]~](detM Da (*))<*<• , 



(T 




(36.22) 
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the so called time delay. 



= i7 tr ( lndet S (*a)J - — > — ~ -77—777 

a sr- 
2ni2-J 



'H^'ika) H% y (ka) 
K H%\ka) H%\ka) 



(36.23) 



Here the prime denotes the derivative with respect to the argument of the Hankel 
functions. Let us introduce the abbreviation 

H^'Qco) H ( v l) \ka) 
Xr = ~m m • (36.24) 

We apply the Watson contour method to (36.23) 

d(k) = -L V Xm = -L (h d y ——Xv ■ (36.25) 
2/n ^— 1 2/n 2/ J> sin(v7r) 



Here the contour C encircles in a counter-clockwise manner a small semiinfinite 
strip D which completely covers the real v-axis but which only has a small finite 
extent into the positive and negative imaginary v direction. The contour C is then 
split up in the path above and below the real v-axis such that 



d(k) = dv-—-v v - dv 

4m [J-oo+i'e Sin(v7T) J-oo-ie 



e -,vn 



sin(v7r) 

Then, we perform the substitution v — > -v in the second integral so as to get 

a f ^+oo +i e e -i VJT e+ ivn \ 

&(k) = -r-{\ dv v + dv -x -v\ 

4n [J-oo+ie sm(v7r) sm(v7r) J 

- M 2 L„ iv T^*» + h ivx '\- <36 ' 26) 

where we used the fact that^_ v = Xv The contour in the last integral can be de- 
formed to pass over the real v-axis since its integrand has no Watson denominator. 

We will now approximate the last expression semiclassically, i.e., under the 
assumption ka » 1. As the two contributions in the last line of (36.26) differ by 
the presence or absence of the Watson denominator, they will have to be handled 
semiclassically in different ways: the first will be closed in the upper complex 
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plane and evaluated at the poles of Xv, the second integral will be evaluated on the 
real v-axis under the Debye approximation for Hankel functions. 

We will now work out the first term. The poles of Xv in the upper complex 
plane are given by the zeros of H^\ka) which will be denoted by V((ka) and by 
the zeros of H^\ka) which we will denote by -vc(ka), i = 1, 2, 3, • • •. In the Airy 
approximation to the Hankel functions they are given by 

V[{ka) = ka + ia((ka) , (36.27) 
-V[{ka) = -ka + i(at(k*a))* = - (v((k*a))* , (36.28) 



with 



ia c {ka) = e 3 — q c 



ka 



1/3 



ka 



1/3 q ] 



1 



180 lOka 



30, 



+ e'3\ — 



1 



ka 3150 



29q f 2Slq 



6 2 180 • 6 3 



(36.29) 



Here qc labels the zeros of the Airy integral 

poo 

A(q) = I dr cos^r - t 3 ) = 3~ 1/3 7rAi(-3~ 1/3 g) , 
Jo 

with Ai(z) being the standard Airy function; approximately, q « 6^ 3 [3^(^ - 
l/4)] 2 ^ 3 /2. In order to keep the notation simple, we will abbreviate V{ = V{{ka) 
and V[ = V[(ka). Thus the first term of (36.26) becomes finally 

a f r^+oc+ie e 2ivn \ ™ / g 2/v f w e ~2iv t n \ 

In the second term of (36.26) we will insert the Debye approximations for the 
Hankel functions: 



H^l 2 \x) ~ * \ — z exp (±i Vx 2 - v 2 + iv arccos — + i-\ for \x\ > v 

(36.30) 

— exp f - Vv 2 - x 2 + vArcCosh- ) for \x\ < v . 
ttVv 2 -* 2 V 



4 1/2) w ~ *i 



Note that for v > ka the contributions in Xv cancel. Thus the second integral of 
(36.26) becomes 



+ka 



2m J_ 0O 2m J_ fca a dk \ 



a 2 - v 2 - v arccos — I + 
ka) 



C ka i a 2 

= -— \ dv^Jk 2 a 2 -v 2 + ... = -— jfe + ... , (36.31) 
for J-fca 2 
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where • • • takes care of the polynomial corrections in the Debye approximation 
and the boundary correction terms in the v integration. 

In summary, the semiclassical approximation to d(k) reads 

d(k) = 2a > — + — -—* + ••• . 

Using the definition of the time delay (36.22), we get the following expression for 
detS 1 ^): 

lndetS'Ota) - lim lndetS'^oa) (36.32) 



n • (**r ak ^ 
= 2ma dk -— + 2} 

Jo 2 t—l I 1 — e '27Tv e (ka) I _ e -i2nv c (ka) 

~ -2niN(k)+lYt d^4 j-ln(l-e' 2;rVf(fa ' ) ) + ln(l-e- , ' 2 ^ (M )) + --- 
, =1 Jo 

where in the last expression it has been used that semiclassically ^V((ka) ~ 
4rVt(ka) ~ a and that the Weyl term for a single disk of radius a goes like 
N(k) = na 2 k 2 l(4n) + • • • (the next terms come from the boundary terms in the 
y-integration in (36.31)). Note that for the lower limit, ko — > 0, we have two 
simplifications: First, 

lim S^ m ,(koa) = lim — — <W = 1 X 8 mm > Vm,m 

ko^o k a ^o H%\koa) 

— > lim det S 1 ^^) - 1 • 

ko^O 

Secondly, for ko —> 0, the two terms in the curly bracket of (36.32) cancel. 



gi2nvc(ka) g—i2nv[(ka) 
+ 



+ 



36.3.1 1-disk spectrum interpreted; pure creeping 

To summarize: the semiclassical approximation to the determinant S l (ka) is given 
by 



]~[ 00 (\ - e -2inv e (ka)\ 2 

det S\ka) ~ e -a*mi±i± L , (36 .33) 

UZi( l - e2i7TVt{ka) ) 

with 

v c {ka) = ka + ia c (ka) = ka + <? +OT/3 (&a/6) 1/3 ^ + 
v ( (ka) = ka- i(a { (k*a))* = ka + e _i7r / 3 (to/6) 1/3 ^ + 
- (ve(k*a))* 
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Figure 36.2: Right- and left-handed diffractive 
creeping paths of increasing mode number i for 
a single disk. 





and N(ka) = (na 2 k 2 )/4n + ■ ■ ■ the leading term in the Weyl approximation for 
the staircase function of the wavenumber eigenvalues in the disk interior. From 
the point of view of the scattering particle, the interior domains of the disks are 
excluded relatively to the free evolution without scattering obstacles. Therefore 
the negative sign in front of the Weyl term. For the same reason, the subleading 
boundary term has here a Neumann structure, although the disks have Dirichlet 
boundary conditions. 

Let us abbreviate the r.h.s. of (36.33) for a disk s as 



detS'(*a,) ~ (e 



-*)) 2 Z fo* fl '>* ggg^l 
Z s Aka s ) Z s r (ka s ) 



(36.34) 



where Z s c {ka s ) and Z s r (ka s ) are the diffractional zeta functions (here and in the fol- 
lowing we will label semiclassical zeta functions with diffractive corrections by a 
tilde) for creeping orbits around the 5th disk in the left-handed sense and the right- 
handed sense, respectively (see figure 36.2). The two orientations of the creeping 
orbits are the reason for the exponents 2 in (36.33). Equation (36.33) describes 
the semiclassical approximation to the incoherent part (= the curly bracket on the 
r.h.s.) of the exact expression (36.19) for the case that the scatterers are disks. 

In the following we will discuss the semiclassical resonances in the 1-disk 
scattering problem with Dirichlet boundary conditions, i.e. the so-called shape 
resonances. The quantum mechanical resonances are the poles of the S -matrix in 
the complex &-plane. As the 1-disk scattering problem is separable, the S -matrix 
is already diagonalized in the angular momentum eigenbasis and takes the sim- 
ple form (36.9). The exact quantummechanical poles of the scattering matrix are 
therefore given by the zeros of the Hankel functions H]^\ka) in the lower 
complex k plane which can be labeled by two indices, m and n, where m denotes 
the angular quantum number of the Hankel function and n is a radial quantum 
number. As the Hankel functions have to vanish at specific k values, one cannot 
use the usual Debye approximation as semiclassical approximation for the Hankel 
function, since this approximation only works in case the Hankel function is dom- 
inated by only one saddle. However, for the vanishing of the Hankel function, one 
has to have the interplay of two saddles, thus an Airy approximation is needed as 
in the case of the creeping poles discussed above. The Airy approximation of the 
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Hankel function Hy (ka) of complex-valued index v reads 

H^\ka)^e^(f\'\{ q ^), 
7T \ka J 

with 

1/3 

(y-ka) + 0((kay l ). 

Hence the zeros V( of the Hankel function in the complex v plane follow from 
the zeros qc of the Airy integral A{q) (see (36.3). Thus if we set V{ = m (with m 
integer), we have the following semiclassical condition on k es 

k ies a + iadk^a) 

\ 6 ) 1 \k res a) 180 70ik res a( 30 J 

6 3 \k™a) 3150 ( 6 2 180 • 6 3 J + '"' 

(36.35) 

For a given index / this is equivalent to 

0 ~ l - gW^-atlPxa 

the de-Broglie condition on the wave function that encircles the disk. Thus the 
semiclassical resonances of the 1-disk problem are given by the zeros of the fol- 
lowing product 

CO 

1=1 

which is of course nothing else than Zi-^(k), the semiclassical diffraction zeta 
function of the 1-disk scattering problem, see (36.34). Note that this expression 
includes just the pure creeping contribution and no genuine geometrical parts. 
Because of 

H il l(ka) = (-irH^\ka), 

the zeros are doubly degenerate if m + 0, corresponding to right- and left handed 
creeping turns. The case m = 0 is unphysical, since all zeros of the Hankel func- 
tion H^\ka) have negative real value. 




m ~ 



+ 

with/= 1,2, 3, • • • . 
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Figure 36.3: The shape resonances of the 1-disk 
system in the complex k plane in units of the disk 
radius a. The boxes label the exact quantum me- 
chanical resonances (given by the zeros of H^Qca) 
for m = 0, 1, 2), the crosses label the diffractional 
semiclassical resonances (given by the zeros of 
the creeping formula in the Airy approximation 
(36.35) up to the order 0([ka] 1/3 )). 



Figure 36.4: Same as in figure 36.3. However, 
the subleading terms in the Airy approximation 
(36.35) are taken into account up to the order 
0([kay lli ) (upper panel) and up to order 0([kaY r ) 
(lower panel). 



From figure 36.3 one notes that the creeping terms in the Airy order 0{\kaf^), 
which are used in the Keller construction, systematically underestimate the magni- 
tude of the imaginary parts of the exact data. However, the creeping data become 
better for increasing Rek and decreasing \lmk\, as they should as semiclassical 
approximations. 

In the upper panel of figure 36.4 one sees the change, when the next order 
in the Airy approximation (36.35) is taken into account. The approximation is 
nearly perfect, especially for the leading row of resonances. The second Airy 
approximation using (36.35) up to order 0{\kaY l ) is perfect up to the drawing 
scale of figure 36.4 (lower panel). 



multscat - 25jul2006 



ChaosBook.org version!4.4.1, Apr 21 2013 



CHAPTER 36. CHAOTIC MULTISCATTERING 



699 



Figure 36.5: A 4-disk problem with three specular 
reflections, one ghost tunneling, and distinct creeping 
segments from which all associated creeping paths can 
be constructed. 




36.4 From quantum cycle to semiclassical cycle 



The procedure for the semiclassical approximation of a general periodic itinerary 
(36.20) of length n is somewhat laborious, and we will only sketch the procedure 
here. It follows, in fact, rather closely the methods developed for the semiclassical 
reduction of the determinant of the 1-disk system. 

The quantum cycle 



trA*'* 2 • • • A 



Vl 



- z - z 



A sis 2 



' A l I 



still has the structure of a "multi-trace" with respect to angular momentum. 

Each of the sums 2w°° = -oo ~~ as i n tne 1-disk case - is replaced by a Watson 

s i 

contour resummation in terms of complex angular momentum v Si . Then the paths 
below the real v J; -axes are transformed to paths above these axes, and the integrals 
split into expressions with and without an explicit Watson sin(y $i /r) denominator. 

1. In the sm(v Sj n) -independent integrals we replace all Hankel and Bessel 
functions by Debye approximations. Then we evaluate the expression in 
the saddle point approximation: either left or right specular reflection at 
disk Si or ghost tunneling through disk $ result. 

2. For the sin(v Sj .7r) -dependent integrals, we close the contour in the upper v Sj 
plane and evaluate the integral at the residua fl^ \ka s .)—0. Then we use 

the Airy approximation for J Vs {ka Si ) and Hy s '(ka Si ): left and right creeping 
paths around disk Si result. 

In the above we have assumed that no grazing geometrical paths appear. If 
they do show up, the analysis has to be extended to the case of coinciding saddles 
between the geometrical paths with 7r/2 angle reflection from the disk surface and 
paths with direct ghost tunneling through the disk. 

There are three possibilities of "semiclassical" contact of the point particle 
with the disk sf. 

1 . either geometrical which in turn splits into three alternatives 
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(a) specular reflection to the right, 

(b) specular reflection to the left, 

(c) or 'ghost tunneling' where the latter induce the nontrivial pruning 
rules (as discussed above) 

2. or right-handed creeping turns 

3. or left-handed creeping turns, 

see figure 36.5. The specular reflection to the right is linked to left-handed creep- 
ing paths with at least one knot. The specular reflection to the left matches a 
right-handed creeping paths with at least one knot, whereas the shortest left- and 
right-handed creeping paths in the ghost tunneling case are topologically trivial. 
In fact, the topology of the creeping paths encodes the choice between the three 
alternatives for the geometrical contact with the disk. This is the case for the 
simple reason that creeping sections have to be positive definite in length: the 
creeping amplitude has to decrease during the creeping process, as tangential rays 
are constantly emitted. In mathematical terms, it means that the creeping angle 
has to be positive. Thus, the positivity of the two creeping angles for the shortest 
left and right turn uniquely specifies the topology of the creeping sections which 
in turn specifies which of the three alternatives, either specular reflection to the 
right or to the left or straight "ghost" tunneling through disk j, is realized for the 
semiclassical geometrical path. Hence, the existence of a unique saddle point is 
guaranteed. 

In order to be concrete, we will restrict ourselves in the following to the scat- 
tering from N < oo non-overlapping disks fixed in the 2-dimensional plane. The 
semiclassical approximation of the periodic itinerary 

tr A*'* 2 A' V2 * 3 • • • A I,H|S " A s "' Vl 

becomes a standard periodic orbit labeled by the symbol sequence sl S2 • • • s n . De- 
pending on the geometry, the individual legs st-i — > 57 —> St+i result either from a 
standard specular reflection at disk Sj or from a ghost path passing straight through 
disk Sj. If furthermore creeping contributions are taken into account, the symbolic 
dynamics has to be generalized from single-letter symbols {3} to triple-letter sym- 
bols [si, cr\ x £{} with li > 1 integer valued and cr, < - 0, ±1 1 By definition, the 
value o-j = 0 represents the non-creeping case, such that {s,, 0 x £(} = [st, 0} = {si} 
reduces to the old single-letter symbol. The magnitude of a nonzero 4 corre- 
sponds to creeping sections of mode number |^|, whereas the sign cr,- = ±1 signals 
whether the creeping path turns around the disk s- in the positive or negative sense. 
Additional full creeping turns around a disk Sj can be summed up as a geometrical 
series; therefore they do not lead to the introduction of a further symbol. 
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Figure 36.6: (a) The ghost itinerary (1,2, 3, 4). (b) 
The parent itinerary (1,3, 4). 




36.4.1 Ghost contributions 

An itinerary with a semiclassical ghost section at, say, disk sj can be shown to 
have the same weight as the corresponding itinerary without the $th symbol. 
Thus, semiclassically, they cancel each other in the tr ln(l - A) expansion, where 
they are multiplied by the permutation factor n/r with the integer r counting the 
repeats. For example, let (1,2,3,4) be a non-repeated periodic itinerary with a 
ghost section at disk 2 steming from the 4th-order trace trA 4 . By convention, 
an underlined disk index signals a ghost passage (as in figure 36.6a), with cor- 
responding semiclassical ghost traversal matrices also underlined, jV^A—' . 
Then its semiclassical, geometrical contribution to tr ln(l - A) cancels exactly 
against the one of its "parent" itinerary (1, 3,4) (see figure 36.6b) resulting from 
the 3rd-order trace: 

-I (4A 1 ^.3 A 3 - 4 A 4 ' 1 ) - i (3 A^A^A 4 - 1 ) 
= (+1-1)A 1 ' 3 A 3 ' 4 A 4 - 1 =0. 

The prefactors -1/3 and -1/4 are due to the expansion of the logarithm, the fac- 
tors 3 and 4 inside the brackets result from the cyclic permutation of the periodic 
itineraries, and the cancellation stems from the rule 

. . . A^i+I a£±U+2 . . . = . . . (_ A m+2) . . . _ (36 36) 

The reader might study more complicated examples and convince herself that the 
rule (36.36).is sufficient to cancel any primary or repeated periodic orbit with one 
or more ghost sections completely out of the expansion of tr ln(l - A) and there- 
fore also out of the cumulant expansion in the semiclassical limit: Any periodic 
orbit of length m with n(< m) ghost sections is cancelled by the sum of all 'par- 
ent' periodic orbits of length m - i (with 1 < i < n and i ghost sections removed) 
weighted by their cyclic permutation factor and by the prefactor resulting from 
the trace-log expansion. This is the way in which the nontrivial pruning for the 
A?-disk billiards can be derived from the exact quantum mechanical expressions 
in the semiclassical limit. Note that there must exist at least one index i in any 
given periodic itinerary which corresponds to a non-ghost section, since other- 
wise the itinerary in the semiclassical limit could only be straight and therefore 
nonperiodic. Furthermore, the series in the ghost cancelation has to stop at the 
2nd-order trace, tr A 2 , as tr A itself vanishes identically in the full domain which 
is considered here. 

'Actually, these are double-letter symbols as tr, and /, are only counted as a product. 
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36.5 Heisenberg uncertainty 

Where is the boundary ka « 2 m ~ l L/a coming from? 

This boundary follows from a combination of the uncertainty principle with 
ray optics and the non-vanishing value for the topological entropy of the 3-disk 
repeller. When the wave number k is fixed, quantum mechanics can only resolve 
the classical repelling set up to the critical topological order n.The quantum wave 
packet which explores the repelling set has to disentangle T different sections 
of size d ~ a/2 n on the "visible" part of the disk surface (which is of order a) 
between any two successive disk collisions. Successive collisions are separated 
spatially by the mean flight length L, and the flux spreads with a factor L/a. In 
other words, the uncertainty principle bounds the maximal sensible truncation in 
the cycle expansion order by the highest quantum resolution attainable for a given 
wavenumber k. 



Commentary 

Remark 36.1 Sources. This chapter is based in its entirety on ref. [ I]; the reader 
is referred to the fall exposition for the proofs and discussion of details omitted here, 
sect. 36.3 is based on appendix E of ref. [ 1 ] . We follow Franz [ 1 9] in applying the Watson 
contour method [20] to (36.23). The Airy and Debye approximations to the Hankel 
functions are given in ref. [21], the Airy expansion of the 1-disk zeros can be found 
in ref. [22]. For details see refs. [19, 22, 23, 1]. That the interior domains of the disks 
are excluded relatively to the free evolution without scattering obstacles was noted in 
refs. [24, 15]. 

The procedure for the semiclassical approximation of a general periodic itinerary 
(36.20) of length n can be found in ref. [ 1 ] for the case of the A^-disk systems. The reader 
interested in the details of the semiclassical reduction is advised to consult this reference. 

The ghost orbits were inttoduced in refs. [ 12, 24]. 

Remark 36.2 Krein-Friedel-Lloyd formula. In the literature (see, e.g., refs. [14, 
15] based on ref. [II] or ref. [I]) the transition from the quantum mechanics to the 
semiclassics of scattering problems has been performed via the semiclassical limit of the 
left hand sides of the Krein-Friedel-Lloyd sum for the (integrated) spectral density [ 5, 6, 
8, 9]. See also ref. [13] for a modern discussion of the Krein-Friedel-Lloyd formula and 
refs. [1, 17] for the connection of (35.17) to the Wigner time delay. 

The order of the two limits in (35.18) and (35.17) is essential, see e.g. Balian and 
Bloch [ ] who stress that smoothed level densities should be inserted into the Friedel 
sums. 

The necessity of the +ie in the semiclassical calculation can be understood by purely 
phenomenological considerations: Without the ie term there is no reason why one should 
be able to neglect spurious periodic orbits which solely are there because of the introduc- 
tion of the confining boundary. The subtraction of the second (empty) reference system 
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helps just in the removal of those spurious periodic orbits which never encounter the scat- 
tering region. The ones that do would still survive the first limit b — > oo, if they were not 
damped out by the +;e term. 



exercise 35.1 



Remark 36.3 T, C s , D 5 and A 55 ' matrices are trace-class In refs. [ l ] it has explicitly 
been shown that the T-matrix as well as the C s , D 5 and A" -matrices of the scattering 
problem from N < oo non-overlapping finite disks are all trace-class. The corresponding 
properties for the single-disk systems is particulary easy to prove. 
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Chapter 37 



Helium atom 



"But," Bohr protested, "nobody will believe me unless I 
can explain every atom and every molecule." Rutherford 
was quick to reply, "Bohr, you explain hydrogen and you 
explain helium and everybody will believe the rest." 
—John Archibald Wheeler (1986) 

(G. Tanner) 

So far much has been said about 1 -dimensional maps, game of pinball and 
other curious but rather idealized dynamical systems. If you have become 
impatient and started wondering what good are the methods learned so far 
in solving real physical problems, we have good news for you. We will show 
in this chapter that the concepts of symbolic dynamics, unstable periodic orbits, 
and cycle expansions are essential tools to understand and calculate classical and 
quantum mechanical properties of nothing less than the helium, a dreaded three- 
body Coulomb problem. 

This sounds almost like one step too much at a time; we all know how rich and 
complicated the dynamics of the three-body problem is - can we really jump from 
three static disks directly to three charged particles moving under the influence of 
their mutually attracting or repelling forces? It turns out, we can, but we have to 
do it with care. The full problem is indeed not accessible in all its detail, but we 
are able to analyze a somewhat simpler subsystem - collinear helium. This system 
plays an important role in the classical dynamics of the full three-body problem 
and its quantum spectrum. 

The main work in reducing the quantum mechanics of helium to a semiclassi- 
cal treatment of collinear helium lies in understanding why we are allowed to do 
so. We will not worry about this too much in the beginning; after all, 80 years and 
many failed attempts separate Heisenberg, Bohr and others in the 1920ties from 
the insights we have today on the role chaos plays for helium and its quantum 
spectrum. We have introduced collinear helium and learned how to integrate its 
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Figure 37.1: Coordinates for the helium three body 
problem in the plane. 




Figure 37.2: Collinear helium, with the two electrons 
on opposite sides of the nucleus. 



He 



trajectories in sect. B.2. Here we will find periodic orbits and determine the rele- 
vant eigenvalues of the Jacobian matrix in sect. 37.1. We will explain in sect. 37.5 
why a quantization of the collinear dynamics in helium will enable us to find parts 
of the full helium spectrum; we then set up the semiclassical spectral determinant 
and evaluate its cycle expansion. A full quantum justification of this treatment of 
helium is briefly discussed in sect. 37.5.1. 



37.1 Classical dynamics of collinear helium 

Recapitulating briefly what we learned in sect. B.2: the collinear helium system 
consists of two electrons of mass m e and charge —e moving on a line with respect 
to a fixed positively charged nucleus of charge +2e, as in figure 37.2. 

The Hamiltonian can be brought to a non-dimensionalized form 

v\ P 2 7 2 2 1 
ff=-L + ^ + _^ = _i. (37.1) 

2 2 r\ Y2 r\ + ri 

The case of negative energies chosen here is the most interesting one for us. It 
exhibits chaos, unstable periodic orbits and is responsible for the bound states and 
resonances of the quantum problem treated in sect. 37.5. 

There is another classical quantity important for a semiclassical treatment of 
quantum mechanics, and which will also feature prominently in the discussion in 
the next section; this is the classical action (33.15) which scales with energy as 

S(E)=(p dq(E) ■ p(E) = S , (37.2) 

with 5 being the action obtained from (37.1) for £ = -1, and coordinates q = 
(TU^2% P - (Pi>P2)- For the Hamiltonian (37.1), the period of a cycle and its 
action are related by (33.17), T p = \S p . 
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Figure 37.3: (a) A typical trajectory in the r t - 
r 2 plane; the trajectory enters here along the r t 
axis and escapes to infinity along the r 2 axis; (b) 
Poincare map (/2=0) for collinear helium. Strong 
chaos prevails for small r\ near the nucleus. 



a) 




b) 



After a Kustaanheimo-Stiefel transformation 



n = ef, 



ri = Q\, 



Pi 



2Qi ' 



Pi 



Pi 

2Q 2 ' 



(37.3) 



and reparametrization of time by dx = dt/r\r2, the equations of motion take form 
(B.15) 



exercise 37.1 



Pi = 2fii 
Pi = 2Q 2 



p\ o( G? 

8 ^ 2 



<2 2 - ^PzGf. 



(37.4) 



Individual electron-nucleus collisions at r\ = Q 2 X = 0 or r 2 = Q\ = 0 no longer 
pose a problem to a numerical integration routine. The equations (B.15) are sin- 
gular only at the triple collision R12 = 0, i.e., when both electrons hit the nucleus 
at the same time. 

The new coordinates and the Hamiltonian (B.14) are very useful when cal- 
culating trajectories for collinear helium; they are, however, less intuitive as a 
visualization of the three-body dynamics. We will therefore refer to the old coor- 
dinates n, r2 when discussing the dynamics and the periodic orbits. 



37.2 Chaos, symbolic dynamics and periodic orbits 



Let us have a closer look at the dynamics in collinear helium. The electrons are at- 
tracted by the nucleus. During an electron-nucleus collision momentum is trans- 
ferred between the inner and outer electron. The inner electron has a maximal 
screening effect on the charge of the nucleus, diminishing the attractive force on 
the outer electron. This electron - electron interaction is negligible if the outer 
electron is far from the nucleus at a collision and the overall dynamics is regular 
like in the 1 -dimensional Kepler problem. 
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Figure 37.4: The cycle 01 1 in the fundamental domain 
T\ > r 2 (full line) and in the full domain (dashed line). 




Things change drastically if both electrons approach the nucleus nearly si- 
multaneously. The momentum transfer between the electrons depends now sen- 
sitively on how the particles approach the origin. Intuitively, these nearly missed 
triple collisions render the dynamics chaotic. A typical trajectory is plotted in fig- 
ure 37.3 (a) where we used r\ and Y2 as the relevant axis. The dynamics can also 
be visualized in a Poincare surface of section, see figure37.3 (b). We plot here the 
coordinate and momentum of the outer electron whenever the inner particle hits 
the nucleus, i.e., r\ or Y2 = 0. As the unstructured gray region of the Poincare sec- 
tion for small r\ illustrates, the dynamics is chaotic whenever the outer electron is 
close to the origin during a collision. Conversely, regular motions dominate when- 
ever the outer electron is far from the nucleus. As one of the electrons escapes for 
almost any starting condition, the system is unbounded: one electron (say electron 
1) can escape, with an arbitrary amount of kinetic energy taken by the fugative. 
The remaining electron is trapped in a Kepler ellipse with total energy in the range 
[-1, -co]. There is no energy barrier which would separate the bound from the 
unbound regions of the phase space. From general kinematic arguments one de- 
duces that the outer electron will not return when p\ > 0, Y2 < 2 at p2 = 0, the 
turning point of the inner electron. Only if the two electrons approach the nucleus 
almost symmetrically along the line y\ = ri, and pass close to the triple collision 
can the momentum transfer between the electrons be large enough to kick one of 
the particles out completely. In other words, the electron escape originates from 
the near triple collisions. 

The collinear helium dynamics has some important properties which we now 

list. 



37.2.1 Reflection symmetry 

The Hamiltonian (B.6) is invariant with respect to electron-electron exchange; 
this symmetry corresponds to the mirror symmetry of the potential along the line 
r\ = T2, figure 37.4. As a consequence, we can restrict ourselves to the dynamics 
in the. fundamental domain r\ > Y2 and treat a crossing of the diagonal y\ = Y2 as 
a hard wall reflection. The dynamics in the full domain can then be reconstructed 
by unfolding the trajectory through back-reflections. As explained in chapter21, 
the dynamics in the fundamental domain is the key to the factorization of spectral 
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determinants, to be implemented here in (37.15). Note also the similarity between 
the fundamental domain of the collinear potential figure37.4, and the fundamental 
domain figure 12.12(b) in the 3-disk system, a simpler problem with the same 
binary symbolic dynamics. 



37.2.2 Symbolic dynamics 

We have already made the claim that the triple collisions render the collinear he- 
lium fully chaotic. We have no proof of the assertion, but the analysis of the 
symbolic dynamics lends further credence to the claim. 

The potential in (37.1) forms a ridge along the line r\ — r%. One can show 
that a trajectory passing the ridge must go through at least one two-body collision 
t\ = 0 or Y2 = 0 before coming back to the diagonal r\ = ri. This suggests 
a binary symbolic dynamics corresponding to the dynamics in the fundamental 
domain r\ >r2, the symbolic dynamics is linked to the Poincare map ri = 0 and 
the symbols 0 and 1 are denned as 

0: if the trajectory is not reflected from the line r\ = r2 between two collisions 
with the nucleus Y2 = 0; 

1 : if a trajectory is reflected from the line r\ = r2 between two collisions with 
the nucleus r2 = 0. 

Empirically, the symbolic dynamics is complete for a Poincare map in the 
fundamental domain, i.e., there exists a one-to-one correspondence between bi- 
nary symbol sequences and collinear trajectories in the fundamental domain, with 
exception of the 0 cycle. 

37.2.3 Periodic orbits 

The existence of a binary symbolic dynamics makes it easy to count the num- 
ber of periodic orbits in the fundamental domain, as in sect. 15.7.2. However, 
mere existence of these cycles does not suffice to calculate semiclassical spectral 
determinants. We need to determine their phase space trajectories and calculate 
their periods, topological indices and stabilities. A restriction of the periodic orbit 
search to a suitable Poincare surface of section, e.g. = 0 or r\ = ri, leaves us 
in general with a 2-dimensional search. Methods to find periodic orbits in multi- 
dimensional spaces have been described in chapteiT3. They depend sensitively on 
good starting guesses. A systematic search for all orbits can be achieved only af- 
ter combining multi-dimensional Newton methods with interpolation algorithms 




in depth: 

sect. 21.6, p. 432 
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Figure 37.5: Some of the shortest cycles in 
collinear helium. The classical collinear electron 
motion is bounded by the potential barrier -1 = 
—1jr\—1jri + \j (r\ + ri) and the condition r, > 0. 
The orbits are shown in the full ri-r 2 domain, the 
itineraries refers to the dynamics in the r\ > r> 
fundamental domain. The last figure, the 14-cycle 
00101 1001 101 11, is an example of a typical cycle 
with no symmetry. 



1 


0111 


w , 



\ 001011 












based on the binary symbolic dynamics phase space partitioning. All cycles up to 
symbol length 16 (some 8000 prime cycles) have been computed by such meth- 
ods, with some examples shown in figure 37.5. All numerical evidence indicates 
that the dynamics of collinear helium is hyperbolic, and that all periodic orbits are 
unstable. 

Note that the fixed point 0 cycle is not in this list. The 0 cycle would corre- 
spond to the situation where the outer electron sits at rest infinitely far from the 
nucleus while the inner electron bounces back and forth into the nucleus. The 
orbit is the limiting case of an electron escaping to infinity with zero kinetic en- 
ergy. The orbit is in the regular (i.e., separable) limit of the dynamics and is thus 
marginally stable. The existence of this orbit is also related to intermittent behav- 
ior generating the quasi-regular dynamics for large r\ that we have already noted 
in figure 37.3 (b). 

Search algorithm for an arbitrary periodic orbit is quite cumbersome to pro- 
gram. There is, however, a class of periodic orbits, orbits with symmetries, which 
can be easily found by a one-parameter search. The only symmetry left for the 
dynamics in the fundamental domain is time reversal symmetry; a time reversal 
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symmetric periodic orbit is an orbit whose trajectory in phase space is mapped 
onto itself when changing {p\ ,pi)—> (—pi, -pi), by reversing the direction of the 
momentum of the orbit. Such an orbit must be a "libration" or self -retracing cy- 
cle, an orbit that runs back and forth along the same path in the (q, ri) plane. The 
cycles 1, 01 and 001 in figure 37.5 are examples of self-retracing cycles. Luckily, 
the shortest cycles that we desire most ardently have this symmetry. 

Why is this observation helpful? A self-retracing cycle must start perpen- 
dicular to the boundary of the fundamental domain, that is, on either of the axis 
r2 = 0 or r\ = r%, or on the potential boundary -~ - — + - — 1. By 
shooting off trajectories perpendicular to the boundaries and monitoring the orbits 
returning to the boundary with the right symbol length we will find time reversal 
symmetric cycles by varying the starting point on the boundary as the only pa- 
rameter. But how can we tell whether a given cycle is self-retracing or not? All 
the relevant information is contained in the itineraries; a cycle is self -retracing if 
its itinerary is invariant under time reversal symmetry (i.e., read backwards) and 
a suitable number of cyclic permutations. All binary strings up to length 5 fulfill 
this condition. The symbolic dynamics contains even more information; we can 
tell at which boundary the total reflection occurs. One finds that an orbit starts out 
perpendicular 

• to the diagonal r\ = r2 if the itinerary is time reversal invariant and has an 
odd number of l's; an example is the cycle 001 in figure 37.5; 

• to the axis r2 = 0 if the itinerary is time reversal invariant and has an even 
number of symbols; an example is the cycle 001 1 in figure 37.5; 

• to the potential boundary if the itinerary is time reversal invariant and has 
an odd number of symbols; an example is the cycle 011 in figure 37.5. 

All cycles up to symbol length 5 are time reversal invariant, the first two non-time 
reversal symmetric cycles are cycles 001011 and 001101 in figure 37.5. Their 
determination would require a two-parameter search. The two cycles are mapped 
onto each other by time reversal symmetry, i.e., they have the same trace in the 
r\-ri plane, but they trace out distinct cycles in the full phase space. 

We are ready to integrate trajectories for classical collinear helium with the 
help of the equations of motions (B. 15) and to find all cycles up to length 5. There exercjse 37 5 
is only one thing not yet in place; we need the governing equations for the matrix 
elements of the Jacobian matrix along a trajectory in order to calculate stability 
indices. We will provide the main equations in the next section, with the details 
of the derivation relegated to the appendix C.5. 



37.3 Local coordinates, Jacobian matrix 

In this section, we will derive the equations of motion for the Jacobian matrix 
along a collinear helium trajectory. The Jacobian matrix is 4-dimensional; the 
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two trivial eigenvectors corresponding to the conservation of energy and displace- 
ments along a trajectory can, however, be projected out by suitable orthogonal 
coordinates transformations, see appendix C. We will give the transformation to 
local coordinates explicitly, here for the regularized coordinates (B.13), and state 
the resulting equations of motion for the reduced [2 x 2] Jacobian matrix. 

The vector locally parallel to the trajectory is pointing in the direction of the 
phase space velocity (7.3) 



dH 

v m - x m (t) = w m/ |— = (Hp 1 ,Hp 2 , 



-H 



-Hq 2 ) t , 



with Hq { = Jj?p, and Hp i = i = 1,2. The vector perpendicular to a trajec- 
tory x(t) = (Q\(t), Qi{t),P\{t),P2{t)) and to the energy manifold is given by the 
gradient of the Hamiltonian (B.14) 



y = VH = (H Ql ,H Q2 ,H Pl ,Hp 2 y 



By symmetry v m y m - co mn |p j^- = 0, so the two vectors are orthogonal. 
Next, we consider the orthogonal matrix 



O 



(ji,72,ylR,v) 



(37.5) 



-HpJR 
HpJR 

-Hq^ /r 
HqJR 



H Qi H qJ R 

~ h Qi H Qil R 

-Hp 2 HpJR 

Hpi Hp 2 /R 



Hp, 
Hp 2 

—Hn, 



-H 



Qi 



with R - \VH\ 2 = (Hq^ + Hq 2 + Hp t + Hp J, which provides a transformation to 
local phase space coordinates centered on the trajectory x{t) along the two vectors 
(y, v). The vectors are phase space vectors perpendicular to the trajectory and 
to the energy manifold in the 4-dimensional phase space of collinear helium. The 
Jacobian matrix (4.5) rotated to the local coordinate system by O then has the 
form 



exercise 37.6 



m = 



' win m \i * 0 1 

m 2 \ m 2 2 * 0 

0 0 10 

* * * 1 



, M = O l mO 



The linearized motion perpendicular to the trajectory on the energy manifold is 
described by the [2 x 2] matrix m; the 'trivial' directions correspond to unit eigen- 
values on the diagonal in the 3rd and 4th column and row. 

The equations of motion for the reduced Jacobian matrix m are given by 



m = \(t)m(t), 



(37.6) 
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with m(0) = 1. The matrix 1 depends on the trajectory in phase space and has the 
form 



1 = 



( hi 


In 


* 


0 ) 


hi 


hi 


* 


0 


0 


0 


0 


0 




* 


* 


o J 



where the relevant matrix elements £y are given by 

hi - i;[^Hq 1 q 2 (Hq 2 Hp 1 + Hq { Hp^) (37.7) 

K 

+(H Ql H Pl - Hq 2 Hp 2 )(Hq iQi - Hq 2 q 2 - H PlPl + Hp 2 p 2 )] 
hi = -2Hq 1 q 2 (Hq 1 Hq 2 - Hp t Hp 2 ) 

+(Hq 1 + Hp 2 ){H Q2 Q 2 + H PlPl ) + (H Q2 + H Pi )(Hq iQi + Hp 2 p 2 ) 

hi = -^[2(H Ql p 2 + Hq 2 p 1 )(Hq 2 H Pi + H Ql H Pg ) 

-(H 2 p l + H p 2 )( H QiQi + #2262) _ (. H Qi + h q 2 )( h PiPi + Hp 2 p 2 )] 
hi - -hi- 

Here Hq^j, Hp^j, i, j = 1,2 are the second partial derivatives of H with respect 
to the coordinates Qj, Pi, evaluated at the phase space coordinate of the classical 
trajectory. 



37.4 Getting ready 

Now everything is in place: the regularized equations of motion can be imple- 
mented in a Runge-Kutta or any other integration scheme to calculate trajectories. 
We have a symbolic dynamics and know how many cycles there are and how to 
find them (at least up to symbol length 5). We know how to compute the Jacobian 
matrix whose eigenvalues enter the semiclassical spectral determinant (34.12). By 
(33.17) the action S p is proportional to the period of the orbit, S p = 2T p . 

There is, however, still a slight complication. Collinear helium is an invariant 
4-dimensional subspace of the full helium phase space. If we restrict the dynamics 
to angular momentum equal zero, we are left with 6 phase space coordinates. That 
is not a problem when computing periodic orbits, they are oblivious to the other di- 
mensions. However, the Jacobian matrix does pick up extra contributions. When 
we calculate the Jacobian matrix for the full problem, we must also allow for dis- 
placements out of the collinear plane, so the full Jacobian matrix for dynamics for 
L = 0 angular momentum is 6 dimensional. Fortunately, the linearized dynamics 
in and off the collinear helium subspace decouple, and the Jacobian matrix can 
be written in terms of two distinct [2 x 2] matrices, with trivial eigen-directions 
providing the remaining two dimensions. The submatrix related to displacements 
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Table 37.1: Action S p (in units of 2n), Lyapunov exponent \A p \/T p for the motion in the collinear 
plane, winding number a p for the motion perpendicular to the collinear plane, and the topological 
index m p for all fundamental domain cycles up to topological length 6. 



p 




In 1 A 1 


o- p 


m p 


1 


1.82900 


0.6012 


0.5393 


2 


01 


3.61825 


1.8622 


1.0918 


4 


001 


5.32615 


3.4287 


1.6402 


6 


on 


5.39451 


1.8603 


1.6117 


6 


0001 


6.96677 


4.4378 


2.1710 


8 


0011 


7.04134 


2.3417 


2.1327 


8 


0111 


7.25849 


3.1124 


2.1705 


8 


00001 


8.56618 


5.1100 


2.6919 


10 


00011 


8.64306 


2.7207 


2.6478 


10 


00101 


8.93700 


5.1562 


2.7291 


10 


00111 


8.94619 


4.5932 


2.7173 


10 


01011 


9.02689 


4.1765 


2.7140 


10 


01111 


9.07179 


3.3424 


2.6989 


10 


000001 


10.13872 


5.6047 


3.2073 


12 


000011 


10.21673 


3.0323 


3.1594 


12 


000101 


10.57067 


6.1393 


3.2591 


12 


000111 


10.57628 


5.6766 


3.2495 


12 


001011 


10.70698 


5.3251 


3.2519 


12 


001101 


10.70698 


5.3251 


3.2519 


12 


001111 


10.74303 


4.3317 


3.2332 


12 


010111 


10.87855 


5.0002 


3.2626 


12 


011111 


10.91015 


4.2408 


3.2467 


12 
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off the linear configuration characterizes the linearized dynamics in the additional 
degree of freedom, the ©-coordinate in figure 37.1. It turns out that the linearized 
dynamics in the 0 coordinate is stable, corresponding to a bending type motion of 
the two electrons. We will need the Floquet exponents for all degrees of freedom 
in evaluating the semiclassical spectral determinant in sect. 37.5. 

The numerical values of the actions, Floquet exponents, stability angles, and 
topological indices for the shortest cycles are listed in table37.1. These numbers, 
needed for the semiclassical quantization implemented in the next section, an also 
be helpful in checking your own calculations. 



37.5 Semiclassical quantization of collinear helium 



Before we get down to a serious calculation of the helium quantum energy levels 
let us have a brief look at the overall structure of the spectrum. This will give us 
a preliminary feel for which parts of the helium spectrum are accessible with the 
help of our collinear model - and which are not. In order to keep the discussion as 
simple as possible and to concentrate on the semiclassical aspects of our calcula- 
tions we offer here only a rough overview. For a guide to more detailed accounts 
see remark 37.4. 



37.5.1 Structure of helium spectrum 

We start by recalling Bohr's formula for the spectrum of hydrogen like one- 
electron atoms. The eigenenergies form a Rydberg series 

_ e 4 m e Z 2 

En ~-—w (37 - 8) 

where Ze is the charge of the nucleus and nig is the mass of the electron. Through 
the rest of this chapter we adopt the atomic units e = }?%, = fi = 1 . 

The simplest model for the helium spectrum is obtained by treating the two 
electrons as independent particles moving in the potential of the nucleus neglect- 
ing the electron-electron interaction. Both electrons are then bound in hydrogen 
like states; the inner electron will see a charge Z = 2, screening at the same time 
the nucleus, the outer electron will move in a Coulomb potential with effective 
charge Z - 1 = 1. In this way obtain a first estimate for the total energy 

E ^ n = ~^~^ with n>N - (37 " 9) 

This double Rydberg formula contains already most of the information we need to 
understand the basic structure of the spectrum. The (correct) ionizations thresh- 
olds En = —-hi are obtained in the limit n — » oo, yielding the ground and excited 
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states of the helium ion He + . We will therefore refer to ./V as the principal quantum 
number. We also see that all states £^,« with ;V > 2 lie above the first ionization 
threshold for N = 1 . As soon as we switch on electron-electron interaction these 
states are no longer bound states; they turn into resonant states which decay into 
a bound state of the helium ion and a free outer electron. This might not come as 
a big surprise if we have the classical analysis of the previous section in mind: we 
already found that one of the classical electrons will almost always escape after 
some finite time. More remarkable is the fact that the first, N = I series consists 
of true bound states for all n, an effect which can only be understood by quantum 
arguments. 

The hydrogen-like quantum energies (37.8) are highly degenerate; states with 
different angular momentum but the same principal quantum number ;V share the 
same energy. We recall from basic quantum mechanics of hydrogen atom that 
the possible angular momenta for a given ,/V span / = 0,1 . . .N - 1 . How does 
that affect the helium case? Total angular momentum L for the helium three-body 
problem is conserved. The collinear helium is a subspace of the classical phase 
space for L = 0; we thus expect that we can only quantize helium states corre- 
sponding to the total angular momentum zero, a subspectrum of the full helium 
spectrum. Going back to our crude estimate (37.9) we may now attribute angular 
momenta to the two independent electrons, l\ and h say. In order to obtain total 
angular momentum L - 0 we need h = h = I and l z \ = -l z 2, that is, there are 
N different states corresponding to L = 0 for fixed quantum numbers N, n. That 
means that we expect ,/V different Rydberg series converging to each ionization 
threshold Ejy = -2 IN 2 . This is indeed the case and the ,/V different series can 
be identified also in the exact helium quantum spectrum, see figure 37.6. The 
degeneracies between the different N Rydberg series corresponding to the same 
principal quantum number N, are removed by the electron-electron interaction. 
We thus already have a rather good idea of the coarse structure of the spectrum. 

In the next step, we may even speculate which parts of the L = 0 spectrum 

can be reproduced by the semiclassical quantization of collinear helium. In the 

collinear helium, both classical electrons move back and forth along a common 

axis through the nucleus, so each has zero angular momentum. We therefore 

expect that collinear helium describes the Rydberg series with I - l\ - 1% — 0. 

These series are the energetically lowest states for fixed (N, n), corresponding to 

the Rydberg series on the outermost left side of the spectrum in figure 37.6. We 

will see in the next section that this is indeed the case and that the collinear model 

holds down to the N - \ bound state series, including even the ground state 

of helium! We will also find a semiclassical quantum number corresponding to 

the angular momentum / and show that the collinear model describes states for 

moderate angular momentum / as lone as / <sc N. . 

6 6 remark 37.4 

37.5.2 Semiclassical spectral determinant for collinear helium 

Nothing but lassitude can stop us now from calculating our first semiclassical 
eigenvalues. The only thing left to do is to set up the spectral determinant in terms 
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Figure 37.6: The exact quantum helium spectrum .3 
for L = 0. The energy levels denoted by bars 
have been obtained from full 3-dimensional quan- 
tum calculations [3], 
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of the periodic orbits of collinear helium and to write out the first few terms of its 
cycle expansion with the help of the binary symbolic dynamics. The semiclassic- 
al spectral determinant (34.12) has been written as product over all cycles of the 
classical systems. The energy dependence in collinear helium enters the classical 
dynamics only through simple scaling transformations described in sect. B. 2.1 
which makes it possible to write the semiclassical spectral determinant in the form 



det (H-E) sc = exp 



2 2 r 1 



Jr(sS p -m p j) 



7 f-( r (-det (1 - M; ± )) 1 / 2 |det (1 - 



,(37.10) 



with the energy dependence absorbed into the variable 




obtained by using the scaling relation (37.2) for the action. As explained in 
sect. 37.3, the fact that the [4 x 4] Jacobian matrix decouples into two [2 x 2] 



helium - 27dec2004 



ChaosBook.org version!4.4.1, Apr 21 2013 



CHAPTER 37. HELIUM ATOM 



717 



submatrices corresponding to the dynamics in the collinear space and perpendic- 
ular to it makes it possible to write the denominator in terms of a product of two 
determinants. Stable and unstable degrees of freedom enter the trace formula in 
different ways, reflected by the absence of the modulus sign and the minus sign 
in front of det (1 - M±). The topological index m p corresponds to the unstable 
dynamics in the collinear plane. Note that the factor i nN ^ present in (34.12) 
is absent in (37.10). Collinear helium is an open system, i.e., the eigenenergies 
are resonances corresponding to the complex zeros of the semiclassical spectral 
determinant and the mean energy staircase N(E) not denned. In order to obtain 
a spectral determinant as an infinite product of the form (34.18) we may proceed 
as in (19.9) by expanding the determinants in (37.10) in terms of the eigenvalues 
of the corresponding Jacobian matrices. The matrix representing displacements 
perpendicular to the collinear space has eigenvalues of the form exp(±27r/cr), re- 
flecting stable linearized dynamics, cr is the full winding number along the orbit 
in the stable degree of freedom, multiplicative under multiple repetitions of this 
orbit .The eigenvalues corresponding to the unstable dynamics along the collinear 
axis are paired as {A, 1 /A} with |A| > 1 and real. As in (19.9) and (34. 18) we may 
thus write 

[-det(l -M;)|det(l -Mp|]~ 1/2 (37.11) 
- [-(1 - A r )(l - A~ r )l(l - e W )(l - e- 2nircr )\ m 

OO 

The ± sign corresponds to the hyperbolic/inverse hyperbolic periodic orbits with 
positive/negative eigenvalues A. Using the relation (37.12) we see that the sum 
over r in (37.10) is the expansion of the logarithm, so the semiclassical spectral 
determinant can be rewritten as a product over dynamical zeta functions, as in 
(19.9): 



k=Q m=0 k=Q m=0 p 



where the cycle weights are given by 



(k,m) _ 1 i(sS p -m p Z-4ntf+l/2)o- p ) m ,ix 

p "lApA* ' { ' 



and m p is the topological index for the motion in the collinear plane which equals 
twice the topological length of the cycle. The two independent directions perpen- 
dicular to the collinear axis lead to a twofold degeneracy in this degree of freedom 
which accounts for an additional factor 2 in front of the winding number cr. The 
values for the actions, winding numbers and stability indices of the shortest cycles 
in collinear helium are listed in table 37. 1. 
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The integer indices i and k play very different roles in the semiclassical spec- 
tral determinant (37. 12). A linearized approximation of the flow along a cycle cor- 
responds to a harmonic approximation of the potential in the vicinity of the trajec- 
tory. Stable motion corresponds to a harmonic oscillator potential, unstable mo- 
tion to an inverted harmonic oscillator. The index t which contributes as a phase 
to the cycle weights in the dynamical zeta functions can therefore be interpreted 
as a harmonic oscillator quantum number; it corresponds to vibrational modes in 
the 0 coordinate and can in our simplified picture developed in sect. 37.5.1 be 
related to the quantum number I - h = h representing the single particle angular 
momenta. Every distinct I value corresponds to a full spectrum which we obtain 
from the zeros of the semiclassical spectral determinant 1 keeping t fixed. The 
harmonic oscillator approximation will eventually break down with increasing 
off-line excitations and thus increasing £. The index k corresponds to 'excitations' 
along the unstable direction and can be identified with local resonances of the in- 
verted harmonic oscillator centered on the given orbit. The cycle contributions 
tf'"^ decrease exponentially with increasing k. Higher k terms in an expansion of 
the determinant give corrections which become important only for large negative 
imaginary s values. As we are interested only in the leading zeros of (37.12), i.e., 
the zeros closest to the real energy axis, it is sufficient to take only the k = 0 terms 
into account. 

Next, let us have a look at the discrete symmetries discussed in sect. 37.2. 
Collinear helium has a C2 symmetry as it is invariant under reflection across 
the r\ = ri line corresponding to the electron-electron exchange symmetry. As 
explained in sects. 21.1.1 and 21.5, we may use this symmetry to factorize the 
semiclassical spectral determinant. The spectrum corresponding to the states sym- 
metric or antisymmetric with respect to reflection can be obtained by writing the 
dynamical zeta functions in the symmetry factorized form 

l/£ {0 = Y\(l-t a ) 2 Y](l-t 2 s ). (37.14) 



Here, the first product is taken over all asymmetric prime cycles, i.e., cycles that 
are not self-dual under the C2 symmetry. Such cycles come in pairs, as two equiv- 
alent orbits are mapped into each other by the symmetry transformation. The sec- 
ond product runs over all self-dual cycles; these orbits cross the axis n = r2 twice 
at a right angle. The self-dual cycles close in the fundamental domain r\ < r2 
already at half the period compared to the orbit in the full domain, and the cy- 
cle weights ts in (37.14) are the weights of fundamental domain cycles. The C2 
symmetry now leads to the factorization of (37.14) l/£ = £~ X £Z X , with 

= ~[(i-t a )]yi-ts), 

a s 

= n (1 ~ f " ) il (1 + fj) ' (37 - 15) 



setting k = 0 in what follows. The symmetric subspace resonances are given 
by the zeros of l/<£ \ antisymmetric resonances by the zeros of l/£ , with the 
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two dynamical zeta functions defined as products over orbits in the fundamental 
domain. The symmetry properties of an orbit can be read off directly from its 
symbol sequence, as explained in sect. 37.2. An orbit with an odd number of l's 
in the itinerary is self-dual under the C2 symmetry and enters the spectral deter- 
minant in (37.15) with a negative or a positive sign, depending on the symmetry 
subspace under consideration. 



37.5.3 Cycle expansion results 



So far we have established a factorized form of the semiclassical spectral det- 
erminant and have thereby picked up two good quantum numbers; the quantum 
number m has been identified with an excitation of the bending vibrations, the 
exchange symmetry quantum number ±1 corresponds to states being symmetric 
or antisymmetric with respect to the electron-electron exchange. We may now 
start writing down the binary cycle expansion (20.7) and determine the zeros of 
spectral determinant. There is, however, still another problem: there is no cycle 0 
in the collinear helium. The symbol sequence 0 corresponds to the limit of an outer 
electron fixed with zero kinetic energy at r\ =00, the inner electron bouncing back 
and forth into the singularity at the origin. This introduces intermittency in our 
system, a problem discussed in chapter 24. We note that the behavior of cycles 
going far out in the channel r\ or ri — > 00 is very different from those staying in the 
near core region. A cycle expansion using the binary alphabet reproduces states 
where both electrons are localized in the near core regions: these are the lowest 
states in each Rydberg series. The states converging to the various ionization 
thresholds - -2/N 2 correspond to eigenfunctions where the wave function 
of the outer electron is stretched far out into the ionization channel T^,r% — > 00. 
To include those states, we have to deal with the dynamics in the limit of large 
r\, Y2- This turns out to be equivalent to switching to a symbolic dynamics with an 
infinite alphabet. With this observation in mind, we may write the cycle expansion 
(....) for a binary alphabet without the 0 cycle as 



i/rc*)= 1 



f (f) _ AO 
'01 



'1 



\t iC) 

L '001 



+ t, 



Lt 0001 



+ i 



it) 
0011 



'OOl'l 



(0 
Oil 

(0 



+ i 



At) At), 
<()!<! J 

if) 



0111 



'on 



(37.16) 



The weights t p are given in (37.12), with contributions of orbits and composite 
orbits of the same total symbol length collected within square brackets. The cycle 
expansion depends only on the classical actions, stability indices and winding 
numbers, given for orbits up to length 6 in table 37.1. To get reacquainted with 
the cycle expansion formula (37.16), consider a truncation of the series after the 
first term 
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Table 37.2: Collinear helium, real part of the symmetric subspace resonances obtained by a cycle 
expansion (37.16) up to cycle length j. The exact quantum energies [3] are in the last column. The 
states are labeled by their principal quantum numbers. A dash as an entry indicates a missing zero 
at that level of approximation. 



N 


n 


;' - 1 
J 1 


;' - 4 
J ^ 


;' - 8 

J ° 


;'-12 


7=16 

j - lu 


-E 


1 


i 


3.0970 


2.9692 


2.9001 


2.9390 


2.9248 


2.9037 


2 


2 


0.8044 


0.7714 


0.7744 


0.7730 


0.7727 


0.7779 


2 


3 




0.5698 


0.5906 


0.5916 


0.5902 


0.5899 


2 


4 








0.5383 


0.5429 


0.5449 


3 


3 


0.3622 


0.3472 


0.3543 


0.3535 


0.3503 


0.3535 


3 


4 






0.2812 


0.2808 


0.2808 


0.2811 


3 


5 






0.2550 


0.2561 


0.2559 


0.2560 


3 


6 








0.2416 


0.2433 


0.2438 


4 


4 


0.2050 


0.1962 


0.1980 


0.2004 


0.2012 


0.2010 


4 


5 




0.1655 


0.1650 


0.1654 


0.1657 


0.1657 


4 


6 






0.1508 


0.1505 


0.1507 


0.1508 


4 


7 






0.1413 


0.1426 


0.1426 


0.1426 



The quantization condition ll£ c \s) = 0 leads to 
(Si/2n) 2 



-m,N 



[m+ { + 2(N + 5)cri] 2 ' 



m, N = 0,1,2,... , (37.17) 



with 5 1 /2n = 1.8290 for the action and cry - 0.5393 for the winding number, see 
table 37.1, the 1 cycle in the fundamental domain. This cycle can be described as 
the asymmetric stretch orbit, see figure 37.5. The additional quantum number N in 
(37.17) corresponds to the principal quantum number defined in sect. 37.5.1. The 
states described by the quantization condition 07.17) are those centered closest to 
the nucleus and correspond therefore to the lowest states in each Rydberg series 
(for a fixed m and ;V values), in figure 37.6. The simple formula (37.17) gives 
already a rather good estimate for the ground state of helium! Results obtained 
from (37.17) are tabulated in table 37.2, see the 3rd column under j — 1 and the 
comparison with the full quantum calculations. 

In order to obtain higher excited quantum states, we need to include more 
orbits in the cycle expansion (37.16), covering more of the phase space dynamics 
further away from the center. Taking longer and longer cycles into account, we 
indeed reveal more and more states in each /V-series for fixed m. This is illustrated 
by the data listed in table 37.2 for symmetric states obtained from truncations of 
the cycle expansion of 

Results of the same quality are obtained for antisymmetric states by calculat- 
ing the zeros of l/£ . Repeating the calculation with I = 1 or higher in (37.15) 
reveals states in the Rydberg series which are to the right of the energetically low- 
est series in figure 37.6. 



exercise 37.7 
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Resume 

We have covered a lot of ground starting with considerations of the classical prop- 
erties of a three-body Coulomb problem, and ending with the semiclassical he- 
lium spectrum. We saw that the three-body problem restricted to the dynamics 
on a collinear appears to be fully chaotic; this implies that traditional semiclassi- 
cal methods such as WXBquantization will not work and that we needed the full 
periodic orbit theory to obtain leads to the semiclassical spectrum of helium. As 
a piece of unexpected luck the symbolic dynamics is simple, and the semiclassi- 
cal quantization of the collinear dynamics yields an important part of the helium 
spectrum, including the ground state, to a reasonable accuracy. A sceptic might 
say: "Why bother with all the semiclassical considerations? A straightforward nu- 
merical quantum calculation achieves the same goal with better precision." While 
this is true, the semiclassical analysis offers new insights into the structure of the 
spectrum. We discovered that the dynamics perpendicular to the collinear plane 
was stable, giving rise to an additional (approximate) quantum number I. We thus 
understood the origin of the different Rydberg series depicted in figure37.6, a fact 
which is not at all obvious from a numerical solution of the quantum problem. 

Having traversed the long road from the classical game of pinball all the way 
to a credible helium spectrum computation, we could declare victory and fold 
down this enterprise. Nevertheless, there is still much to think about - what about 
such quintessentially quantum effects as diffraction, tunnelling, ...? As we shall 
now see, the periodic orbit theory has still much of interest to offer. 



Commentary 

Remark 37.1 Sources. The full 3-dimensional Hamiltonian after elimination of the 
center of mass coordinates, and an account of the finite nucleus mass effects is given in 
ref. [2]. The general two-body collision regularizing Kustaanheimo-Stiefel transforma- 
tion [5], a generalization of Levi-Civita's [13] Pauli matrix two-body collision regular- 
ization for motion in a plane, is due to Kustaanheimo [ ] who realized that the correct 
higher-dimensional generalization of the "square root removal" trick ( B.l 1), by introduc- 
ing a vector Q with property r — \ Q\ 2 , is the same as Dirac's trick of getting linear equation 
for spin 1/2 fermions by means of spinors. Vector spaces equipped with a product and a 
known satisfy \Q ■ Q\ — \Q\ 2 define normed algebras. They appear in various physical ap- 
plications - as quaternions, octonions, spinors. The technique was originally developed in 
celestial mechanics [ ] to obtain numerically stable solutions for planetary motions. The 
basic idea was in place as early as 1931, when H. Hopf [ 14] used a KS transformation in 
order to illustrate a Hopf 's invariant. The KS transformation for the collinear helium was 
introduced in ref. [ 2] . 

Remark 37.2 Complete binary symbolic dynamics. No stable periodic orbit and no 
exception to the binary symbolic dynamics of the collinear helium cycles have been found 
in numerical investigations. A proof that all cycles are unstable, that they are uniquely 
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labeled by the binary symbolic dynamcis, and that this dynamics is complete is, however, 
still missing. The conjectured Markov partition of the phase space is given by the triple 
collision manifold, i.e., by those trajectories which start in or end at the singular point 
r\ — r-i — 0. See alsoref. [2]. 

Remark 37.3 Spin and particle exchange symmetry. In our presentation of collinear 
helium we have completely ignored all dynamical effects due to the spin of the particles 
involved, such as the electronic spin-orbit coupling. Electrons are fermions and that deter- 
mines the symmetry properties of the quantum states. The total wave function, including 
the spin degrees of freedom, must be antisymmetric under the electron-electron exchange 
transformation. That means that a quantum state symmetric in the position variables must 
have an antisymmetric spin wave function, i.e., the spins are antiparallel and the total spin 
is zero (singletstate). Antisymmetric states have symmetric spin wave function with total 
spin 1 (tripletstates). The threefold degeneracy of spin 1 states is lifted by the spin-orbit 
coupling. 

Remark 37.4 Helium quantum numbers. The classification of the helium states in 
terms of single electron quantum numbers, sketched in sect. 37.5.1, prevailed until the 
1960's; a growing discrepancy between experimental results and theoretical predictions 
made it necessary to refine this picture. In particular, the different Rydberg series sharing 
a given A^-quantum number correspond, roughly speaking, to a quantization of the inter 
electronic angle 0, see figure 37.1, and can not be described in terms of single electron 
quantum numbers The fact that something is slightly wrong with the single electron 
picture laid out in sect. 37.5.1 is highlighted when considering the collinear configuration 
where both electrons are on the same side of the nucleus. As both electrons again have 
angular momentum equal to zero, the corresponding quantum states should also belong 
to single electron quantum numbers (Z i, fc) = (0, 0). However, the single electron picture 
breaks down completely in the limit 0 = 0 where electron-electron interaction becomes 
the dominant effect. The quantum states corresponding to this classical configuration are 
distinctively different from those obtained from the collinear dynamics with electrons on 
different sides of the nucleus. The Rydberg series related to the classical 0 = 0 dynamics 
are on the outermost rigth side in each N subspectrum in figure 37.6, and contain the 
energetically highest states for given N,n quantum numbers, see also remark 37.5. A 
detailed account of the historical development as well as a modern interpretation of the 
spectrum can be found in ref. [1]. 

Remark 37.5 Beyond the unstable collinear helium subspace. The semiclassical 
quantization of the chaotic collinear helium subspace is discussed inrefs. [7, 8, 9]. Classi- 
cal and semiclassical considerations beyond what has been discussed in sect. 37.5 follow 
several other directions, all outside the main of this book. 

A classical study of the dynamics of collinear helium where both electrons are on the 
same side of the nucleus reveals that this configuration is fully stable both in the collinear 
plane and perpendicular to it. The corresponding quantum states can be obtained with 
the help of an approximate EBK-quantization which reveals helium resonances with ex- 
tremely long lifetimes (quasi - bound states in the continuum). These states form the 
energetically highest Rydberg series for a given principal quantum number N, see fig- 
ure 37.6. Details can be found in refs. [ 10, 1 1]. 
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In order to obtain the Rydberg series structure of the spectrum, i.e., the succession 
of states converging to various ionization thresholds, we need to take into account the 
dynamics of orbits which make large excursions along the r\ or rj_ axis. In the chaotic 
collinear subspace these orbits are characterized by symbol sequences of form (aO ") where 
a stands for an arbitrary binary symbol sequence and 0 " is a succession of n O's in a row. 
A summation of the form Yin=o f <>0"> where t p are the cycle weights in (37.12), and cycle 
expansion of indeed yield all Rydberg states up the various ionization thresholds, see 
ref. [4]. For a comprehensive overview on spectra of two-electron atoms and semiclassical 
treatments ref. [1]. 



Exercises 

37.1. Kustaanheimo-Stiefel transformation. Check the 
Kustaanheimo-Stiefel regularization for collinear he- 
lium; derive the Hamiltonian (B.14) and the collinear 
helium equations of motion (B.15). 

37.2. Helium in the plane. Starting with the helium 
Hamiltonian in the infinite nucleus mass approximation 
mi, e = oo, and angular momentum L — 0, show that the 
three body problem can be written in terms of three inde- 
pendent coordinates only, the electron-nucleus distances 
r\ and r2 and the inter-electron angle ©, see figure B.l. 

37.3. Helium trajectories. Do some trial integrations of the 
collinear helium equations of motion (B. 15). Due to the 
energy conservation, only three of the phase space coor- 
dinates (Q\,Qi,P\,Pi) are independent. Alternatively, 
you can integrate in 4 dimensions and use the energy 
conservation as a check on the quality of your integra- 
tor. 

The dynamics can be visualized as a motion in the orig- 
inal configuration space (r\,ri), r-, > 0 quadrant, or, 
better still, by an appropriately chosen 2-dimensional 
Poincare section, exercise 37.4. Most trajectories will 
run away, do not be surprised - the classical collinear he- 
lium is unbound. Try to guess approximately the short- 
est cycle of figure 37.4. 

37.4. A Poincare section for collinear Helium. Construct 
a Poincare section of figure 37.3b that reduces the he- 
lium flow to a map. Try to delineate regions which cor- 
respond to finite symbol sequences, i.e. initial condi- 
tions that follow the same topological itinerary in fig- 
ure 37.3a space for a finite number of bounces. Such 
rough partition can be used to initiate 2-dimensional 
Newton-Raphson method searches for helium cycles, 
exercise 37.5. 



37.5. Collinear helium cycles. The motion in the (n, rj) 
plane is topologically similar to the pinball motion in a 
3-disk system, except that the motion is in the Coulomb 
potential. 

Just as in the 3-disk system the dynamics is simplified 
if viewed in the fundamental domain, in this case the 
region between r\ axis and the r\ = ri diagonal. Mod- 
ify your integration routine so the trajectory bounces off 
the diagonal as off a mirror. Miraculously, the symbolic 
dynamics for the survivors again turns out to be binary, 
with 0 symbol signifying a bounce off the r \ axis, and 
1 symbol for a bounce off the diagonal. Just as in the 
3-disk game of pinball, we thus know what cycles need 
to be computed for the cycle expansion (37.16). 

Guess some short cycles by requiring that topologically 
they correspond to sequences of bounces either return- 
ing to the same r, axis or reflecting off the diagonal. 
Now either Use special symmetries of orbits such as 
self-re tracing to find all orbits up to length 5 by a 1- 
dimensional Newton search. 

37.6. Collinear helium cycle stabilities. Compute the 
eigenvalues for the cycles you found in exercise 37.5, as 
described in sect. 37.3. You may either integrate the re- 
duced 2x2 matrix using equations (37.6) together with 
the generating function 1 given in local coordinates by 
(37.7) or integrate the full 4x4 Jacobian matrix, see 
sect. L.l. Integration in 4 dimensions should give 
eigenvalues of the form (1, 1, A p , l/A p ); The unit eigen- 
values are due to the usual periodic orbit invariances; 
displacements along the orbit as well as perpendicular 
to the energy manifold are conserved; the latter one 
provides a check of the accuracy of your computation. 
Compare with table 37.1; you should get the actions and 



exerHelium - 16apt2002 



ChaosBook.org versionl4.4. 1, Apr21 2013 



REFERENCES 



724 



Lyapunov exponents right, but topological indices and 
stability angles we take on faith. 

37.7. Helium eigenenergies. Compute the lowest eigenen- 
ergies of singlet and triplet states of helium by substi- 
tuting cycle data into the cycle expansion (37.16) for 
the symmetric and antisymmetric zeta functions (37.15). 
Probably the quickest way is to plot the magnitude of the 



zeta function as function of real energy and look for the 
minima. As the eigenenergies in general have a small 
imaginary part, a contour plot such as figure 20.1, can 
yield informed guesses. Better way would be to find the 
zeros by Newton method, sect. 20.3.1. How close are 
you to the cycle expansion and quantum results listed in 
table 37.2? You can find more quantum data in ref. [3]. 
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Chapter 38 



Diffraction distraction 



(N. Whelan) 



Diffraction effects characteristic to scattering off wedges are incorporated 
into the periodic orbit theory. 



38.1 Quantum eavesdropping 



As noted in chapter 37, the classical mechanics of the helium atom is undefined 
at the instant of a triple collision. This is a common phenomenon - there is often 
some singularity or discontinuity in the classical mechanics of physical systems. 
This discontinuity can even be helpful in classifying the dynamics. The points in 
phase space which have a past or future at the discontinuity form manifolds which 
divide the phase space and provide the symbolic dynamics. The general rule is that 
quantum mechanics smoothes over these discontinuities in a process we interpret 
as diffraction. We solve the local diffraction problem quantum mechanically and 
then incorporate this into our global solution. By doing so, we reconfirm the 
central leitmotif of this treatise: think locally - act globally. 

While being a well-motivated physical example, the helium atom is somewhat 
involved. In fact, so involved that we do not have a clue how to do it. In its 
place we illustrate the concept of diffractive effects with a pinball game. There 
are various classes of discontinuities which a billiard can have. There may be a 
grazing condition such that some trajectories hit a smooth surface while others 
are unaffected - this leads to the creeping described in chapter 35. There may be a 
vertex such that trajectories to one side bounce differently from those to the other 
side. There may be a point scatterer or a magnetic flux line such that we do not 
know how to continue classical mechanics through the discontinuities. In what 
follows, we specialize the discussion to the second example - that of vertices or 
wedges. To further simplify the discussion, we consider the special case of a half 
line which can be thought of as a wedge of angle zero. 
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a. 




Figure 38.1: Scattering of a plane wave off a half line. 




We start by solving the problem of the scattering of a plane wave off a half 
line (see figure 38.1). This is the local problem whose solution we will use to 
construct a global solution of more complicated geometries. We define the vertex 
to be the origin and launch a plane wave at it from an angle a. What is the total 
field? This is a problem solved by Sommerfeld in 1 896 and our discussion closely 
follows his. 

The total field consists of three parts - the incident field, the reflected field 
and the diffractive field. Ignoring the third of these for the moment, we see that 
the space is divided into three regions. In region I there is both an incident and a 
reflected wave. In region II there is only an incident field. In region III there is 
nothing so we call this the shadowed region. However, because of diffraction the 
field does enter this region. This accounts for why you can overhear a conversation 
if you are on the opposite side of a thick wall but with a door a few meters away. 
Traditionally such effects have been ignored in semiclassical calculations because 
they are relatively weak. However, they can be significant. 

To solve this problem Sommerfeld worked by analogy with the full line case, 
so let us briefly consider that much simpler problem. There we know that the 
problem can be solved by images. An incident wave of amplitude A is of the form 



where iff - <p - a and cf> is the angular coordinate. The total field is then given by 
the method of images as 



where the negative sign ensures that the boundary condition of zero field on the 
line is satisfied. 

Sommerfeld then argued that v(r, iff) can also be given a complex integral rep- 
resentation 



v(r, iff) 



Ae 



(38.1) 



v tot = v ( r > <P~ a)~ v(r, (f> + a), 



(38.2) 




(38.3) 
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Figure 38.2: The contour in the complex /? plane. 
The pole is at f} = -if/ (marked by x in the figure) 
and the integrand approaches zero in the shaded 
regions as the magnitude of the imaginary part of 
P approaches infinity. 




This is certainly correct if the function fifi, \p) has a pole of residue 1/2/n at yS = 
-if/ and if the contour C encloses that pole. One choice is 

s^'kw^i- <38 - 4) 

(We choose the pole to be at/3 = -ijj rather than f5 = if/ for reasons discussed later.) 
One valid choice for the contour is shown in figure 38.2. This encloses the pole 
and vanishes as |Im/3| — > oo (as denoted by the shading). The sections A and D2 
are congruent because they are displaced by 2n. However, they are traversed in 
an opposite sense and cancel, so our contour consists of just the sections Q and 
C2. The motivation for expressing the solution in this complicated manner should 
become clear soon. 

What have we done? We extended the space under consideration by a factor 
of two and then constructed a solution by assuming that there is also a source in 
the unphysical space. We superimpose the solutions from the two sources and at 
the end only consider the solution in the physical space to be meaningful. Fur- 
thermore, we expressed the solution as a contour integral which reflects the 2n 
periodicity of the problem. The half line scattering problem follows by analogy. 

Whereas for the full line the field is periodic in 2k, for the half line it is peri- 
odic in An. This can be seen by the fact that the field can be expanded in a series 
of the form {sin(0/2), sin(0), sin(30/2), • • •}. As above, we extend the space by 
thinking of it as two sheeted. The physical sheet is as shown in figure38.1 and the 
unphysical sheet is congruent to it. The sheets are glued together along the half 
line so that a curve in the physical space which intersects the half line is continued 
in the unphysical space and vice-versa. The boundary conditions are that the total 
field is zero on both faces of the half line (which are physically distinct boundary 
conditions) and that as r — > 00 the field is composed solely of plane waves and 
outgoing circular waves of the form g((p)exp(ikr)/^Jkr. This last condition is a 
result of Huygens' principle. 
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We assume that the complete solution is also given by the method of images 

as 

v tot = M ( r > (/> - o) - u(r, <f> + a). (38.5) 

where u(r, ip) is a 47r-periodic function to be determined. The second term is 
interpreted as an incident field from the unphysical space and the negative sign 
guarantees that the solution vanishes on both faces of the half line. Sommerfeld 
then made the ansatz that u is as given in equation (38.3) with the same contour 
C\ + C2 but with the An periodicity accounted for by replacing equation (38.4) 
with 

1 e^/ 2 

(We divide by 4n rather than 2n so that the residue is properly normalized.) The 
integral (38.3) can be thought of as a linear superposition of an infinity of plane 
waves each of which satisfies the Helmholtz equation (V 2 + k 2 )v = 0, and so their 
combination also satisfies the Helmholtz equation. We will see that the diffracted 
field is an outgoing circular wave; this being a result of choosing the pole at (3 = 
-if/ rather than [3 = if/ in equation (38.4). Therefore, this ansatz is a solution of 
the equation and satisfies all boundary conditions and therefore constitutes a valid 
solution. By uniqueness this is the only solution. 

In order to further understand this solution, it is useful to massage the contour. 
Depending on <p there may or may not be a pole between /? = —n and fi = n. In 
region I, both functions u(r, (p ± a) have poles which correspond to the incident 
and reflected waves. In region II, only u(r, <f> - a) has a pole corresponding to the 
incident wave. In region III there are no poles because of the shadow. Once we 
have accounted for the geometrical waves (i.e., the poles), we extract the diffracted 
waves by saddle point analysis at /3 = ±n. We do this by deforming the contours 
C so that they go through the saddles as shown in figure 38.2. 

Contour C\ becomes E2 + F while contour C2 becomes E\ — F where the 
minus sign indicates that it is traversed in a negative sense. As a result, F has no 
net contribution and the contour consists of just E\ and Ei. 

As a result of these machinations, the curves E are simply the curves D of 
figure 38.2 but with a reversed sense. Since the integrand is no longer 2n periodic, 
the contributions from these curves no longer cancel. We evaluate both stationary 
phase integrals to obtain 

e in/4 e ikr 

u(r, if/) a -A—— sec(if/f2)—— (38.7) 
V^r ikr 

so that the total diffracted field is 

e inlA I (<p-a\ l<h + a\\e ikr ^„ „ 
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-271: 



Figure 38.3: The contour used to evaluate the 
diffractive field after the contribution of possible 
poles has been explicitly evaluated. The curve F 
is traversed twice in opposite directions and has no 
net contribution. 




Jt 2jt S 



Note that this expression breaks down when cf> + a - n. These angles correspond 
to the borders among the three regions of figure 38.1 and must be handled more 
carefully - we can not do a stationary phase integral in the vicinity of a pole. 
However, the integral representation (38.3) and (38.6) is uniformly valid. 

©xsrcis© oo.i 

We now turn to the simple task of translating this result into the language of 
semiclassical Green's functions. Instead of an incident plane wave, we assume a 
source at point xf and then compute the resulting field at the receiver position x. 
If x is in region I, there is both a direct term, and a reflected term, if x is in region 
II there is only a direct term and if x is in region III there is neither. In any event 
these contributions to the semiclassical Green's function are known since the free 
space Green's function between two points X2 and x\ is 

Gdx 2 , x u k) = - l -H ( *\kd) * — — exp{i(kd + tt/4)}, (38.9) 

4 0 V^rW 

where d is the distance between the points. For a reflection, we need to multiply 
by -1 and the distance is the length of the path via the reflection point. Most 
interesting for us, there is also a diffractive contribution to the Green's function. 
In equation (38.8), we recognize that the coefficient A is simply the intensity at the 
origin if there were no scatterer. This is therefore replaced by the Green's function 
to go from the source to the vertex which we label xy. Furthermore, we recognize 
that exp(ikr)/ yfkr is, within a proportionality constant, the semiclassical Green's 
function to go from the vertex to the receiver. 

Collecting these facts, we say 

^diff( x ' x '' ^) = ^fC*' x v> k)d(9, 9')Gf(xv, x , k), (38.10) 
where, by comparison with equations (38.8) and (38.9), we have 

(6-Q'\ (9 + 9'\ 
d(6,e') = sec — -sec — . (38.11) 
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Here 9' is the angle to the source as measured from the vertex and 9 is the angle to 
the receiver. They were denoted as a and <p previously. Note that there is a sym- 
metry between the source and receiver as we expect for a time-reversal invariant 
process. Also the diffraction coefficient d does not depend on which face of the 
half line we use to measure the angles. As we will see, a very important property 
of G^jff is that it is a simple multiplicative combination of other semiclassical 
Green's functions. 

We now recover our classical perspective by realizing that we can still think of 
classical trajectories. In calculating the quantum Green's function, we sum over 
the contributions of various paths. These include the classical trajectories which 
connect the points and also paths which connect the points via the vertex. These 
have different weights as given by equations (38.9) and (38.10) but the concept of 
summing over classical paths is preserved. 

For completeness, we remark that there is an exact integral representation for 
the Green's function in the presence of a wedge of arbitrary opening angle [15]. 
It can be written as 



G(x, x, k) = g(r, r , k, 9' - 9) - g(r, r',k, 9' + 9) (38.12) 

where (r, 9) and (r 1 ,9') are the polar coordinates of the points x and i as measured 
from the vertex and the angles are measured from either face of the wedge. The 
function g is given by 



i r HUk Jr 2 + r' 2 -2rr'cosB) 
Sir, r ',k,t) = — — 



)c [+ c 2 i_exp(/^) 



exercise 38.2 



where v = y/n and y is the opening angle of the wedge, (ie y = 2n in the case of 
the half plane). The contour C\ + C2 is the same as shown in figure 38.2. 

The poles of this integral give contributions which can be identified with the 
geometric paths connecting x and xf. The saddle points at = ±/r give contribu- 
tions which can be identified with the diffractive path connecting x and i. The 
saddle point analysis allows us to identify the diffraction constant as 



4 sin - sin - sin — 

d(9,9') = y —. v — -, (38.14) 

v (cos a - cos 2f-\ (cos 2 - cos 2=2- J 



which reduces to (38.1 1) when v -2. Note that the diffraction coefficient vanishes 
identically if v = 1 fn where n is any integer. This corresponds to wedge angles 
of y = n /n (eg. n=l corresponds to a full line and n=2 corresponds to a right 
angle). This demonstration is limited by the fact that it came from a leading 
order asymptotic expansion but the result is quite general. For such wedge angles, 
we can use the method of images (we will require 2n - 1 images in addition to 
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Figure 38.4: The billiard considered here. The dy- 
namics consists of free motion followed by specular 
reflections off the faces. The top vertex induces diffrac- 
tion while the bottom one is a right angle and induces 




two specular geometric reflections. 



the actual source point) to obtain the Green's function and there is no diffractive 
contribution to any order. Classically this corresponds to the fact that for such 
angles, there is no discontinuity in the dynamics. Trajectories going into the vertex 
can be continued out of them unambiguously. This meshes with the discussion in 
the introduction where we argued that diffractive effects are intimately linked with 
classical discontinuities. 

The integral representation is also useful because it allows us to consider ge- 
ometries such that the angles are near the optical boundaries or the wedge angle 
is close to n/n. For these geometries the saddle point analysis leading to (38.14) 
is invalid due to the existence of a nearby pole. In that event, we require a more 
sophisticated asymptotic analysis of the full integral representation. 



38.2 An application 



Although we introduced diffraction as a correction to the purely classical effects; 
it is instructive to consider a system which can be quantized solely in terms of 
periodic diffractive orbits. Consider the geometry shown in figure 38.4 The clas- 
sical mechanics consists of free motion followed by specular reflections off faces. 
The upper vertex is a source of diffraction while the lower one is a right angle and 
induces no diffraction. This is an open system, there are no bound states - only 
scattering resonances. However, we can still test the effectiveness of the theory 
in predicting them. Formally, scattering resonances are the poles of the scattering 
S matrix and by an identity of Balian and Bloch are also poles of the quantum 
Green's function. We demonstrate this fact in chapter 35 for 2-dimensional scat- 
terers. The poles have complex wavenumber k, as for the 3-disk problem. 

Let us first consider how diffractive orbits arise in evaluating the trace of G 
which we call g(k). Specifying the trace means that we must consider all paths 
which close on themselves in the configuration space while stationary phase ar- 
guments for large wavenumber k extract those which are periodic - just as for 
classical trajectories. In general, g(k) is given by the sum over all diffractive and 
geometric orbits. The contribution of the simple diffractive orbit labeled y shown 
in figure 38.5 to g(k) is determined as follows. 

We consider a point P just a little off the path and determine the semiclassical 
Green's function to return to P via the vertex using (38.9) and (38.10). To leading 
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Figure 38.5: The dashed line shows a simple periodic 
diffractive orbit y. Between the vertex V and a point P 
close to the orbit there are two geometric legs labeled 
±. The origin of the coordinate system is chosen to be 
atR. 



P (x,y) 




order in y the lengths of the two geometric paths connecting P and V are d± = 
(L±x)+y 2 / (L±x) 2 /2 so that the phase factor ik(d++d-) equals HkL+iky 2 /(L 2 -x 2 ). 
The trace integral involves integrating over all points P and is 



J(2kL + n/2) C L dx r l ikf L\ 



e i(lkL + nll) ^ dx r° 
g y (k) « -2d 7 — 

8nk Jo VL 2 - x 2 J-o 



dye\ J (38.15) 



We introduced an overall negative sign to account for the reflection at the hard wall 
and multiplied by 2 to account for the two traversal senses, VRPV and VPRV. In 
the spirit of stationary phase integrals, we have neglected the y dependence ev- 
erywhere except in the exponential. The diffraction constant dy is the one corre- 
sponding to the diffractive periodic orbit. To evaluate the y integral, we use the 
identity 



f 

J — 0 



J*'* J -, (38.16) 



and thus obtain a factor which precisely cancels the x dependence in the x integral. 
This leads to the rather simple result 



87 ^-h\-^-\ e i^) (38.17) 



^Snkly 



where l y - 2L is the length of the periodic diffractive orbit. A more sophisticated 
analysis of the trace integral has been done [6] using the integral representation 
(38.13). It is valid in the vicinity of an optical boundary and also for wedges with 
opening angles close to n/n. 

Consider a periodic diffractive orbit with riy reflections off straight hard walls 
and fly diffractions each with a diffraction constant dyj. The total length of the 
orbit L y = J_] lyj is the sum of the various diffractive legs and ly is the length of 
the corresponding prime orbit. For such an orbit, 08.17) generalizes to 



exp{i(kLy + n r n-3nyK/4)}. (38.18) 
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exercise 38.3 

Each diffraction introduces a factor of 1 / and multi-diffractive orbits are thereby 
suppressed. 

If the orbit y is prime then Ly = l y . If y is the r'th repeat of a prime orbit /? we 
have L y - rip, n y = rpp and yu r = rcrp, where lp, pp and crp all refer to the prime 
orbit. We can then write 

8y = 8^ = -^ k fp (38.19) 

where 



tp = \Y\ ~j== \ ex P UWf> + Pp* - 3^/4)}. (38.20) 



j= i yfiiMjij 



It then makes sense to organize the sum over diffractive orbits as a sum over the 
prime diffractive orbits and a sum over the repetitions 



sdiffW = 2 X - ~Tk 2 bizz- (38 - 21) 

P r=l LK P 1 V 



We cast this as a logarithmic derivative (19.7) by noting that -# = Up 



dtp 

dk 

(Tptp/lk and recognizing that the first term dominates in the semiclassical limit. It 
follows that 



^-slfn^-^}- (38 - 22) 



In the case that there are only diffractive periodic orbits - as in the geometry of 
figure 38.4 - the poles of g(k) are the zeros of a dynamical zeta function 

l/ak) = Y\(l-tp). (38.23) 

P 

For geometric orbits, this function would be evaluated with a cycle expansion as 
discussed in chapter 20. However, here we can use the multiplicative nature of the 
weights tp to find a closed form representation of the function using a transition 
graph, as in chapter 14. This multiplicative property of the weights follows from 
the fact that the diffractive Green's function (38.10) is multiplicative in segment 
semiclassical Green's functions, unlike the geometric case. 
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Figure 38.6: The two-node transition graph with all 1 1 
the diffractive processes connecting the nodes. 



There is a reflection symmetry in the problem which means that all resonances 
can be classified as even or odd. Because of this, the dynamical zeta function fac- 
torizes as l/£ = l/£+£- (as explained in sects. 21.5 and 21.1.1) and we determine 
l/£+ and l/£_ separately using the ideas of symmetry decomposition of chap- 
ter 21. 

In the transition graph shown in figure 38.6, we enumerate all processes. We 
start by identifying the fundamental domain as just the right half of figure 38.4. 
There are two nodes which we call A and B. To get to another node from B, we 
can diffract (always via the vertex) in one of three directions. We can diffract back 
to B which we denote as process 1. We can diffract to fi's image point B and then 
follow this by a reflection. This process we denote as 2 where the bar indicates 
that it involves a reflection. Third, we can diffract to node A. Starting at A we can 
also diffract to a node in three ways. We can diffract to B which we denote as 4. 
We can diffract to B! followed by a reflection which we denote as 4. Finally, we 
can diffract back to A which we denote as process 5. Each of these processes has 
its own weight which we can determine from the earlier discussion. First though, 
we construct the dynamical zeta functions. 

The dynamical zeta functions are determined by enumerating all closed loops 
which do not intersect themselves in figure 38.6. We do it first for l/£ + because 
that is simpler. In that case, the processes with bars are treated on an equal footing 
as the others. Appealing back to sect. 21.5 we find 

l/£+ - 1 - h - ti - ?5 - hk - ht% + hh + > 

= 1 - fa + 1- 2 + t 5 ) - 2t 3 t 4 + t 5 (h + h) (38.24) 

where we have used the fact that U, - t 4 by symmetry. The last term has a positive 
sign because it involves the product of shorter closed loops. To calculate l/£, 
we note that the processes with bars have a relative negative sign due to the group 
theoretic weight. Furthermore, process 5 is a boundary orbit (see sect.21.3.1) and 
only affects the even resonances - the terms involving £ are absent from l/£_. The 
result is 



l/S- = 1 - h + - t 3 t 4 + t 3 t- 4 , 

= l-(.h- ti). (38.25) 

Note that these expressions have a finite number of terms and are not in the form 

■ .■ , exercise 38.4 

of a curvature expansion, as for the 3-disk problem. 

It now just remains to fix the weights. We use equation (38.20) but note that 
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Figure 38.7: The even resonances of the wedge scat- 
terer of figure 38.4 plotted in the complex /c-plane, 
with L = 1. The exact resonances are represented 
as circles and their semiclassical approximations as 




20.0 40.0 



each weight involves just one diffraction constant. It is then convenient to define 
the quantities 

2 exp{j(2fcL + 2n)} 2 exp{i(2kH + n)} 

Ua = TtFtt — Ub = Ttttttj — ■ (38 " 26) 

The lengths L and H - L/ V2 are defined in figure 38.4; we set L = 1 throughout. 
Bouncing inside the right angle at A corresponds to two specular reflections so that 
p = 2. We therefore explicitly include the factor exp (i2n) in (58.26) although it is 
trivially equal to one. Similarly, there is one specular reflection at point B giving 
p = 1 and therefore a factor of exp {in). We have defined u& and ub because, 
together with some diffraction constants, they can be used to construct all of the 
weights. Altogether we define four diffraction coefficients: 4ts is the constant 
corresponding to diffracting from B to A and is found from (38.11) with 9' = 3n/4 
and 6 - n and equals 2 sec (7r/8) « 2.165. With analogous notation, we have 4u 
and cIbb - ds'B which equal 2 and 1 + V2 respectively. d,j = dp due to the Green's 
function symmetry between source and receiver referred to earlier. Finally, there 
is the diffractive phase factor s - exp (-i3n/4) each time there is a diffraction. 
The weights are then as follows: 

t\ = sdsB^B ^2 = sdjj'BU^i — = — sd^B^AUB 

?5 = sdAAu\- (38.27) 



Each weight involves two u's and one d. The u's represent the contribution to 
the weight from the paths connecting the nodes to the vertex and the d gives the 
diffraction constant connecting the two paths. 

The equality of dsB and ds'B implies that t\ = From (38.25) this means that 
there are no odd resonances because 1 can never equal 0. For the even resonances 
equation (38.24) is an implicit equation for k which has zeros shown in figure38.7. 

For comparison we also show the result from an exact quantum calculation. 
The agreement is very good right down to the ground state - as is so often the 
case with semiclassical calculations. In addition we can use our dynamical zeta 
function to find arbitrarily high resonances and the results actually improve in that 
limit. In the same limit, the exact numerical solution becomes more difficult to 
find so the dynamical zeta function approximation is particularly useful in that 

case- 
exercise 38.5 

In general a system will consist of both geometric and diffractive orbits. In 
that case, the full dynamical zeta function is the product of the geometric zeta 
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function and the diffractive one. The diffractive weights are typically smaller by 
order 0(1/ y/k) but for small k they can be numerically competitive so that there is 
a significant diffractive effect on the low-lying spectrum. It might be expected that 
higher in the spectrum, the effect of diffraction is weaker due to the decreasing 
weights. However, it should be pointed out that an analysis of the situation for 
creeping diffraction [7] concluded that the diffraction is actually more important 
higher in the spectrum due to the fact that an ever greater fraction of the orbits 
need to be corrected for diffractive effects. The equivalent analysis has not been 
done for edge diffraction but a similar conclusion can probably be expected. 

To conclude this chapter, we return to the opening paragraph and discuss the 
possibility of doing such an analysis for helium. The important point which al- 
lowed us to successfully analyze the geometry of figure38.4 is that when a trajec- 
tory is near the vertex, we can extract its diffraction constant without reference to 
the other facets of the problem. We say, therefore, that this is a "local" analysis 
for the purposes of which we have "turned off" the other aspects of the prob- 
lem, namely sides AB and AB' . By analogy, for helium, we would look for some 
simpler description of the problem which applies near the three body collision. 
However, there is nothing to "turn off." The local problem is just as difficult as 
the global one since they are precisely the same problem, just related by scaling. 
Therefore, it is not at all clear that such an analysis is possible for helium. 



Resume 



In this chapter we have discovered new types of periodic orbits contributing to the 
semiclassical traces and determinants. Unlike the periodic orbits we had seen so 
far, these are not true classical orbits. They are generated by singularities of the 
scattering potential. In these singular points the classical dynamics has no unique 
definition, and the classical orbits hitting the singularities can be continued in 
many different directions. While the classical mechanics does not know which 
way to go, quantum mechanics solves the dilemma by allowing us to continue in 
all possible directions. The likelihoods of different paths are given by the quan- 
tum mechanical weights called diffraction constants. The total contribution to a 
trace from such orbit is given by the product of transmission amplitudes between 
singularities and diffraction constants of singularities. The weights of diffractive 
periodic orbits are at least of order 1/V& weaker than the weights associated with 
classically realizable orbits, and their contribution at large energies is therefore 
negligible. Nevertheless, they can strongly influence the low lying resonances 
or energy levels. In some systems, such as the N disk scattering the diffraction 
effects do not only perturb semiclassical resonances, but can also create new low 
energy resonances. Therefore it is always important to include the contributions of 
diffractive periodic orbits when semiclassical methods are applied at low energies. 
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Commentary 

Remark 38.1 Classical discontinuities. Various classes of discontinuities for billiard 
and potential problems discussed in the literature: 

• a grazing condition such that some trajectories hit a smooth surface while others 
are unaffected, refs. [1, 2, 3, 7] 

• a vertex such that trajectories to one side bounce differently from those to the other 
side, refs. [2,4, 5, 8, 9]. 

• a point scatterer [10, 11] or a magnetic flux line [12, 13] such that we do not know 
how to continue classical mechanics through the discontinuities. 

Remark 38.2 Geometrical theory of diffraction. In the above discussion we borrowed 
heavily from the ideas of Keller who was interested in extending the geometrical ray 
picture of optics to cases where there is a discontinuity. He maintained that we could 
hang onto that ray-tracing picture by allowing rays to strike the vertex and then leave at 
any angle with amplitude (38.8). Both he and Sommerfeld were thinking of optics and not 
quantum mechanics and they did not phrase the results in terms of semiclassical Green's 
functions but the essential idea is the same. 

Remark 38.3 Generalizations Consider the effect of replacing our half line by a wedge 
of angle j\ and the right angle by an arbitrary angle jo- If 72 > J\ and j2 > n/2 this is an 
open problem whose solution is given by equations (38.24) and (38.25) (there will then 
be odd resonances) but with modified weights reflecting the changed geometry [ 8]. (For 
72 < -t/2, more diffractive periodic orbits appear and the dynamical zeta functions are 
more complicated but can be calculated with the same machinery.) When j2 = J\, the 
problem in fact has bound states [21, 22]. This last case has been of interest in studying 
electron transport in mesoscopic devices and in microwave waveguides. However we can 
not use our formalism as it stands because the diffractive periodic orbits for this geometry 
lie right on the border between illuminated and shadowed regions so that equation ( 38.7) 
is invalid. Even the more uniform derivation of [ 6] fails for that particular geometry, the 
problem being that the diffractive orbit actually lives on the edge of a family of geometric 
orbits and this makes the analysis still more difficult. 

Remark 38.4 Diffractive Green's functions. The result (38.17) is proportional to 
the length of the orbit times the semiclassical Green's function (38.9) to go from the 
vertex back to itself along the classical path. The multi-diffractive formula (38.18) is 
proportional to the total length of the orbit times the product of the semiclassical Green's 
functions to go from one vertex to the next along classical paths. This result generalizes 
to any system — either a pinball or a potential — which contains point singularities such 
that we can define a diffraction constant as above. The contribution to the trace of the 
semiclassical Green's function coming from a diffractive orbit which hits the singularities 
is proportional to the total length (or period) of the orbit times the product of semiclassical 
Green's functions in going from one singularity to the next. This result first appeared in 
reference [2] and a derivation can be found in reference [ 9]. A similar structure also exists 
for creeping [ 2] . 



whelm - 30nov2001 



ChaosBook.org version 14.4.1, Apr 21 2013 



EXERCISES 



738 



Remark 38.5 Diffractive orbits for hydrogenic atoms. An analysis in terms of diffrac- 
tive orbits has been made in a different atomic physics system, the response of hydrogenic 
atoms to strong magnetic fields [23]. In these systems, a single electron is highly excited 
and takes long traversals far from the nucleus. Upon returning to a hydrogen nucleus, 
it is re-ejected with the reversed momentum as discussed in chapter 37. However, if the 
atom is not hydrogen but sodium or some other atom with one valence electron, the re- 
turning electron feels the charge distribution of the core electrons and not just the charge 
of the nucleus. This so-called quantum defect induces scattering in addition to the clas- 
sical re-ejection present in the hydrogen atom. (In this case the local analysis consists of 
neglecting the magnetic field when the trajectory is near the nucleus.) This is formally 
similar to the vertex which causes both specular reflection and diffraction. There is then 
additional structure in the Fourier transform of the quantum spectrum corresponding to 
the induced diffractive orbits, and this has been observed experimentally [ 24]. 



Exercises 

38.1. Stationary phase integral. Evaluate the two station- 
ary phase integrals corresponding to contours E i and Ei 
of figure 38.3 and thereby verify (38.7). 

(N. Whelan) 

38.2. Scattering from a small disk Imagine that instead 
of a wedge, we have a disk whose radius a is much 
smaller than the typical wavelengths we are considering. 
In that limit, solve the quantum scattering problem - find 
the scattered wave which result from an incident plane 
wave. You can do this by the method of partial waves - 
the analogous three dimensional problem is discussed in 
most quantum textbooks. You should find that only the 
m = 0 partial wave contributes for small a. Following 
the discussion above, show that the diffraction constant 
is 

d = (38.28) 

where y e — 0.577 ■ ■ ■ is Euler's constant. Note that in 
this limit d depends weakly on k but not on the scatter- 
ing angle. 

(N. Whelan) 

38.3. Several diffractive legs. Derive equation ( 3 8 . 1 8) . The 

calculation involves considering slight variations of the 
diffractive orbit as in the simple case discussed above. 
Here it is more complicated because there are more 
diffractive arcs - however you should convince yourself 



that a slight variation of the diffractive orbit only affects 
one leg at a time. 

(N. Whelan) 

.4. Unsymmetrized dynamical zeta function. As- 
sume you know nothing about symmetry decomposi- 
tion. Construct the 3-node transition graph for fig- 
ure 38.1 by considering A, B and B' to be physically 
distinct. Write down the corresponding dynamical zeta 
function and check explicitly that for B = B ' it factor- 
izes into the product of the even and odd dynamical zeta 
functions. Why is there no term t% in the full dynamical 
zeta function? 

(N. Whelan) 

.5. Three point scatterers. 

Consider the limiting case of the three disk game of pin- 
ball of figure 1 . 1 where the disks are very much smaller 
than their spacing R. Use the results of exercise 38.2 to 
construct the desymmetrized dynamical zeta functions, 
as in sect. 21.6. You should find =1-2? where 

t = de'^ 3 "^ I yf&rkR. Compare this formula with that 
from chapter 11. By assuming that the real part of k 
is much greater than the imaginary part show that the 
positions of the resonances are k„R = a„ - iB„ where 
a„ = 2nn + 3tt/4, B n = log ( yj2na„ld^ and n is a non- 
negative integer. (See also reference [11].) 

(N. Whelan) 



exer Whelan - 18dec97 



ChaosBook.org versionl4.4.1, Apr21 2013 



REFERENCES 



739 



References 

[38.1] A. Wirzba, CHAOS 2, 77 (1992); 

[38.2] G. Vattay, A. Wirzba and P. E. Rosenqvist, Phys. Rev. Lett. 73, 2304 
(1994); G. Vattay, A. Wirzba and P. E. Rosenqvist in Proceedings of the 
International Conference on Dynamical Systems and Chaos: vol. 2, edited 
by Y.Aizawa, S.Saito and K.Shiraiwa (World Scientific, Singapore, 1994). 

[38.3] H. Primack, H. Schanz, U. Smilansky and I. Ussishkin, Phys. Rev. Lett. 
76, 1615 (1996). 

[38.4] N. D. Whelan, Phys. Rev. E 51, 3778 (1995). 

[38.5] N. Pavloff and C. Schmit, Phys. Rev. Lett. 75, 61 (1995). 

[38.6] M. Sieber, N. Pavloff, C. Schmit, Phys. Rev. E 55, 2279 (1997). 

[38.7] H. Primack et. al., J. Phys. A 30, 6693 (1997). 

[38.8] N. D. Whelan, Phys. Rev. Lett. 76, 2605 (1996). 

[38.9] H. Bruus and N. D. Whelan, Nonlinearity, 9, 1 (1996). 

[38.10] P. Seba, Phys. Rev. Lett. 64, 1855 (1990). 

[38.11] P. E. Rosenqvist, N. D. Whelan and A. Wirzba, J. Phys. A 29, 5441 

(1996) . 

[38.12] M. Brack et. al., Chaos 5, 317 (1995). 

[38.13] S. M. Reimann et. al., Phys. Rev. A 53, 39 (1996). 

[38.14] A. Sommerfeld, Mathem. Ann. 47, 317 (1896); Optics (Academic Press, 
New York 1954). 

[38.15] H. S. Carslaw, Proc. London Math. Soc. (Ser. 1) 30, 121 (1989); H. S. 
Carslaw, Proc. London Math. Soc. (Ser. 2) 18, 291 (1920). 

[38.16] J. B. Keller, J. Appl. Phys. 28, 426 (1957). 

[38.17] A. Voros, J. Phys. A 21. 685 (1988). 

[38.18] see for example, D. Ruelle, Statistical Mechanics, Thermodynamic For- 
malism (Addison-Wesley, Reading MA, 1978). 

[38.19] see for example, P. Grassberger, Z. Naturforsch. 43a, 671 (1988). 

[38.20] P. Cvitanovic and B. Eckhardt, Nonlinearity 6, 277 (1993). 

[38.21] P. Exner, P. Seba and P. Stovicek, Czech J. Phys B39, 1181 (1989). 

[38.22] Hua Wu and D. W. L. Sprung, J. Appl. Phys. 72, 151 (1992). 

[38.23] P. A. Dando, T. S. Monteiro, D. Delande and K. T. Taylor, Phys. Rev. 
Lett. 74, 1099 (1995). P. A. Dando, T. S. Monteiro and S. M. Owen, preprint 

(1997) . 



refsWhelan - I8dccl997 



ChaosBook.org versionl4.4.1, Apr21 2013 



References 740 

[38.24] D. Delande et. al., J. Phys. B 27, 2771 (1994); G. Raithel et. al., J. Phys. 
B 27, 2849 (1994); M. Courtney et. al., Phys. Rev. Lett., 73, 1340 (1994). 



refsWhelan- I8decl997 



ChaosBook.org versionl4.4.1, Apr21 2013 



Epilogue 



Nowadays, whatever the truth of the matter may be (and 
we will probably never know), the simplest solution is no 
longer emotionally satisfying. Everything we know about 
the world militates against it. The concepts of indetermi- 
nacy and chaos have filtered down to us from the higher 
sciences to confirm our nagging suspicions. 



— L. Sante, "Review of 'American Tabloid' by James 
Ellroy," New York Review of Books (May 11, 1995) 




motion on a strange attractor can be approximated by shadowing long or- 



bits by sequences of nearby shorter periodic orbits. This notion has here 



L -™- been made precise by approximating orbits by prime cycles, and evalu- 
ating associated curvatures. A curvature measures the deviation of a long cycle 
from its approximation by shorter cycles; the smoothness of the dynamical system 
implies exponential fall-off for (almost) all curvatures. We propose that the theo- 
retical and experimental non-wandering sets be expressed in terms of the symbol 
sequences of short cycles (a topological characterization of the spatial layout of 
the non-wandering set) and their eigenvalues (metric structure) 



Cycles as the skeleton of chaos 

We wind down this all-too-long treatise by asking: why cycle? 

We tend to think of a dynamical system as a smooth system whose evolu- 
tion can be followed by integrating a set of differential equations. Traditionally 
one used integrable motions as zeroth-order approximations to physical systems, 
and accounted for weak nonlinearities perturbatively. However, when the evo- 
lution is actually followed through to asymptotic times, one discovers that the 
strongly nonlinear systems show an amazingly rich structure which is not at all 
apparent in their formulation in terms of differential equations. In particular, the 
periodic orbits are important because they form the skeleton onto which all trajec- 
tories trapped for long times cling. This was already appreciated century ago by 
H. Poincare, who, describing in Les methodes nouvelles de la mechanique celeste 
his discovery of homoclinic tangles, mused that "the complexity of this figure will 
be striking, and I shall not even try to draw it." Today such drawings are cheap and 
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plentiful; but Poincare went a step further and, noting that hidden in this apparent 
chaos is a rigid skeleton, a tree of cycles (periodic orbits) of increasing lengths 
and self-similar structure, suggested that the cycles should be the key to chaotic 
dynamics. 

The zeroth-order approximations to harshly chaotic dynamics are very differ- 
ent from those for the nearly integrable systems: a good starting approximation 
here is the stretching and kneading of a baker's map, rather than the winding of a 
harmonic oscillator. 

For low dimensional deterministic dynamical systems description in terms of 
cycles has many virtues: 

1. cycle symbol sequences are topological invariants: they give the spatial 
layout of a non-wandering set 

2. cycle eigenvalues are metric invariants: they give the scale of each piece of 
a non-wandering set 

3. cycles are dense on the asymptotic non-wandering set 

4. cycles are ordered hierarchically: short cycles give good approximations 
to a non-wandering set, longer cycles only refinements. Errors due to ne- 
glecting long cycles can be bounded, and typically fall off exponentially or 
super-exponentially with the cutoff cycle length 

5. cycles are structurally robust: for smooth flows eigenvalues of short cycles 
vary slowly with smooth parameter changes 

6. asymptotic averages (such as correlations, escape rates, quantum mechan- 
ical eigenstates and other "thermodynamic" averages) can be efficiently 
computed from short cycles by means of cycle expansions 

Points 1, 2: That the cycle topology and eigenvalues are invariant properties 
of dynamical systems follows from elementary considerations. If the same dy- 
namics is given by a map / in one set of coordinates, and a map g in the next, 
then / and g (or any other good representation) are related by a reparametrization 
and a coordinate transformation / = ]\ x o g oh. As both / and g are arbitrary 
representations of the dynamical system, the explicit form of the conjugacy h is 
of no interest, only the properties invariant under any transformation h are of gen- 
eral import. The most obvious invariant properties are topological; a fixed point 
must be a fixed point in any representation, a trajectory which exactly returns to 
the initial point (a cycle) must do so in any representation. Furthermore, a good 
representation should not mutilate the data; h must be a smooth transformation 
which maps nearby periodic points of / into nearby periodic points of g. This 
smoothness guarantees that the cycles are not only topological invariants, but that 
their linearized neighborhoods are also metrically invariant. In particular, the cy- 
cle eigenvalues (eigenvalues of the Jacobian matrixs df(x)/dx of periodic orbits 
f(x) = x) are invariant. 
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Point 5: An important virtue of cycles is their structural robustness. Many 
quantities customarily associated with dynamical systems depend on the notion 
of "structural stability," i.e., robustness of non-wandering set to small parameter 
variations. 

Still, the sufficiently short unstable cycles are structurally robust in the sense 
that they are only slightly distorted by such parameter changes, and averages com- 
puted using them as a skeleton are insensitive to small deformations of the non- 
wandering set. In contrast, lack of structural stability wreaks havoc with long time 
averages such as Lyapunov exponents, for which there is no guarantee that they 
converge to the correct asymptotic value in any finite time numerical computation. 

The main recent theoretical advance is point 4: we now know how to con- 
trol the errors due to neglecting longer cycles. As we seen above, even though 
the number of invariants is infinite (unlike, for example, the number of Casimir 
invariants for a compact Lie group) the dynamics can be well approximated to 
any finite accuracy by a small finite set of invariants. The origin of this conver- 
gence is geometrical, as we shall see in appendix J. 1.2, and for smooth flows the 
convergence of cycle expansions can even be super-exponential. 

The cycle expansions such as (20.7) outperform the pedestrian methods such 
as extrapolations from the finite cover sums (22.2) for a number of reasons. The 
cycle expansion is a better averaging procedure than the naive box counting al- 
gorithms because the strange attractor is here pieced together in a topologically 
invariant way from neighborhoods ("space average") rather than explored by a 
long ergodic trajectory ("time average"). The cycle expansion is co-ordinate and 
reparametrization invariant - a finite nth level sum (22.2) is not. Cycles are of 
finite period but infinite duration, so the cycle eigenvalues are already evaluated 
in the n — > oo limit, but for the sum (22.2) the limit has to be estimated by numer- 
ical extrapolations. And, crucially, the higher terms in the cycle expansion (20.7) 
are deviations of longer prime cycles from their approximations by shorter cycles. 
Such combinations vanish exactly in piecewise linear approximations and fall off 
exponentially for smooth dynamical flows. 

In the above we have reviewed the general properties of the cycle expansions; 
those have been applied to a series of examples of low-dimensional chaos: 1- 
d strange attractors, the period-doubling repeller, the Henon-type maps and the 
mode locking intervals for circle maps. The cycle expansions have also been 
applied to the irrational windings set of critical circle maps, to the Hamiltonian 
period-doubling repeller, to a Hamiltonian three-disk game of pinball, to the three- 
disk quantum scattering resonances and to the extraction of correlation exponents, 
Feasibility of analysis of experimental non-wandering set in terms of cycles is 
discussed in ref. [1]. 
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Homework assignment 

"Lo! thy dread empire Chaos is restor'd, Light dies before 
thy uncreating word; Thy hand, great Anarch, lets the cur- 
tain fall, And universal darkness buries all." 
— Alexander Pope, The Dunciad 

We conclude cautiously with a homework assignment posed May 22, 1990 
(the original due date was May 22, 2000, but alas...): 

1. Topology Develop optimal sequences ("continued fraction approximants") 
of finite subshift approximations to generic dynamical systems. Apply to 
(a) the Henon map, (b) the Lorenz flow and (c) the Hamiltonian standard 
map. 

2. Non-hyperbolicity Incorporate power-law (marginal stability orbits,"intermittency") 
corrections into cycle expansions. Apply to long-time tails in the Hamilto- 
nian diffusion problem. 

3. Phenomenology Carry through a convincing analysis of a genuine experi- 
mentally extracted data set in terms of periodic orbits. 

4. Invariants Prove that the scaling functions, or the cycles, or the spectrum 
of a transfer operator are the maximal set of invariants of an (physically 
interesting) dynamically generated non-wandering set. 

5. Field theory Develop a periodic orbit theory of systems with many unstable 
degrees of freedom. Apply to (a) coupled lattices, (b) cellular automata, (c) 
neural networks. 

6. Tunneling Add complex time orbits to quantum mechanical cycle expan- 
sions (WKB theory for chaotic systems). 

7. Unitarity Evaluate corrections to the Gutz wilier semiclassical periodic or- 
bit sums, (a) Show that the zeros (energy eigenvalues) of the appropriate 
Selberg products are real, (b) Find physically realistic systems for which 
the "semiclassical" periodic orbit expansions yield the exact quantization. 

8. Atomic spectra Compute the helium spectrum from periodic orbit expan- 
sions (already accomplished by Wintgen and Tanner!). 

9. Symmetries Include fermions, gauge fields into the periodic orbit theory. 

10. Quantum field theory Develop quantum theory of systems with infinitely 
many classically unstable degrees of freedom. Apply to (a) quark confine- 
ment (b) early universe (c) the brain. 
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Good-bye. I am leaving because I am bored. 
— George Saunders' dying words 

Nadie puede escribir un libro. Para Que un libro sea ver- 
daderamente, Se requieren la aurora y el poniente Siglos, 
armas y el mar que une y separa. 

— Jorge Luis Borges El Hacedor, Ariosto y los arabes 
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associative, 808 
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chaotic, 492 

space, 335, 350 

time, 336, 348 
averaging, 29 

axiom A, 413, 468, 473, 475, 856 
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basin of attraction, 41 
basis vector, 807 
BER 
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shift, 228, 450, 455, 463, 474, 476, 
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shift eigenfunctions, 470 
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bifurcation 

Airy function approximation, 632 

bizarre, 831 

generic, 148 

Hopf, 549 

saddle-node, 74 
billiard, 143-148 

map, 144 

stability, 112, 145 

stadium, 143, 149, 173, 506, 883, 
886, 924 

binary 

prime cycles, 248, 257, 306 

symbolic dynamics 
collinear helium, 708 

tree, alternating, 237 
Birkhoff 

coordinates, 72, 144, 149 

ergodic theorem, 336 
block 

finite sequence, 234 
block, pruning, 236 
Bohr 
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helium, 704, 714 

Uetli Schwur, 782 
Bohr-de Broglie picture, 613 
Bohr-Sommerfeld quantization, 613, 631, 

664, 783 
Boltzmann 

equation, 25, 524 

stosszahlansatz, 25 
Boltzmann, L., 25, 769 
boredom, 745, 918 
Borges, J.L., 745 
Botox, 58 

boundary orbits, 428 
bounded operators, 904 
Bourbaki,N., 68 
Bowen, R., 31 
brain, rat, 5, 32 
branch cut, 487 

singularity, 488 
Brownian noise, 576 
Bunimovich 

stadium, see stadium billiard 
Burnett coefficient, 518 
Burnett coefficients, 26, 517, 524 
butterfly effect, 32 

Cjv = Dj symmetry, 257, 432 
canonical transformation, 129-131, 818 
Cartan 

-Killing classification, 129 
Cartan, E. , 205 
Cartwright, M.L., 173, 770 
Cauchy criterion, 902 
Cauchy-Green strain tensor, 115 
caustic, 641 

ceiling function, 377, 475 
center, 91 

center manifold, 540 
center of mass, 283 
central moment, 353 
centralizer, 160 
chain rule, matrix, 900 
chain-recurrent, 42 
change 

of coordinates, 48 
chaology, see chaos 
chaos, 7-9 

caveats, 10 

deterministic, 30 

diagnostics, 51 

quantum, 31 

skeleton of, 13, 14 

spatiotemporal, 530 

successes, 10 
character 



representation, 865 
characteristic 

equation, 813 

exponent, 117 

function, 328 

polynomial, 308, 814, 867 

value, 117 
chart, 39, 70 
section border, 62 
chicken heart palpitations, 8 
circle group, see SO(2) 
circle map, 228, 292, 323, 515, 521, 549- 
551, 556, 563, 566, 743, 938, 
969, 975 

critical, 552, 557 
class, 159 
class algebra, 886 
Clebsch-Gordan 

coefficients, 810 
co-moving frame, 105 
coarse-graining, 328 
coding, see symbolic dynamics 
collinear helium, 615 

symbolic dynamics, 708 
combinatorics 

teaching, 229 
compact 

group, 170 

invariant set, 159 
complete 

N-ary dynamics, 291 

symbolic dynamics, 291 
completeness relation, 809, 814, 868 
complex eigenvalues, 821 
complex Lorenz flow, 178, 207 

equivariance, 186 

relative periodic orbit, 190 
complexity 

algorithmic, 321, 416 
confession 

C.N. Yang, 335 

Kepler, 767 

St. Augustine, 328 
configuration space, 50 
conjugacy, 795 

invariant, 107 

smooth, 48, 107, 804 

topological, 230 
conjugate momentum, 126 
conjugate, hermitian, 864 
connection formulas, 630 
connection, method of, 205 
conservation 

equation, 573 
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phase-space volume, 128, 131, 132, 
135, 340 

continuity equation, 338, 340, 574, 578, 
639 

contour integral, 385 
contracting 

Floquet multipliers, 103, 368 

flow, 41,47, 88 

map, 97, 239 

state space, Rossler, 96, 419 
convergence 

abscissa of, 898 
abscissa of absolute, 413 
abscissa of abysimal, 413 
mediocre, 894 
radius, 379 

super-exponential, 460, 594 
convexity, 363 
coordinate 

change, 48, 795 

longitudinal, 649 

transformations, 804 
Copenhagen School, xiv, 783 
correlation 

decay 

power law, 482 

function, 470 

spectrum, 470 

time, 444 
coset, 158 
cost function, 595 

covariant Lyapunov vector, 119-121 
covariant vector, 80, 100, 101, 119 
covering 

symbolic dynamics, 234 
creeping 

1-disk, 695 
critical 

point, 111, 226, 231, see equilib- 
rium point 
value, 227, 515 
cross-section, 194, 205 
cumulant, 353, 416 

expansion, 307, 311,401 
Plemelj-Smithies, 908 
cumulant-generating function, 353 
curvature 

correction, 397 
expansion, 29, 398 
cycle, 111, see periodic orbit 
expansion, 19, 397, 663 
3-disk, 418 
finite subshift, 409 
Lyapunov exponent, 409 



stability ordered, 411 
fundamental, 308, 397, 892 
limit, 42, 103, 111, 114 
Lyapunov exponent, 117 
marginal stability, 106, 193 
prime, 235, 267, 323, 370 

3-disk, 267, 604 

Henon map, 597 
pruning, 319 
Rossler flow, 270, 282 
stability, 99-109 

Gauss map, 560 
stable, 111 
superstable, 111 
weight, 382 
cycle point, see periodic point 
cyclic 

group, 861, 862 
invariance, 267 
permutation matrix, 87 1 
symmetry, 304 
cyclist 

Lie groups, 184 

damped Newton method, 273 
Danish pastry, see symbol plane 
Darboux basis, 128, 138 
de Broglie wavelength, 633 
Debye approximation, 702 
decay 

rate, 389 

rate of correlations, 47 1 
decomposition 

irreducible, 870 
defining 

representation, 864 

vector space, 863 
deformation gradient, 90 
degenerate 

eigenvalues, 812 
degree of freedom, 10, 126, 544, 625 
delta function, see Dirac delta 

Dirac, 620 
density, 328, 573 

evolution, 24 

phase space, 340 
density of states 

average, 659 

Green's function, 621 

quantum, 620 
derivative, lattice, 871 
desymmetrization 

3-disk, 436 
desymmetrized state space, 194 
determinant 



index 27sep2010 



ChaosBook.org versionl4.4.1, Apr 21 2013 



INDEX 



749 



for flows, 381 

Fredholm, 911 

graph, 322 

Hadamard, 379 

spectral, 24, 307, 379 

trace relation, 307 

trace-class operator, 905 
deterministic dynamics, 7, 38, 337 
diagonalizing matrix, 867 
diffeomorphism, 73 
differential equations 

almost ordinary, 53 
diffraction 

Green's function, 729 

Keller, 737 

Sommerfeld, 737 
diffusion 

anisotropic, 575 

anomalous, 521 

constant, 352 

equation, 575 

limited aggregates, 32 

tensor, 575 
digraph, see transition graph 
dihedral group, 861, 862 
dike map, 232, 239 
dimension 

box counting, 92 1 

fractal, 921 

generalized, 11 

information, 92 1 , 922 

intrisic, 10, 544 

symplectic, 137 
Dirac delta, 22, 23, 313, 330, 343, 354, 
370, 377, 390, 464, 575, 622, 
642, 654 

derivatives, 343 

Jacobian, 338 
Dirac path integral, 653 
Dirichlet 

series, 898 
Dirichlet series, 416, 891, 898, 899 
discrete 

Fourier transform, 875 
dissipation 

rate, Kuramoto-Sivashinsky, 545 

rate, Navier-Stokes, 546 
dissipative 

map, 97, 239 
divergence rate, local, 36 1 
divergence ultraviolet, 66 1 
DLA, 32 

dof, see degree of freedom 
dot product, 182 



doubling map, 228, 467 
drift, along group tangent, 1 80 
dual 

representation, 807, 863, 864 

space, 807, 863 

vector space, 863 
Duffing oscillator, 44, 51, 60, 127 
dynamical 

localization, quantum, 139 

transitivity, 289 

zeta function, 19, 383 
Euler product rep., 383 
dynamical system, 37, 39 

axiom A, 413, 473, 856 

equivalent, 803 

gradient, 53 

hyperbolic, 388, 467 

infinite, 545 

smooth, 20, 21, 29, 39, 315, 770, 
941,944, 946 

dynamics 

deterministic, 7, 38 
hyperbolic, 291 
irreversible, 43 
reversible, 43 
spatiotemporal, 31 
stochastic, 8 
symbolic, 12, 218, 233 
symmetry, 155, 178 
topological, 218, 233, 235, 290 

edge, 291 
eigendirection, 80 
eigenfunction 

Perron-Frobenius operator, 453 

energy, 618 

Perron-Frobenius, 453 
eigenstate, see eigenfunction 
eigenvalue, 389 

Perron-Frobenius operator, 453 

complex, 821 

degenerate, 812 

exponential spacing, 380 

zero, 631, 646 
Einstein diffusion formula, 575, 587 
Einstein, A, 786 
elastic 

scattering, 670 
elliptic 

stability, 133 
empirical mean, 352 
enemy 

thy, 482 
energy 

Kuramoto-Sivashinsky, 545 
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Navier-Stokes, 546 
English 

plain, 204, 234 
ensemble 

microcanonical, 363 
entire function, 453 
entropy 

barrier, 413, 775, 899 

Gauss map, 567 

Kolmogorov, 148, 322, 916, 918, 
923, 924 

topological, 8, 302, 313, 322 
equations of variations, 78 
equilibrium, 187 

Kuramoto-Sivashinsky, 537 

Lorenz flow, 45, 64 

point, 44, 100, 334, 537, 597 

Rossler flow, 46, 53, 92, 222 

relative, 187 
equilibrium measure, see natural mea- 
sure 
equivalence 

of dynamical systems, 803 
equivariance, 155, 156 

complex Lorenz flow, 186 
equivariant, see relative 
ergodic 

average, 336 

theorem 

multiplicative, 120 

theory, 336 
error correlation matrix, 117 
error matrix, 90 

escape rate, 14, 15, 343, 359, 361, 385, 
387, 392, 403, 409, 417, 418, 
442, 456, 476, 579, 858, 891, 
918, 922, 923, 967 
3-disk, 408,418,440 
intermittency, 49 1 
vanishing, 344, 407, 829 
essential 

spectral radius, 463, 472 
spectrum, 463 
Euler 

formula, 82, 455 
limit, 84 

-MacLaurin formula, 47 1 

product, 84, 387 

product rep. 

dynamical zeta function, 383 

totient function, 553 
Eulerian coordinates, 47, 58, 79 
evolution 

group, 53 



kernel probabilistic, 337 

operator, 22, 355 
quantum, 620 

semigroup, 355 
evolution operator, 90 
expanding 

Floquet multipliers, 103, 368 
expectation, 353 
expectation value, 350, 364 
expected value, 353 
exponent 

Floquet, 103 
exponential 

convergence, 379, 460 

decay rate of correlations, 47 1 

of a matrix, 83 

proliferation, 22, 321, 416 
extremal point, 626 

factor group, 159 
false zeros, 387 
Farey 

map, 221,481,485, 505 

mediant, 554 

series, 552 

tree, 554 
Feigenbaum 

constant, 377 
Feynman path integral, 645, 653 
fibre, 43 
fibre bundle, 43 
Fick law, 574 
finite group, 86 1 
finite subshift 

cycle expansion, 409 
first return time, 58, 71, 501 
fixed point, 267 

maps, 75 

marginally stable, 481 

subspace, 160 

under G, 160, 162, 163 
Floquet 

exponent, 88, 103, 132 

multiplier, 101, 103, 242, 368 

multiplier, metric invariant, 106 

theorem, 102 

theory, 103, 111 

vector, 100 
flow, 37-50 

-invariant set, 99 

autonomous, 44 

contracting, 41, 47, 88 

deterministic, 337 

elliptic, 104 

generator of, 338, 842 
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Hamiltonian, 126, 886 
hyperbolic, 104, 133, 389 
incompressible, 88, 340 
infinite-dimensional, 529-545 
invariant subspace, 161 
inverse hyperbolic, 133 
linear, 81, 89 
linearized, 90 
map, 39 

nonhyperbolic, 104 

spectral determinant, 381 

stability, 84 

stationary, 44 

stochastic, 337 

stretch & fold, 226 
flow map, 50 
flows, 73 

Fokker-Planck equation, 578 
form, normal, 801 
Fourier mode, truncation, 531 
Fourier transform, discrete, 875 
Frechet derivative, 90 
fractal, 32, 919 

aggregates, 11 

dimension, 99, 921 

geometry of nature, 1 1 

probabilistic, 11 

science, 11 
Fredholm 

determinant, 911 

theory, 460, 461 
free energy, 354 
freezing, 194 

Frenkel-Kontorova model, 606 
frequency analysis, 5 1 
Fresnel integral, 626, 632 
full shift, 291 
function, 73 

L 2 square-integrable, 472 

analytic, 471 

space, piecewise constant, 371 
functional, 335 

composition, 42 

Lyapunov, 41 
fundamental 

cycle, 308 

cycles, 892 

domain, 257 

collinear helium, 707 

matrix, 83,90, 810, 846 

G-equivariant, 181 
G-fixed, 160, 162, 163 
G-invariant 



basis, see invariant polynomial ba- 
sis 

polynomial basis, 170, 199 
G-invariant 

polynomial basis, 170, 173, 174,205 
Gp-symmetric, 162 
Galerkin truncation, 533 
Galilean invariance, 530, 545 
Gatto Nero, professor, 23 1 
gauge 

fixing, 70 

invariance, 70 
gauge fixing, 63 

Gauss map, 344, 505, 553, 554, 564 

cycle stability, 560 

metric entropy, 567 
Gauss shift, see Gauss map 
Gaussian 

integral, 343,517,589, 654 

integral, cZ-dimensional, 589, 642 

noise, 951 

probability density, 584 
general linear group, 154 
generating 

function, 370, 606 

orbit, 238 

partition, 235 
generator 

anti-hermitian, 183, 184 

Lie algebra, 129, 182 

of flow, 338, 842 
Gilmore, R., 173 
GL(d) 

general linear group, 154 
GL(n, F), 862 
golden mean, 292, 323 

pruning, 239, 292, 309, 323, 392, 
829, 973 

renormalization, 556, 564 
good taste, 237 
gradient 

algorithm, 595 

system, 53 
grammar 

symbolic dynamics, 236 
graph 

irreducible, 291 

strongly connected, 29 1 

transition, 288 
Gray codes, 237 
Green's function, 622 

analogue of, 846 

density of states, 621 

diffraction, 729 
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energy dependent, 620 

regularized, 661 

scattering, 676 

semiclassical, 651, 653, 654 

short distance, 650, 65 1 

trace, 620 

long orbits, 650 
Greene's residue criterion, 133 
group, 861 

compact, 170 

cyclic, 861, 862 

dihedral, 861, 862 

dynamical, 43 

evolution, 53 

finite, 153, 861 

general linear, 862 

Lie, 129, 180, 182, 184, 861 

matrix, 154 

not a, 886 

orbit, 160, 171, 181, 187 

orbit, equilibrium, 209 

orbit, marginal eigenvalue, 193 

orbit, slice, 188, 195, 198, 202 

orbit, velocity, 816 

order of, 153, 861 

representation, 865 

semi-, 338, 842 

symmetric, 861 

symmetry, reduction, 194 

tangent field, 182, 186 
Gutzwiller 

path integral, 653 

trace formula, 613, 659 
Gutzwiller, M., 774 

Hadamard determinant, 379 
Hadamard product, 662 
Hamilton 

-Cayley theorem, 809, 814 

-Jacobi equation, 585, 634, 651, 959 

equations, 634 

principal function, 585, 637 
Hamiltonian, 635 

dynamics, 125-136 

equations, 126 

flow, 126, 886 

spectral determinant, 383 
stability, 130, 819 

flows, stability, 817 

Henon map, 132 

matrix, 129, 137 

repeller, periodic orbits, 283 

separable, 618 
Hankel function, 650, 685, 702 
Hankel singular values, 115 

index 27scp2010 



Hannay-Ozorio de Almeida sum rule, 448 
harmonic oscillator, 804 
Harter, W. G, 886 
heat 

equation, 575 
Heaviside function, 621 
Heisenberg, 783 
picture, 902 
Heisenberg, W., 783 
helium, 704, 783 

collinear, 54, 75, 127, 615, 723 
cycles, 283, 723 
eigenenergies, 724 
fundamental domain, 707 
Poincare section, 723 
stabilities, 723 
stability, 283 
Helmholtz equation, 685 
Helmholtz free energy, 354 
Henon map, 67, 73, 133 
attractor, 122, 336 
cycles, 283, 595 
fixed points, 75, 249 
Hamiltonian, 132 
horseshoe, 249 
inverse, 249 

Lyapunov exponent, 122 

natural measure, 334 

prime cycles, 597, 607 

pruning, 895 

pruning front, 260 

stability, 96, 1 12 

structural stability, 26 1 

symmetries, 885 

time delay map, 275 

transient, 597 
Henon, M., 74 
Henon-Heiles 

symbolic dynamics, 173 
hermitian 

conjugation, 864 

matrix, 864 
heroes 

unsung, xiv, xix 
Hessian matrix, 130 
heteroclinic 

connection, 223, 224, 779 

intersection, 259 

orbit, 51, 94, 140, 173, 223, 229, 
238, 258 

Hilbert 

basis, see invariant polynomial ba- 
sis 

space, 619 
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Hilbert-Schmidt 

operators, 904 

condition, 46 1 
Hilbert-Weyl theorem, 170 
Holmes, P., 544 
homoclinic 

orbit, 45, 223 

point, 768 

tangency, 243 

tangle, 259, 563 
Hopf bifurcation, 549 
Hopf 's last hope, 780 
Hopf, Ebehardt, 549, 779, 780 
Hopf, Heinz, 721 
horseshoe, 248 

complete, 251 
hydrodynamical 

interpretation of QM, 653 
hyperbolic 

flow, 104, 133, 389 

non-, 25 

orbit, partially, 104 
systems, 349, 388, 403, 408, 442, 
467, 474, 492, 510, 521, 614, 
666, 776, 785, 918, 923, 964 
hyperbolicity assumption, 17, 369 

image space, 194, 200 
in/out nodes, 91 

inadmissible symbol sequence, 235 
incommensurate, 41 
incompressible flow, 88 
indecomposability, 289 

metric, 220 
index 

Maslov, see topological index 
index summation, repeated, 87, 130, 862 
indifferent stability, 80 
induced map, 495 
inertial manifold, 533, 544 
infinite-dimensional flows, 529-545 
inflection point, 550 
information dimension, 921, 922 
initial 

conditions, sensitivity to, 8 
point x 0 , 16, 39, 78 
state xq, 16, 39 
injective, 68 

integrable system, 127, 794 

integrated observable, 348, 349, 355, 361, 

370, 384, 397, 893 
integration 

by parts, lattice, 872 

Runge-Kutta, 53 



intermittency, 148, 344, 411, 417, 466, 
480, 521, 526, 614, 719, 923, 
932 

anomalous diffusion, 521 

escape rate, 491 

piecewise linear model, 483 

resummation, 497 

stability ordering, 500 
invariance 

cyclic, 267 

Galilean, 530, 545 

of flows, 105 

symplectic, 126, 137, 817 
invariant, 865 

density, see natural measure 

matrix, 864 

measure, 334 

measure, Gauss map, 344 

metric, 100, 107 

points, 160 

polynomial basis, 170, 173, 174, 194, 
199, 201,202, 205, 209 
SO(2), 200 
set, compact, 159 
subgroup, 159 
subspace, 161 
tensor, 185 
topological, 100 
tori, 203 
vector, 864 
inverse 

hyperbolic, 103, 133 
iteration, 271 

iteration, Hamiltonian repeller, 283 
inversion, 154 
involution, 862 
inward/outward spirals, 91 
irreducible 

decomposition, 870 

graph, 291 

matrix, 289 

segment, 166 
irrep, 870 

irreversibility, 24, 43 
Ising model, 256, 431, 436, 929, 931, 
942, 943, 948, 949 

isotropy 

subgroup, 161-163, 171, 172 
isotypic decomposition, 172 
iteration, 39 

inverse, 271 

Hamiltonian repeller, 283 

map, 66 
itinerary, 12, 14, 72, 219, 268 
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bi-infinite, 221, 234 
future, 227, 234 
past, 234 

Jacobi, C.G.J., 89 
Jacobian 

matrix, 16, 79, 90,811 
jacobian, 87, 329 

Henon map, 96 
Jonquiere function, 485, 506, 522, 523, 
526 

Jordan normal form, 813, 815 

KAM, 138 

tori, 480 
Kamiltonian, 130 
Kaplan- Yorke criterion, 83 
Karhunen-Loeve, 246 
Keller diffraction, 737 
Keller, J.B., 773 
Keplerian orbit, 613 
kernel 

resolving, 462 
kneading 

determinant, 238 

sequence, 232, 239 

theory, 23 1 

value, 232, 239 
Kolmogorov entropy, 148, 322, 916, 918, 

923, 924 
Koopman operator, 841, 846 
Kraichnan, Robert H., 780 
Kramers, 783 

Krein-Friedel-Lloyd formula, 677 
Kronecker delta, 807, 863 
KS, see Kustaanheimo-Stiefel 
Kuramoto, Y., 544 
Kuramoto-Sivashinsky 

equilibria, 537 

system, 534, 538, 540, 544 
kurtosis, 354, 364, 518 
Kustaanheimo-Stiefel transformation, 705, 
798 

L 2 function space, 472 
Lagrangian, 637 

coordinates, 47, 58, 79 

frame, 105 

manifold, 638 
laminar states, 480 
Langevin equation, 577, 587 
Laplace 

transform, 23, 313, 358, 372, 373, 

377, 620, 651,843 
transform, discrete, 306, 370, 503 



Laplace, Pierre-Simon de, 6 
Laplacian 

diagonalization, 888 

diagonalized, lattice, 878 

inverse, lattice, 873 

lattice, 872 

non-local, 888 
last hope, Hopf 's, 780 
lattice 

derivative, 871 

Fourier transform, 875 

integration by parts, 872 

Laplacian, 872 

Laplacian, diagonalized, 878 

Laplacian, inverse, 873 
least action principle, 267, 604 
Legendre transform, 637 
Leibniz, Gottfried Wilhelm, 6 
Letellier, C, 173 
level set, 126 
Liapunov, see Lyapunov 
libration orbit, 710, see self-retracing 
Lie 

algebra, 129, 182, 185, 808 
bracket, 186 
derivative, 186 

group, 129, 180, 182, 184, 861 

product, 808 
lifetime, 15 

matrix, 680 
limit cycle, 42, 103, 111, 114 
linear 

flow, 81, 89 

space, 806 

stability, 77, 99, 537 
linearized 

flow, 90 
link, 291 
Liouville 

equation, 340 

operator, 341 

theorem, 128, 131, 132, 135, 340 
little group, 162 
Littlewood, J.E., 770 
local 

divergence rate, 361 

stability, 77, 99, 537 
localization, quantum dynamical, 139 
logistic map, see unimodal 
longitudinal 

coordinate, 649 

loop 

intersecting, 308 
Lorentz gas, 417, 480, 510, 525, 527 
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Lorentzian, 622 

Lorenz flow, 45, 63, 93-95, 164, 167, 
175, 223 

complex, see complex Lorenz flow 

polar coordinates, 174, 887 

proto-Lorenz, 175 

symmetry, 157, 174 
Lorenz, E.N., 74, 173 
loxodromic, 819 

quartet, 132, 138 
Lozi map, 68, 74 
Lyapunov 

characteristic numbers, 120 

covariant vector, 119-121 

equation, 120 

exponent, 8, 99, 114-118, 804 
cycle, 117 

cycle expansion, 409 

equilibrium, 538 

natural measure, 361 

numerical, 118, 121 
function, 120 
functional, 4 1 
mode, 118, 120 
orbit, 120 

time, 8, 10, 26, 43, 348 
vector, 120 
Lyapunov, A.M., 771 

manifold, 39 

unstable, 243 
map, 39, 65-69, 73 

area preserving, 886 
contracting, 97, 239 
dike, 232, 239 
dissipative, 97, 239 
expanding, 221 
fixed point, 75 
Henon, 67, 595, 885 

Hamiltonian, 132 

prime cycles, 597 
Hamiltonian 

Henon, 132 
iteration, 66 
logistic, see unimodal 
Lozi, 68, 74 
once-folding, 248 
order preserving, 230 
orientation preserving, 886 
orientation reversing, 886 
quadratic, 68 

return, 16, 58, 59, 71, 72, 222, 223, 

233, 245, 249, 270 
sawtooth, 157, 163, 422 
stability, 85 
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tent, 226 
unimodal, 226 
mapping, 73 
marginal 

stability, 16, 80, 103, 193, 368, 466, 
480 
cycle, 106, 193 
fixed point, 48 1 
Markov 

chain, 234 

graph, see transition graph 

matrix, 289, 331,409 

partition, 518, 772 
finite, 221, 291 
infinite, 298 
not unique, 243 
Maslov index, see topological index 
material invariant, 573 
Mather, see Aubry-Mather theory 
matrix 

diagonalizing, 867 

exponential, 83, 844 

group, 154 

hermitian, 864 

invariant, 864 

irreducible, 289 

negative definite, 115 

of variations, see stability matrix 

positive definite, 115, 589 

product, 808 

representation, 155, 808 

stability, 78, 585 
Maupertuis, P.L.M. de, 138, 267, 604 
mean, 353 
measure, 328 

continuous, 804 

invariant, 334 

natural, 74, 335, 342, 350, 445, 448, 
509, 779, 785 
mechanics 

quantum, 618 

statistical, 24 
mediocre 

convergence, 894 
memory 

ra-step, 219 

finite, 293 
method of connections, 205 
metric 

entropy, 99 

entropy, Gauss map, 567 
indecomposability, 220, 896 
invariant, 100, 107 

Floquet multiplier, 106 
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transitivity, 896 
microcanonical ensemble, 363 
Mira, C, 74 
Misiurewicz, M., 74 
mixing, 8, 9, 17, 336 
mode, normal, 887 
Moebius inversion, 318 
moment, 353 

moment-generating function, 353 
monodromy matrix, 85, 108, 368, 818 
Morse index, see topological index 
moving frame, 194, 195 

SO(2), 196 
multi-scattering matrix, 688 
multifractals, 938 

multiplicative ergodic theorem, 120 
multiplicative noise, 577, 581, 584, 587 
multiplier, Floquet, 103, 242 
multipoint shooting method, 273 

,/V-disk, transition matrix, 29 1 
natural density, see natural measure 
natural invariant, see natural measure 
natural measure, 74, 279, 335, 342, 350, 

361, 445, 448, 469, 509, 779, 

785 

nature, geometry of, 1 1 
Navier-Stokes equation, 529 
Navier-Stokes flow 

stability, 83 
negative definite matrix, 115 
neighborhood, 77, 109 
Nero, G., 231 
neutral, see marginal 
New York subway map, 243 
Newton method, 272 

convergence, 272 

damped, 273 

flows, 275 

optimal section, 825 
Newtonian dynamics, 125 
node, 291 
noise 

Brownian, 576 

Gaussian, 584, 586, 951 

multiplicative, 577, 581, 584, 587 

white, 584 
non-hyperbolic 

systems, 448, 504, 506 
non-wandering set, 41, 250 
nonequilibrium, 509 
nonhyperbolic 

flow, 104, 105 
norm, 902 
normal 



divisor, 159 
form, 801 
mode, 887 

washing machine, 153 

obscure 

foundations, 783 
jargon, 204, 218 
topology, 535, 539 
observable, 335, 341, 348, 368, 444, 492, 
502, 509, 521, 773, 841, 926, 
931,948, 949 
integrated, 348, 349, 355, 361, 370, 

384, 397, 893 
simultaneous, 869 
vector, 364 
ODEs, 529 

almost, 53 
0(n) group, 861 
1-disk 

creeping, 695 
scattering, 686 
semiclassical scattering, 692 
Onsager-Machlup, 587 
open system, 14, 222, 350, 359, 360, 
389, 443, 615, 619, 656, 684, 
717,731 

operator 

evolution, 355 
Hilbert-Schmidt, 904 
just a matrix, 872 
Koopman, 841, 846 
Liouville, 341 
norm, 902 

Perron-Frobenius, 331, 363 
positive, 904 
regularization, 910 
resolvent, 306, 358, 843 
semigroup 

bounded, 358, 843 
shift, 232, 234, 871 
stepping, 871 
trace-class, 903 
orbit, 40, 66, 159 
group, 187 
inadmissible, 231 
Keplerian, 613 

periodic, 40, 235, 397, 657, 658 

relative, 192 

returning, 656 

space, 194 
order preserving map, 230 
ordering 

spatial, 228, 251 
ordinary differential equations, see ODEs 
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orientation 

preserving map, 886 

reversing map, 886 
orthogonality relation, 809, 814, 868 
Oseledec ergodic theorem, 120 

Palais slice, 206 
palpitations, chicken heart, 8 
paradise 

this side of, 440 
partial differential equations, see PDEs 
partially hyperbolic 

invariant tori, 203 

orbit, 104 
partition, 219, 235 

state space, 328 

function, 354, 363 

generating, 235 

infinite, 239, 313, 322 

Markov, 221 
passive scalar, 573 
past topological coordinate, 253 
path integral 

stochastic, see Wiener integral 
PDEs, 39, 529 
period 

relation to action, 712 

relative, 190 
periodic 

orbit, 13, 40, 104, 111, 189, 235, 
397, 657, 658 
condition, 266, 279, 594 
extraction, 266-280, 594-604 
Hamiltonian repeller, 283 
inverse iteration, 27 1 
multipoint shooting, 273 
Newton method, 272 
relative, 190 

relaxation algorithm, 595 
short, 166, 172 
point, 13, 16, 21, 22, 40, 230, 235, 
833 

admissible, 306 
count, 317 
unstable, 14 
Perron-Frobenius 
matrix, 289 

operator, 331, 363, 453 

theorem, 469, 475, 941 
Peter- Weyl theorem, 180, 184 
phase space, 38, 128, see state space 

3-disk, 896 

density, 340 

vs. state space, 50 
piecewise 



constant function, 371 

linear map, 504 

linear map, intermittency, 483 

linear map, repeller, 360 
pinball, see 3-disk 

simulator, 149 
plain English, 204, 234 
plane Couette flow 

energy, 545 

relative solutions, 192 

stability, 816 

symmetries, 158, 161, 173, 180 

unstable manifold, 245 
Plemelj-Smithies cumulants, 908 
POD, 246 

Poincare invariants, 135 
Poincare return map, 58, 59, 71 

cycle, 108 

polynomial, 67 

stability, 86 
Poincare section, 14, 58-65, 71, 72, 249 

3-disk, 144 

border, 75, see chart border 

Henon trick, 73 

hyperplane, 59, 245 
Poincare, H., 5, 9, 15 
point 

non-wandering, 41 

periodic, 13, 235 

scatterer, 738 

wandering, 4 1 
point-wise invariant, see G-fixed 
Poisson 

bracket, 186, 339, 340, 342, 817 

resummation, 23, 498 
polar coordinates, 804 
polar decomposition, 115, 121, 122 
Pollicott, M., 363, 502 
polylogarithm, 485 
polynomial 

characteristic, 308 

topological, 312 
Pomeau, Y., 74 

positive definite matrix, 1 15, 589 
positive operators, 904 
post-processing, 195, 203 
potential problems, 53 
power law 

correlation decay, 482 
pressure 

thermodynamic, 363 

topological, 363 
prime cycle, 235, 267, 323, 370 

3-disk, 248, 323, 604 
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binary, 248, 257, 306 
count, 318 

Henon map, 597, 607 

ternary, 256 
primitive cycle, see prime cycle 
principal 

directions, 115 

stretches, 115 
probabilistic zeta function, 502 
probability 

density, Gaussian, 584 

matrix, 289 
product 

Lie, 808 

matrix, 808 
profile, spatial, 39 

projection operator, 814, 867, 868, 875, 
876 

complete, orthonormal, 876 
propagator, 620 

semiclassical, 642 

short time, 643, 650 

Van Vleck, 644 
pruning, 12, 480 

block, 236 

front, 254 

3-disk, 221, 262 

golden mean, 239, 292, 309, 323, 
392, 829, 973 

individual cycles, 319 

primary interval, 232 

rules, 291 

symbolic dynamics, 235 
pseudo-cycle, 396 

quadratic map, 68 
quantization 

Bohr-Sommerfeld, 613 

semiclassical, 656 

WKB, 623, 627 
quantum 

chaology, see chaos, quantum 

chaos, 616, 662 

dynamical localization, 139 

evolution, 620 

interference, 633 

mechanics, 618 

potential, 652 

propagator, 620 

resonances, 613 

theory, old, 783 
quasiperiodicity, 41 
quotient 

group, 159 

space, 194 
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state space, 63, 160, 194 

radius of convergence, 379 
random 

walk, 576 
random matrix theory, 616 
Rayleigh-Benard flow, 45 
recoding, 236, 248, 256 
rectangle, 252 
rectification 

flows, 794 

maps, 800 
recurrence, 41, 218 

time, see return time 
reduced state space, 160, 193, 194 
reducible representation, 180, 184 
reflection, 154 
regular group action, 195 
regularization, 662, 798 

Green's function, 661 

operator, 910 
relative 

equilibrium, 187 

orbit, 192 

period, 190 

periodic orbit, 190 

solutions, 373 
relaxation algorithm, 595 
renormalization, 148 

golden mean, 556, 564 
repeated index summation, 87, 130, 862 
repeller, 14, 42, 359,615 

piecewise-linear, 360 

single fixed point, 453 
representation, 155 

character, 865 

defining, 864 

dual, 807, 863, 864 

equivalent, 866 

faithful, 866 

linear, 181 

matrix, 808, 865 

reducible, 180, 184 

regular, 866 

space, 862 

standard, 862 
representative point, 38 
residue, 135 

Greene's, 133 

stability, 133, 139 
resolvent 

kernel, 462 

operator, 306, 358, 843 
resonances 
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quantum, 613 

Ruelle-Pollicott, 363, 502 
resummation 

intermittency, 497 
return map, 16, 71, 72, 249, 270 

Rossler flow, 222 
return time, 502 

distribution, 502 
returning orbit, 656 
reversible 

dynamics, 43 
Riemann zeta function, 505, 522, 898 
Rossler 

attractor, 61 

cycles, 270, 282 

equilibria, 53, 92, 222 

flow, 46, 51,53, 61,75, 96, 118 

Lyapunov exponent, 122 

return map, 222 

web diagram, 233 
rotating wave, 188 
Roux, Henriette, 30, 88 
Ruelle 

-Pollicott resonances, 363, 502 
zeta function, see dynamical zeta 
function 

Ruelle, D., 31, 363, 502 

Runge-Kutta integration, 53 

running orbit 

Lorentz gas, 513 

Rutherford, 704 

Rydberg series, 714 

S\see SO(2) 

saddle, 91, 100, 103 

saddle point, see stationary phase 

saddle-node bifurcation, 74 

sample variance, 352 

sausage, (A^+l)-dimensional, 180 

sawtooth map, 157, 163,422 

scalar multiplication, 806 

scattering 

3-dimensional spheres, 148 

elastic, 670 

Green's function, 676 

matrix, 67 1 

phase shift, 678 

point, 738 
schmactals, see fractal 
Schrodinger 

equation, 618 

equation, time-independent, 618 

picture, 902 
Schwartzian derivative, 804 
section 



optimal, 825 

stroboscopic, 57 
section, Poincare, 14, 58, 71, 72, 144 
secular equation, 813, 814, 867 
self-retracing cycle, 710 
self-similar, 22 
semiclassical 

approximation, 634 

Green's function, 651, 654 

propagator, 642 

quantization, 656 

resonances 
3-disk, 964 

spectral determinant 
collinear helium, 715 

wave function, 640 

semiclassical zeta function, 663 
semigroup, 338, 842 

dynamical, 43, 53, 85, 101, 105, 
119, 145 

evolution, 355 

operator, 358, 843 
sensitivity to initial conditions, 8, 32, 34, 
114 

set, non-wandering, 250 
shadowing, 20, 315,408 

3-disk, 408 
shift, 234 

Bernoulli, 228, 455, 503 

finite type, 236 

full, 234, 291 

map, 551 

operator, 232, 234, 871 

sub-, 235 
short periodic orbit, 166, 172 
similarity transformation, 107 
simultaneous observables, 869 
Sinai, Ya., 31 

Sinai-Bowen-Ruelle measure, see natu- 
ral measure 
single fixed point 
repeller, 453 

spectral determinant, 454 
singular value decomposition, 121 
singular values, 115, 119, 122 
singularity 

branch cut, 488 
sink, 42, 100, 103 
Sivashinsky, G.I., 544 
skeleton of chaos, 13, 14 
skewness, 353 
slice, 194, 195, 205 

condition, 195 

linear, 195 
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Palais, 206 
Smale, S., 11, 31, 242, 259, 321, 391, 
771 

wild idea, 381, 391 
small divisor problem, 369 
S -matrix, 671 
smooth, 182 

conjugacy, 48, 106, 107, 804 

dynamical system, 545 

dynamics, 20, 21, 29, 39, 315, 770, 
941, 944, 946 
approximated, 891 

dynamics, spectral determinant, 474 

interaction, 948 

potential, 148 
SO(2), 182, 196 

SO(2), 92, 180, 184, 186, 200, 208, 209, 
211, 804, 811 

irreducible representation, 183 
SO(3), 182, 782 
solution 

symmetry, 187 
Sommerfeld 

diffraction, 737 
source, 100, 103 
Sp(d) 

symplectic group, 1 29 
space 

analytic functions, 47 1 

average, 335, 350 

configuration, 50 

defining vector, 863 

density functions, 371 

dual, 807, 863 

linear, 806 

phase, 50 

state, 50 

vector, 806 
space average, 541 
span, 807 
spatial profile, 39 
spatiotemporal chaos, 530 
spatiotemporal dynamics, 3 1 
spectral 

decomposition, 810, 814, 860, 869 
determinant, 24, 307, 379 

1 - dimensional maps, 382 

2- dimensional hyperbolic Hamil- 
tonian flow, 383 

1- dof, 664 

2- dof, 665 
entire, 380, 460 
for flows, 381 

infinite product rep., 381 



single fixed point, 454 
weighted, 388 

radius, 454, 463 
essential, 472 

stability, 521 

staircase, 621 
spectrum 

B aimer, 613 
specular reflection, 143 
Spiegel, Edward A., 780 
SRB measure, see natural measure 
St. Augustine, 328 
stability, 77-88 

billiards, 112, 145 

continuous symmetry, 192 

eigenvalue, see Floquet multiplier 

elliptic, 369 

exact, 113 

exponent, 88, 89, see Floquet expo- 
nent 
flow, 84 

Hamiltonian flow, 819 

Hamiltonian flows, 130, 817 

indifferent, 80 

linear, 77, 99, 537 

maps, 85 

marginal, 80, 193 

matrix, 78, 90, 585 

matrix, symplectic, 130 

multiplier, 89, see Floquet multiplier 

neutral, see marginal 

ordering 

cycle expansions, 411 

intermittent flows, 500 
Poincare map cycle, 108 
Poincare return map, 86 
residue, 133, 139 
spectral, 521 

structural, 250, 251, 259, 316, 521 

window, 104 
stability equations, 90 
stabilizer, 162, 163, 172 
stabilizer subgroup, see isotropy subgroup 
stable 

cycle, 111 

manifold, 16, 243-245, 249 
stadium billiard, 143, 149, 173, 506, 883, 
886, 924 

stagnation point, see equilibrium point 
staircase 

mean eigenvalue density, 717 

spectral, 621 
standard deviation, 353 
standard map, 133, 138, 480 
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standard representation space, 862 
standardized moment, 353 
standing orbit 

Lorentz gas, 513 
standing wave, 44, 192 
state, 218, 291 

set, 219 
state space, 38, 39, 128 

discretization, 363 

partition, 328 

reduced, 160, 193 

volume M, 359 

vs. phase space, 50 
stationary 

flow, 44 

phase, 337, 605, 626, 629, 631, 646, 
654, 684, 699, 728, 730, 958 

phase approximation, 626, 632, 644, 
657, 729, 738 

point, see equilibrium point 

state, 334 
stationary Lyapunov basis, 120 
statistical mechanics, 24 
steady state, see equilibrium point 
stepping operator, 87 1 
Sterling formula, 632 
stochastic 

dynamics, 8, 337 

matrix, 289 

path integral, see Wiener integral 
Stokes theorem, 136, 638 
stosszahlansatz, 25, 524 
strange attractor, 41, 47, 1 14 

Rossler flow, 61 
stretch & fold, 68, 226 
stretches, 115 

stroboscopic sections method, 57 
strongly connected graph, 29 1 
structural stability, 27, 250, 251, 259, 
316,521,524,532, 743,785 

Henon map, 26 1 
structure constant, 185, 808 
subgroup 

isotropy, 161 
subshift, 235 

finite type, 236, 259, 290-292, 294 
S£/(n) group, 861 
super-exponential 

convergence, 594 
superstable 

cycle, 111 

fixed point, 594 
surjective, 68 

survival probability, 15, see escape rate 



S VD, see singular value decomposition 
symbol 

sequence 

inadmissible, 235 

square, 252 
symbol square, 25 1 

symbolic dynamics, 12, 218-236, 827- 
835 

3-disk, 34, 221,262 
at a bifurcation, 148 
binary 

collinear helium, 708 
coding, 236 

transition graph, 409 
complete, 226, 237, 250, 291 
covering, 234 
grammar, 236 
Henon-Heiles, 173 
pruned, 235 

recoding, 236, 248, 256 

unimodal, 227 
symmetric group, 861 
symmetry 

D 3 , 257, 432 

3-disk, 153, 157-159, 164, 169, 257, 

432, 436 
continuous, 178-202 
cyclic, 304 

discrete, 152-170, 248, 255 

dynamical system, 155, 178 

Henon map, 885 

of a solution, 162, 163, 171, 172 

solution, 187 

under G p , 162 
symmetry -reduced space, 194 
symplectic 

2-form, 128 

dimension, 137 

group Sp(d), 129 

group Sp(2D), 818 

Henon map, 132 

integrator, 845 

invariance, 126, 137, 817 

map, 131 

transformation, 129, 130, 248, 340 
system 

open, 359 
syzygy, 170, 174, 200, 201 

tangent 

bundle, 43, 78 

field, 182 

field, group, 186 

linear equations, 81 

map, see stability matrix 
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space, 78, 182 
tangent linear propagator, 90 
tangent map, 90 
Tauberian theorem, 505 
teaching 

combinatorics, 229 
template, 222, 271 
tensor 

invariant, 185 
tent map, 226, 804 
ternary 

prime cycles, 256 
tessellation 

smooth dynamics, 891 
thermodynamic al 

pressure, 363 
3-body problem, 615, 704, 767, 784, 796 
3-dimensional sphere 

scattering, 148 
3-disk 

boundary orbits, 428 

convergence, 465, 891 

cycle 

analytically, 282 
count, 171,434, 840 
expansion, 418 

escape rate, 364, 408, 418, 440 

fractal dimension, 919 

geometry, 144 

hyperbolicity, 369 

phase space, 896 

pinball, 6, 145, 148 

point scatterer, 738 

prime cycles, 18, 248, 267, 323, 604 

pruning front, 22 1 , 262 

semiclassical resonances, 964 

shadowing, 408 

simulator, 149 

state space, 14, 72, 919 

symbolic dynamics, 12, 34, 221, 262 

symmetry, 153, 157-159, 164, 169, 
257, 432, 436 

time 

arrow of, 24 
as parametrization, 49 
average, 336, 348 
ceiling function, see ceiling func- 
tion 

delay, Wigner, 678 
ordered integration, 84, 88 
turnover, 92, 94,812 

time average, 541 

time-f forward map, 50, 73, 187 

topological 



conjugacy, 230 

dynamics, 218, 233, 235, 236, 290 
entropy, 8, 302, 313 
future coordinate, 230 
index, 641 

topological index, 659, 783 
invariant, 100 
Markov chain, 234 
parameter, 233 
polynomial, 312 
trace formula, 306 
transitivity, 289 
zeta function, 312, 313 
torus, 41 

totient function, 553 
t p cycle weight, 382 
trace 

formula 
classical, 23 
flows, 372 
Gutzwiller, 659 
maps, 370, 454 
topological, 306, 313 
weight, 400 
local, 304 
trace-class operator, 689, 903 

determinant, 905 
trajectory, 39, 82 
discrete, 66 
transfer 

map, 138 

matrix, 138, 331, 361 

operator, 371, 391 

spectrum, 380 
transformation 

canonical, 248 

coordinate, 804 

symplectic, 248 
transient, 41, 220, 297 
transition 

graph, 288-298 
infinite, 311 

matrix, 288, 302, 304 

matrix, ,/V-disk, 291 
transitive, 42 

transplacement gradient, 90 
transversality 

border of, 75, see chart border 

condition, 59 

Thorn's, 785 
transverse stability, 650 
traveling wave, 188, see relative equilib- 
rium 

Trotter product formula, 902 
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truncation 

Fourier, 531 

Galerkin, 533 
turbulence, 9, 10, 544 

problem of, 529 
turnback point, 247 
turnover time, 92, 94, 812 

Ulam map, 226, 282, 283, 804 

skew, 331,450 

tent, 344, 393 
ultraviolet divergence, 661 
unimodal 

kneading value, 239 

map, 226 

map, symbolic dynamics, 227 
well ordered symbols, 239 
unstable 

cycle, 111 

manifold, 16, 243-245, 249 

periodic orbit), see periodic orbit 

periodic point, 14 
unsung 

heroes, xiv, xix 
UPO (Unstable Periodic Orbit), see pe- 
riodic orbit 

van Kampen, N. G., 587 
Van Vleck 

propagator, 644 
variance, 353 
variational principle, 586 
vector 

basis, 807 

field, 43 

field, singularities, 794 
invariant, 864 
observable, 364 
space, 806 
defining, 863 
dual, 863 
velocity gradients matrix, 78, 90 
vertex, 29 1 

visitation frequency, 335 
visitation sequence, see itinerary 
volume preservation, 147 
von Neumann 

ergodicity, 846 



wave, standing, 44, 192 
web diagram 

Rossler flow, 233 
weight 

multiplicative, 29 
well ordered symbols 

unimodal, 239 
Wentzel-Kramers-Brillouin, see WKB 
Weyl 

Peter- Weyl theorem, 180, 184 

rule, 660 
Weyl, H., 867 
white noise, 584 
Wiener integral, 585 
Wigner delay time, 678 
winding number, 135, 550, 551 
WKB, 623, 634 

connection formulas, 630 

quantization, 623, 627 

wave function, 641 

Yang, C.N., 335 
Young, L.-S., 74 

zero eigenvalue, 631, 646 
zero, false, 387 
zeta function 

Artin-Mazur, 312 

dynamical, 19, 383 

probabilistic , 502 

Ruelle, see dynamical 

topological, 312, 313 



Waleffe,R, 781 
walk, see itinerary 
wandering point, 41 
wave function 

semiclassical, 640 

WKB, 641 
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Appendix A 

A brief history of chaos 



Laws of attribution 

1. Arnol'd's Law: everything that is discovered is 
named after someone else (including Arnol'd's 
law) 

2. Berry's Law: sometimes, the sequence of an- 
tecedents seems endless. So, nothing is discovered 
for the first time. 

3. Whiteheads's Law: Everything of importance has 
been said before by someone who did not discover 
it. 

— Sir Michael V. Berry 

Writing a history of anything is a reckless undertaking, especially a history of 
something that has preoccupied at one time or other any serious thinker from 
ancient Sumer to today's Hong Kong. A mathematician, to take an example, might 
see it this way: "History of dynamical systems." Nevertheless, here comes yet 
another very imperfect attempt. 



A.l Chaos is born 

I'll maybe discuss more about its history when I learn 
more about it. 

— Maciej Zworski 

(R. Mainieri and P. Cvitanovic) 

Trying to predict the motion of the Moon has preoccupied astronomers since 
antiquity. Accurate understanding of its motion was important for deter- 
mining the longitude of ships while traversing open seas. 
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Kepler's Rudolphine tables had been a great improvement over previous ta- 
bles, and Kepler was justly proud of his achievements. He wrote in the introduc- 
tion to the announcement of Kepler's third law, Harmonice Mundi (Linz, 1619) in 
a style that would not fly with the contemporary Physical Review Letters editors: 

What I prophesied two-and-twenty years ago, as soon as I discovered 
the five solids among the heavenly orbits-what I firmly believed long before 
I had seen Ptolemy's Harmonics-what I had promised my friends in the title 
of this book, which I named before I was sure of my discovery-what sixteen 
years ago, I urged as the thing to be sought-that for which I joined Tycho 
Brahe, for which I settled in Prague, for which I have devoted the best part 
of my life to astronomical contemplations, at length I have brought to light, 
and recognized its truth beyond my most sanguine expectations. It is not 
eighteen months since I got the first glimpse of light, three months since 
the dawn, very few days since the unveiled sun, most admirable to gaze 
upon, burst upon me. Nothing holds me; I will indulge my sacred fury; I 
will triumph over mankind by the honest confession that I have stolen the 
golden vases of the Egyptians to build up a tabernacle for my God far away 
from the confines of Egypt. If you forgive me, I rejoice; if you are angry, I 
can bear it; the die is cast, the book is written, to be read either now or in 
posterity, I care not which; it may well wait a century for a reader, as God 
has waited six thousand years for an observer. 

Then came Newton. Classical mechanics has not stood still since Newton. 
The formalism that we use today was developed by Euler and Lagrange. By the 
end of the 1800's the three problems that would lead to the notion of chaotic 
dynamics were already known: the three-body problem, the ergodic hypothesis, 
and nonlinear oscillators. 



A.l.l Three-body problem 

Bernoulli used Newton's work on mechanics to derive the elliptic orbits of Kepler 
and set an example of how equations of motion could be solved by integrating. 
But the motion of the Moon is not well approximated by an ellipse with the Earth 
at a focus; at least the effects of the Sun have to be taken into account if one wants 
to reproduce the data the classical Greeks already possessed. To do that one has 
to consider the motion of three bodies: the Moon, the Earth, and the Sun. When 
the planets are replaced by point particles of arbitrary masses, the problem to be 
solved is known as the three -body problem. The three-body problem was also 
a model to another concern in astronomy. In the Newtonian model of the solar 
system it is possible for one of the planets to go from an elliptic orbit around the 
Sun to an orbit that escaped its dominion or that plunged right into it. Knowing 
if any of the planets would do so became the problem of the stability of the solar 
system. A planet would not meet this terrible end if solar system consisted of 
two celestial bodies, but whether such fate could befall in the three-body case 
remained unclear. 

After many failed attempts to solve the three-body problem, natural philoso- 
phers started to suspect that it was impossible to integrate. The usual technique for 
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integrating problems was to find the conserved quantities, quantities that do not 
change with time and allow one to relate the momenta and positions at different 
times. The first sign on the impossibility of integrating the three-body problem 
came from a result of Burns that showed that there were no conserved quantities 
that were polynomial in the momenta and positions. Burns' result did not pre- 
clude the possibility of more complicated conserved quantities. This problem was 
settled by Poincare and Sundman in two very different ways [1,2]. 

In an attempt to promote the journal Acta Mathematica, Mittag-Leffler got 
the permission of the King Oscar II of Sweden and Norway to establish a mathe- 
matical competition. Several questions were posed (although the king would have 
preferred only one), and the prize of 2500 kroner would go to the best submission. 
One of the questions was formulated by Weierstrass: 

Given a system of arbitrary mass points that attract each other according 
to Newton's laws, under the assumption that no two points ever collide, try 
to find a representation of the coordinates of each point as a series in a 
variable that is some known function of time and for all of whose values the 
series converges uniformly. 

This problem, whose solution would considerably extend our under- 
standing of the solar system, . . . 

Poincare's submission won the prize. He showed that conserved quantities that 
were analytic in the momenta and positions could not exist. To show that he 
introduced methods that were very geometrical in spirit: the importance of state 
space flow, the role of periodic orbits and their cross sections, the homoclinic 
points. 

The interesting thing about Poincare's work was that it did not solve the prob- 
lem posed. He did not find a function that would give the coordinates as a function 
of time for all times. He did not show that it was impossible either, but rather that 
it could not be done with the Bernoulli technique of finding a conserved quantity 
and trying to integrate. Integration would seem unlikely from Poincare's prize- 
winning memoir, but it was accomplished by the Finnish-born Swedish mathe- 
matician Sundman. Sundman showed that to integrate the three-body problem 
one had to confront the two-body collisions. He did that by making them go away 
through a trick known as regularization of the collision manifold. The trick is not 
to expand the coordinates as a function of time t, but rather as a function of ^fi. 
To solve the problem for all times he used a conformal map into a strip. This 
allowed Sundman to obtain a series expansion for the coordinates valid for all 
times, solving the problem that was proposed by Weirstrass in the King Oscar EE's 
competition. 

The Sundman's series are not used today to compute the trajectories of any 
three-body system. That is more simply accomplished by numerical methods or 
through series that, although divergent, produce better numerical results. The con- 
formal map and the collision regularization mean that the series are effectively in 
the variable 1 - e~ Quite rapidly this gets exponentially close to one, the ra- 
dius of convergence of the series. Many terms, more terms than any one has ever 
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wanted to compute, are needed to achieve numerical convergence. Though Sund- 
man's work deserves better credit than it gets, it did not live up to Weirstrass's 
expectations, and the series solution did not "considerably extend our understand- 
ing of the solar system.' The work that followed from Poincare did. 

A.1.2 Ergodic hypothesis 

The second problem that played a key role in development of chaotic dynamics 
was the ergodic hypothesis of Boltzmann. Maxwell and Boltzmann had combined 
the mechanics of Newton with notions of probability in order to create statistical 
mechanics, deriving thermodynamics from the equations of mechanics. To eval- 
uate the heat capacity of even a simple system, Boltzmann had to make a great 
simplifying assumption of ergodicity: that the dynamical system would visit every 
part of the phase space allowed by conservation laws equally often. This hypoth- 
esis was extended to other averages used in statistical mechanics and was called 
the ergodic hypothesis. It was reformulated by Poincare to say that a trajectory 
comes as close as desired to any phase space point. 

Proving the ergodic hypothesis turned out to be very difficult. By the end of 
twentieth century it has only been shown true for a few systems and wrong for 
quite a few others. Early on, as a mathematical necessity, the proof of the hypoth- 
esis was broken down into two parts. First one would show that the mechanical 
system was ergodic (it would go near any point) and then one would show that it 
would go near each point equally often and regularly so that the computed aver- 
ages made mathematical sense. Koopman took the first step in proving the ergodic 
hypothesis when he realized that it was possible to reformulate it using the recently 
developed methods of Hilbert spaces [I]. This was an important step that showed 
that it was possible to take a finite-dimensional nonlinear problem and reformu- 
late it as a infinite-dimensional linear problem. This does not make the problem 
easier, but it does allow one to use a different set of mathematical tools on the 
problem. Shortly after Koopman started lecturing on his method, von Neumann 
proved a version of the ergodic hypothesis, giving it the status of a theorem |2]. 
He proved that if the mechanical system was ergodic, then the computed averages 
would make sense. Soon afterwards Birkhoff published a much stronger version 
of the theorem. 



A.1.3 Nonlinear oscillators 

The third problem that was very influential in the development of the theory of 
chaotic dynamical systems was the work on the nonlinear oscillators. The prob- 
lem is to construct mechanical models that would aid our understanding of phys- 
ical systems. Lord Rayleigh came to the problem through his interest in under- 
standing how musical instruments generate sound. In the first approximation one 
can construct a model of a musical instrument as a linear oscillator. But real in- 
struments do not produce a simple tone forever as the linear oscillator does, so 
Lord Rayleigh modified this simple model by adding friction and more realistic 
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models for the spring. By a clever use of negative friction he created two basic 
models for the musical instruments. These models have more than a pure tone 
and decay with time when not stroked. In his book The Theory of Sound Lord 
Rayleigh introduced a series of methods that would prove quite general, such as 
the notion of a limit cycle, a periodic motion a system goes to regardless of the 
initial conditions. 



A.2 Chaos grows up 

(R. Mainieri) 

The theorems of von Neumann and Birkhoff on the ergodic hypothesis were 
published in 1912 and 1913. This line of enquiry developed in two directions. One 
direction took an abstract approach and considered dynamical systems as trans- 
formations of measurable spaces into themselves. Could we classify these trans- 
formations in a meaningful way? This lead Kolmogorov to the introduction of the 
concept of entropy for dynamical systems. With entropy as a dynamical invariant 
it became possible to classify a set of abstract dynamical systems known as the 
Bernoulli systems. The other line that developed from the ergodic hypothesis was 
in trying to find mechanical systems that are ergodic. An ergodic system could 
not have stable orbits, as these would break ergodicity. So in 1898 Hadamard 
published a paper with a playful title of billiards where he showed that 
the motion of balls on surfaces of constant negative curvature is everywhere un- 
stable. This dynamical system was to prove very useful and it was taken up by 
Birkhoff. Morse in 1923 showed that it was possible to enumerate the orbits of 
a ball on a surface of constant negative curvature. He did this by introducing a 
symbolic code to each orbit and showed that the number of possible codes grew 
exponentially with the length of the code. With contributions by Artin, Hedlund, 
and H. Hopf it was eventually proven that the motion of a ball on a surface of con- 
stant negative curvature was ergodic. The importance of this result escaped most 
physicists, one exception being Krylov, who understood that a physical billiard 
was a dynamical system on a surface of negative curvature, but with the curvature 
concentrated along the lines of collision. Sinai, who was the first to show that a 
physical billiard can be ergodic, knew Krylov's work well. 

The work of Lord Rayleigh also received vigorous development. It prompted 
many experiments and some theoretical development by van der Pol, Duffing, 
and Hayashi. They found other systems in which the nonlinear oscillator played 
a role and classified the possible motions of these systems. This concreteness 
of experiments, and the possibility of analysis was too much of temptation for 
Mary Lucy Cartwright and J.E. Littlewood [5], who set out to prove that many 
of the structures conjectured by the experimentalists and theoretical physicists 
did indeed follow from the equations of motion. Birkhoff had found a 'remark- 
able curve' in a two dimensional map; it appeared to be non-differentiable and it 
would be nice to see if a smooth flow could generate such a curve. The work of 
Cartwright and Littlewood lead to the work of Levinson, which in turn provided 
the basis for the horseshoe construction of S. Smale. 
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In Russia, Lyapunov paralleled the methods of Poincare and initiated the 
strong Russian dynamical systems school [6]. Andronov earned on with the study 
of nonlinear oscillators and in 1937 introduced together with Pontryagin the no- 
tion of coarse systems. They were formalizing the understanding garnered from 
the study of nonlinear oscillators, the understanding that many of the details on 
how these oscillators work do not affect the overall picture of the state space: there 
will still be limit cycles if one changes the dissipation or spring force function by 
a little bit. And changing the system a little bit has the great advantage of elim- 
inating exceptional cases in the mathematical analysis. Coarse systems were the 
concept that caught Smale's attention and enticed him to study dynamical systems. 



A.3 Chaos with us 



(R. Mainieri) 

In the fall of 1961 Steven Smale was invited to Kiev where he met Arnol'd, 
Anosov, Sinai, and Novikov. He lectured there, and spent a lot of time with 
Anosov. He suggested a series of conjectures, most of which Anosov proved 
within a year. It was Anosov who showed that there are dynamical systems for 
which all points (as opposed to a non-wandering set) admit the hyperbolic struc- 
ture, and it was in honor of this result that Smale named these systems Axiom-A. 
In Kiev Smale found a receptive audience that had been thinking about these prob- 
lems. Smale's result catalyzed their thoughts and initiated a chain of developments 
that persisted into the 1970's. 

Smale collected his results and their development in the 1967 review article 
on dynamical systems, entitled "Differentiable dynamical systems" [7]. There are 
many great ideas in this paper: the global foliation of invariant sets of the map into 
disjoint stable and unstable parts; the existence of a horseshoe and enumeration 
and ordering of all its orbits; the use of zeta functions to study dynamical systems. 
The emphasis of the paper is on the global properties of the dynamical system, on 
how to understand the topology of the orbits. Smale's account takes you from a 
local differential equation (in the form of vector fields) to the global topological 
description in terms of horseshoes. 

The path traversed from ergodicity to entropy is a little more confusing. The 
general character of entropy was understood by Weiner, who seemed to have spo- 
ken to Shannon. In 1948 Shannon published his results on information theory, 
where he discusses the entropy of the shift transformation. Kolmogorov went 
far beyond and suggested a definition of the metric entropy of an area preserving 
transformation in order to classify Bernoulli shifts. The suggestion was taken by 
his student Sinai and the results published in 1959. In 1960 Rohlin connected 
these results to measure-theoretical notions of entropy. The next step was pub- 
lished in 1965 by Adler and Palis, and also Adler, Konheim, Mc Andrew; these 
papers showed that one could define the notion of topological entropy and use it 
as an invariant to classify continuous maps. In 1967 Anosov and Sinai applied 
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the notion of entropy to the study of dynamical systems. It was in the context 
of studying the entropy associated to a dynamical system that Sinai introduced 
Markov partitions in 1968. 

Markov partitions allow one to relate dynamical systems and statistical me- 
chanics; this has been a very fruitful relationship. It adds measure notions to the 
topological framework laid down in Smale's paper. Markov partitions divide the 
state space of the dynamical system into nice little boxes that map into each other. 
Each box is labeled by a code and the dynamics on the state space maps the codes 
around, inducing a symbolic dynamics. From the number of boxes needed to 
cover all the space, Sinai was able to define the notion of entropy of a dynamical 
system. In 1970 Bowen came up independently with the same ideas, although 
there was presumably some flow of information back and forth before these pa- 
pers got published. Bowen also introduced the important concept of shadowing of 
chaotic orbits. We do not know whether at this point the relations with statistical 
mechanics were clear to everyone. They became explicit in the work of Ruelle. 
Ruelle understood that the topology of the orbits could be specified by a symbolic 
code, and that one could associate an 'energy' to each orbit. The energies could 
be formally combined in a 'partition function' to generate the invariant measure 
of the system. 

After Smale, Sinai, Bowen, and Ruelle had laid the foundations of the statisti- 
cal mechanics approach to chaotic systems, research turned to studying particular 
cases. The simplest case to consider is 1 -dimensional maps. The topology of the 
orbits for parabola-like maps was worked out in 1973 by Metropolis, Stein, and 
Stein [I ]. The more general 1 -dimensional case was worked out in 1976 by Milnor 
and Thurston in a widely circulated preprint, whose extended version eventually 
got published in 1988 [9]. 

A lecture of Smale and the results of Metropolis, Stein, and Stein inspired 
Feigenbaum to study simple maps. This lead him to the discovery of the universal- 
ity in quadratic maps and the application of ideas from field-theory to dynamical 
systems. Feigenbaum's work was the culmination in the study of 1 -dimensional 
systems; a complete analysis of a nontrivial transition to chaos. Feigenbaum intro- 
duced many new ideas into the field: the use of the renormalization group which 
lead him to introduce functional equations in the study of dynamical systems, the 
scaling function which completed the link between dynamical systems and statis- 
tical mechanics, and the presentation functions which describe the dynamics of 
scaling functions. 

The work in more than one dimension progressed very slowly and is still far 
from completed. The first result in trying to understand the topology of the or- 
bits in two dimensions (the equivalent of Metropolis, Stein, and Stein, or Milnor 
and Thurston's work) was obtained by Thurston. Around 1975 Thurston was giv- 
ing lectures "On the geometry and dynamics of diffeomorphisms of surfaces." 
Thurston's techniques exposed in that lecture have not been applied in physics, 
but much of the classification that Thurston developed can be obtained from the 
notion of a 'pruning front' formulated independently by Cvitanovic. 
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Once one develops an understanding of the topology of the orbits of a dynam- 
ical system, one needs to be able to compute its properties. Ruelle had already 
generalized the zeta function introduced by Artin and Mazur [10], so that it could 
be used to compute the average value of observables. The difficulty with Ruelle's 
zeta function is that it does not converge very well. Starting out from Smale's 
observation that a chaotic dynamical system is dense with a set of periodic orbits, 
Cvitanovic used these orbits as a skeleton on which to evaluate the averages of 
observables, and organized such calculations in terms of rapidly converging cy- 
cle expansions. This convergence is attained by using the shorter orbits used as a 
basis for shadowing the longer orbits. 

This account is far from complete, but we hope that it will help get a sense of 
perspective on the field. It is not a fad and it will not die anytime soon. 



A.4 Periodic orbit theory 

Pure mathematics is a branch of applied mathematics. 

— Joe Keller, after being asked to define applied 
mathematics 

(P. Cvitanovic) 

The history of periodic orbit theory is rich and curious; recent advances are equally 
inspired by more than a century of developments in three separate subjects: 1. 
classical chaotic dynamics, initiated by Poincare and put on its modern footing 
by Smale [7], Ruelle [11], and many others, 2. quantum theory initiated by Bohr, 
with the modern 'chaotic' formulation by Gutzwiller [12, 13], and 3. analytic 
number theory initiated by Riemann and formulated as a spectral problem by Sel- 
berg [14, 15]. Following different lines of reasoning and driven by different mo- 
tivations, the three separate roads all arrive at trace formulas , zeta functions and 
spectral determinants. 

The fact that these fields are all related is far from obvious, and even today 
the practitioners tend to cite papers only from their sub-speciality. In Gutzwiller's 
words [1 ], "The classical periodic orbits are a crucial stepping stone in the un- 
derstanding of quantum mechanics, in particular when then classical system is 
chaotic. This situation is very satisfying when one thinks of Poincare who empha- 
sized the importance of periodic orbits in classical mechanics, but could not have 
had any idea of what they could mean for quantum mechanics. The set of energy 
levels and the set of periodic orbits are complementary to each other since they are 
essentially related through a Fourier transform. Such a relation had been found 
earlier by the mathematicians in the study of the Laplacian operator on Rieman- 
nian surfaces with constant negative curvature. This led to Selberg's trace formula 
in 1956 which has exactly the same form, but happens to be exact." A posteriori, 
one can say that zeta functions arise in both classical and quantum mechanics be- 
cause the dynamical evolution can be described by the action of linear evolution 
(or transfer) operators on infinite-dimensional vector spaces. The spectra of these 
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operators are given by the zeros of appropriate determinants. One way to evalu- 
ate determinants is to expand them in terms of traces, logdet(X) = tr(logX). In 
this way the spectrum of an evolution operator becomes related to its traces, i.e. 
periodic orbits. A deeper way of restating this is to observe that the trace formu- 
las perform the same role in all of the above problems; they relate the spectrum 
of lengths (local dynamics) to the spectrum of eigenvalues (global eigenstates), 
and for nonlinear geometries they play a role analogous to the one that Fourier 
transform plays for the circle. 

Distant history is easily sanitized and mythologized. As we approach the 
present, our vision is inevitably more myopic; for very different accounts cov- 
ering the same recent history, see V. Baladi [1 6] (a mathematician's perspective), 
and M. V. Berry [17] (a quantum chaologist's perspective). We are grateful for any 
comments from the reader that would help make what follows fair and balanced. 

M. Gutzwiller was the first to demonstrate that chaotic dynamics is built upon 
unstable periodic orbits in his 1960's work on the quantization of classically 
chaotic quantum systems, where the 'Gutzwiller trace formula' gives the semi- 
classical quantum spectrum as a sum over classical periodic orbits [18, 19, 20, 
12]. Equally important was D. Ruelle's 1970's work on hyperbolic systems, 
where ergodic averages associated with natural invariant measures are expressed 
as weighted sums on the infinite set of unstable periodic orbits embedded in the 
underlying chaotic set [21, 22]. This idea can be traced back to the following 
sources: 1. the foundational 1967 review [ ], where S. Smale proposed as "a wild 
idea in this direction" a (technically incorrect, but prescient) zeta function over 
periodic orbits, 2. the 1965 Artin-Mazur zeta function for counting periodic or- 
bits [10], and 3. the 1956 Selberg number-theoretic zeta functions for Riemann 
surfaces of constant curvature [14]. That one could compute using these infinite 
sets was not clear at all. Ruelle [11] never attempted explicit computations, and 
Gutzwiller only attempted to implement summations over anisotropic Kepler pe- 
riodic orbits by treating them as Ising model configurations [23] (In retrospect, 
Gutzwiller was lucky; it turns out that the more periodic orbits one includes, the 
worse convergence one gets [24]). 

For a long time the convergence of such sums bedeviled the practitioners, un- 
til the mathematically rigorous spectral determinants for hyperbolic deterministic 
flows, and the closely related semiclassicaly exact Gutzwiller Zeta functions were 
recast in terms of highly convergent cycle expansions. Under these circumstances, 
a relatively few short periodic orbits lead to highly accurate long time averages of 
quantities measured in chaotic dynamics and of spectra for quantum systems. The 
idea, in a nutshell, is that long orbits are shadowed by shorter orbits, and the nth 
term in a cycle expansion is the difference between the shorter cycles estimate of 
the period n-cycles' contribution and the exact ^-cycles sum. For unstable, hy- 
perbolic flows, this difference falls off exponentially or super-exponentially |50]. 
Contrary to what some literature says, cycle expansions are no more 'clever re- 
summations' than the Plemelj -Smithies cumulant evaluation of a determinant is a 
'resummation', and their theory is considerably more reassuring than what prac- 
titioners of quantum chaos fear: there is no 'abscissa of absolute convergence', 
there is no 'entropy barrier', and the exponential proliferation of cycles is not the 
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problem. 

Cvitanovic derived 'cycle expansions' in 1986-87, in an effort to prove that 
the mode-locking dimension for critical circle maps discovered by Jensen, Bak 
and Bohr [25] is universal; the same kind of periodic orbits are involved in the 
Henon map, but now in renormalization 'time'. The symbolic dynamics of the 
Henon attractor (the pruning front conjecture [26]) is coded by transition graphs, 
topological entropy is given by roots of their determinants. This observation led to 
the study of convergence of spectral determinants for both discrete-time (iterated 
maps) and continuous-time deterministic flows (both ODEs and PDEs). Cycle 
expansions thus arose not from temporal dynamics, but from studies of scalings in 
period-doubling and cycle-map renormalizations |27, 28, 29]. This work was done 
in collaboration with R. Artuso (PhD 1987-1989), G. Gunaratne, and E. Aurell 
(PhD 1984-1989), and it was written under the watchful eye of parrot Gaspar in 
Fundacao de Faca, Porto Seguro, as two long Recycling of strange sets papers |30, 
27]: 7. Cycle expansions and II. Applications. The main lesson was that one should 
never split theory and applications into papers numbered I and II; part II, which 
covers many interesting results, has barely been glanced at by anyone. 
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The first published paper on these developments was Auerbach et al. |31] 
Exploring chaotic motion through periodic orbits (submitted March 1987). Here 
only a 'level sum' approximation (20.41), 
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to the trace formula is presented as an nth order estimate of the leading Perron- 
Frobenius eigenvalue S(„), and applied to the Henon attractor (Eq. (4) of the above 
paper). (The exact weight of an unstable prime periodic orbit p (for level sum 
(18.7)) had been conjectured by Kadanoff and Tang [ ] in 1984.) Even as it 
was written, the heuristics of this paper was rendered obsolete by the exact cycle 
expansions, and yet, mysteriously, this might be one of the most cited periodic 
orbits papers. 

The first attempt to make cycle expansions accessible to every person was 
condensed into Phys. Rev. Letter, Invariant measurement of strange sets in terms 
of cycles (submitted March 1988) [33]. However, the two long papers by Artuso 
et al. [30, 27] are a better read. 

Several applications of the new methodology are worth mentioning. One was 
the accurate calculation of the leading dozen eigenvalues of the period-doubling 
operator [27, 28, 34]. Another breakthrough was the cycle expansion of determin- 
istic transport coefficients [35, 36, 37], such as diffusion constants without any 
probabilistic assumptions. The classical Boltzmann equation for the evolution of 
1 -particle density is based on Stosszahlansatz, the assumption that velocities of 
colliding particles are not correlated. In periodic orbit theory all correlations are 
included in cycle averaging formulas, such as the cycle expansion for a particle 
diffusing chaotically across a spatially-periodic array. 
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Physicists tend to obsess about matters weightier than iterating maps, so Cvi- 
tanovic and Eckhardt showed that cycle expansions reproduce quantum resonances 
of Eckhardt's 3-disk scatterer [38] to rather impressive accuracy [39] (submitted 
February 1989). Gaspard and Rice published a lovely triptic of articles (submit- 
ted September 1988) about the same 3-disk system (classical, semiclassical and 
quantum scattering) [40, 41, 42]. In 1992 P. E. Rosenqvist [43, 44], in his PhD 
thesis, combined the magic of spectral determinants with their symmetry factor- 
izations [23, 45] to take cycle expansions to ridiculous accuracy; for example, 
periodic orbits up to 10 bounces determine the classical escape rate for a 3-disk 
pinball to be 

y = 0.4103384077693464893384613078192 .... 

Try to extract this from a direct numerical simulation, or a log-log plot of level 
sums (A.l)! Prior to cycle expansions, the best accuracy that Gaspard and Rice 
achieved by applying Markov approximations to the spectral determinant [40] was 
1 significant digit, y - 0.45. 

A 3-disk billiard is exceptionally nice, uniformly hyperbolic repeller. More 
often than not, good symbolic dynamics for a given flow is either not available, 
or its grammar is not finite, or the convergence of cycle expansions is affected 
by non-hyperbolic regions of state space. In those cases truncations such as the 
stability cutoff of Dahlqvist and Russberg [46, 47] and Dettmann and Morriss [48] 
might be helpful. The idea is to truncate the cycle expansion by including only the 
shadowing combinations of pseudo-cycles {pi , p2 ■ ■ ■ , Pk) such that lA^, • • • A Pk \ < 
A max , with the cutoff A max equal to or smaller than the most unstable Ap in the 
data set. 

It is pedagogically easier to motivate sums over periodic orbits by starting with 
discrete time dynamical systems, but most flows of physical interest are continu- 
ous in time. The weighted averages of periodic orbits for continuous time flows 
were introduced by Bowen, who treated them as Poincare section suspensions 
weighted by the 'time ceiling' function, and were incorporated into dynamical 
zeta functions by Parry and Pollicott [49] and Ruelle [50]. For people steeped 
in quantum mechanics it all looked very unfamiliar, so in 1991 Cvitanovic and 
Eckhardt reformulated spectral determinants for continuous time flows along the 
lines of Gutzwiller's derivation of the semi-classical trace formula |51]. As a con- 
sequence, quantum mechanicians [17, 52, 53] tend to cite this paper as the first 
paper on cycle expansions. 

2D billiards are only toys, but quantization of helium is surely not just a game. 
By implementing cycle expansions in 1991, the group of Dieter Wintgen obtained 
a surprisingly accurate helium spectrum [54, 55] from a small set of its shortest 
cycles. This happened 50 years after old quantum theory had failed to do so and 
20 years after Gutzwiller first introduced his quantization of chaotic systems [12]. 

The Copenhagen group gave many conference and seminar talks about cycle 
expansions. In December 1986, Cvitanovic presented results on the periodic-orbit 
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description of the topology of Lozi and Henon attractors and the periodic-orbit 
computation of associated dynamical averages, at the meeting on "Chaos and 
Related Nonlinear Phenomena: Where do we go from here?" This meeting was 
organized by Moshe Shapiro and Itamar Procaccia and held in the kibutz Kiryat 
Anavim. A great meeting, and Celso Grebogi was in the audience. After the 
"Where do we go from here?" meeting, the Maryland group wrote a series of 
papers on unstable periodic orbits, or 'UPOs'. In the first paper [56], Unstable 
periodic orbits and the dimensions of multifractal chaotic attractor (submitted 
September 1987), the focus was on fractal dimensions of chaotic attractors, as 
was the fashion in the late 1980's. They prove that the natural measure of 
a mixing hyperbolic attractor is given by the limit of a sum over the unstable 
periodic points X; of long period n, embedded in a chaotic attractor. Each periodic 
point is weighted by the inverse of the product of its periodic orbit's expanding 
Floquet multipliers Ay, Eq. (14) in their paper: 

po(Ats)=lim X xjeM s . (A.2) 

xjeFixf" 1 
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remark 5.1 



This is an approximate level sum formula for natural measure, a special case of 
(A.l), with leading Perron-Frobenius eigenvalue s = 0 (no escape), and f3 = 0 
(observable =1). The first paper does cite Auerbach et al. [31], in which the 
same approximate level sum seems to have been published for the first time. Ever 
since then, various cyclist teams cite exclusively their own papers and some of the 
mathematicians of the 1970's. 

So you have now written a paper that uses periodic orbits. What is one to cite? 
Work by Sinai-Bowen-Ruelle is smarter and more profound than the vast majority 
of 'chaos' publications from the 1980s on. If you are not actually computing any- 
thing using periodic orbits and are reluctant to refer to recent contributions, you 
can safely credit Ruelle [22, 1 1] for deriving the dynamical (or Ruelle) zeta func- 
tion, and Gutzwiller for formulating semiclassical quantization as a Zeta function 
over unstable periodic orbits [12, 13]. There are no cycle expansions in these pa- 
pers or in Bowen's work (see, for example, the description in Scholarpedia.org). 
If you have computed something using sums weighted by periodic-orbit weights, 
cite the first paper that introduced them, as well as a useful up-to-date reference, 
which in this case is ChaosBook.org. Do not faint because this webbook is 
available on (gasp!) the internet - it's third millennium, and having a continuously 
updated, hyperlinked and reliable reference has its virtues. 

Depending on the context, one should also cite 1) Zoldi and Greenside |57] for 
being the second to determine unstable periodic orbits (127 of them) for Kuramoto- 
Sivashinsky, on a domain larger than what was studied in ref. [79], 2) Lopez et 
al. [58] for being the first to determine relative periodic orbits in a spatio-temporal 
PDE (complex Landau-Ginzburg), and 3) Kazantsev [59] for being the first to de- 
termine periodic orbits in a weather model, and for his variational method for find- 
ing periodic orbits. We love these authors, but not for their 'escape-time weight- 

6F remark 20.1 

mg . 
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While derivations of (A.l) by Kadanoff and Tang 1984 and Auerbach et al. 
1987 were heuristic, Grebogi, Ott and Yorke 1987 prove (A.2) by taking the 
n — > oo limit. In actual computations it would be madness to attempt to take such 
limit, as longer and longer periodic orbits are exponentially more and more un- 
stable, exponentially growing in number, and non-computable; and the natural 
measure po is everywhere singular, with support on a fractal set, with its n — > oo 
limit even more impossible to compute. And why would one take this limit? The 
whole point of cycle expansions is that it is smarter to compute averages without 
constructing po- 

Taking a limit to obtain a proof is good mathematics, but in statistical mechan- 
ics a partition function is not a limit of anything; it is the full sum of all states. 
Likewise, its ergodic theory cousin, the spectral determinant is not a long-time 
limit; it is the exact sum over all periodic orbits. Cycle expansions were intro- 
duced in a non-rigorous manner, on purpose [ ] : the exposition was meant not to 
frighten a novice, innocent of Borel measurable a to Q. sets. This was set right in 
the elegant PhD thesis of H. H Rugh's in 1992, The correlation spectrum for hy- 
perbolic analytic maps [60], which proves that the zeros of spectral determinants 
are indeed the Ruelle-Pollicott resonances [61, 62, 63]. The proof is well within 
mathematicians' comfort zone, so they tend to cite Rugh's paper as the paper on 
'Fredholm determinants', and, as always, throw in "a sense of Grothendieck" for 
good measure [16, 64], without citing earlier papers on cycle expansions. 

If you intend to determine and use periodic orbits, here is the message: Heuris- 
tic 'level sums' are approximations to the exact trace formulas (that are derived 
here, in ChaosBook, and Gaspard monograph [65] with no more effort than the 
heuristic approximations), not smart for computations; faster convergence is ob- 
tained by utilizing the shadowing that is built into the exact cycle expansions of 
dynamical zeta functions and spectral determinants. Cycle expansions are not 
heuristic, in classical deterministic dynamics they are exact expansions in the un- 
stable periodic orbits [33, 30, 27]; in quantum mechanics and stochastic mechan- 
ics they are semi-classically exact. So why would one prefer a limit of a heuristic 
sum such as (A.2) to the exact spectral determinant, convergent exact periodic 
orbits sums, and exact periodic orbits formulas for dynamical averages of observ- 
ables? It is not even wrong. Perhaps if one is very fond of baker's maps [66], 
which, being piecewise linear, have no cycle expansion curvature terms, one does 
not appreciate the shadowing cancelations built into the spectral determinants and 
their cycle expansions. That might be the reason why linear thinkers stop at the 
level sum (A.2). 
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A.5 Dynamicist's vision of turbulence 



The key theoretical concepts that form the basis of dynamical theories of tur- 
bulence are rooted in the work of Poincare, Hopf, Smale, Ruelle, Gutzwiller 
and Spiegel. In Poincare's 1889 analysis of the three-body problem |67], he in- 
troduced the geometric approach to dynamical systems and methods that lie at 
the core of the theory developed here: qualitative topology of state space flows, 
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Poincare sections, key roles played by equilibria, periodic orbits, heteroclinic con- 
nections, and their stable/unstable manifolds. 

In a seminal 1948 paper [68], Ebehardt Hopf visualized the function space 
of allowable Navier-Stokes velocity fields as an infinite-dimensional state space, 
parameterized by viscosity, boundary conditions and external forces, with instan- 
taneous state of a flow represented by a point in this state space. Laminar flows 
correspond to equilibrium points, globally stable for sufficiently large viscosity. 
As the viscosity decreases (as the Reynolds number increases), turbulent states 
set in, represented by chaotic state space trajectories. Hopf's observation that 
viscosity causes a contraction of state space volumes under the action of dy- 
namics led to his key conjecture: that long-term, typically observed solutions of 
the Navier-Stokes equations lie on finite-dimensional manifolds embedded in the 
infinite-dimensional state space of allowed states. Hopf's manifold, known today 
as the 'inertial manifold,' is well-studied in the mathematics of spatio-temporal 
PDEs. Its finite dimensionality for non-vanishing 'viscosity' parameter has been 
rigorously established in certain settings by Foias and collaborators [59]. Hopf 
presciently noted that "the geometrical picture of the phase flow is, however, not 
the most important problem of the theory of turbulence. Of greater importance is 
the determination of the probability distributions associated with the phase flow". 
Hopf's call for understanding probability distributions associated with the phase 
flow has indeed proven to be a key challenge, one in which dynamical systems the- 
ory has made the greatest progress in the last half century. In particular, the Sinai- 
Ruelle-Bowen ergodic theory of 'natural' or SRB measures has played a critical 
role in understanding dissipative systems with chaotic behavior [7, 70, 71, 11]. 

Hopf noted "[t]he great mathematical difficulties of these important problems 
are well known and at present the way to a successful attack on them seems hope- 
lessly barred. However, there is no doubt that many characteristic features of the 
hydrodynamical phase flow occur in a much larger class of similar problems gov- 
erned by non-linear space-time systems. In order to gain insight into the nature 
of hydrodynamical phase flows we are, at present, forced to find and to treat sim- 
plified examples within that class." Hopf's call for geometric state space analysis 
of simplified models first came to fulfillment with the influential Lorenz's trunca- 
tion [72] of the Rayleigh-Benard convection state space. The Proper Orthogonal 
Decomposition (POD) models of boundary-layer turbulence brought this type of 
analysis closer to physical hydrodynamics [73, 74]. Further significant progress 
has proved possible for systems such as the 1 -spatial dimension Kuramoto-Siva- 
shinsky flow [75, 76], which is a paradigmatic model of turbulent dynamics, as 
well as one of the most extensively studied spatially extended dynamical systems. 

Today, as we hope to have convinced the reader, with modern computation and 
experimental insights, the way to a successful attack on the full Navier-Stokes 
problem is no longer "hopelessly barred." We address the challenge in a way 
Hopf could not divine, employing methodology developed only within the past 
two decades, explained in depth in this book. 

Hopf, however, to the best of our knowledge, never suggested that turbulent 
flow should be analyzed in terms of 'recurrent flows', i.e. time-periodic solutions 
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of the defining PDEs. The story so far goes like this: in 1960 Ed Spiegel was 
Robert Kraichnan's research associate. Kraichnan told him, "Flow follows a reg- 
ular solution for a while, then another one, then switches to another one; that's 
turbulence." It was not too clear, but Kraichnan's vision of turbulence moved Ed. 
In 1962 Spiegel and Derek Moore investigated a set of 3rd order convection equa- 
tions which seemed to follow one periodic solution, then another, and continued 
going from periodic solution to periodic solution. Ed told Derek, "This is turbu- 
lence!" and Derek said "This is wonderful!" He gave a lecture at Caltech in 1964 
and came back very angry. They pilloried him there. "Why is this turbulence?" 
they kept asking and he could not answer, so he expunged the word 'turbulence' 
from their 1966 paper [77] on periodic solutions. In 1970 Spiegel met Kraichnan 
and told him, "This vision of turbulence of yours has been very useful to me." 
Kraichnan said: "That wasn't my vision, that was Hopf 's vision." What Hopf ac- 
tually said and where he said it remains deeply obscure to this very day. There are 
papers that lump him together with Landau, as the 'Landau-Hopf 's incorrect the- 
ory of turbulence,' a proposal to deploy incommensurate frequencies as building 
blocks of turbulence. This was Landau's guess and was the only one that could be 
implemented at the time. 

The first paper to advocate a periodic orbit description of turbulent flows is 
thus the 1966 Spiegel and Moore paper [77, 78]. Thirty years later, in 1996 
Christiansen et al. [79] proposed (in what is now the gold standard for exemplary 
ChaosBook . org/projects) that the periodic orbit theory be applied to infinite- 
dimensional flows, such as the Navier-Stokes, using the Kuramoto-Sivashinsky 
model as a laboratory for exploring the dynamics close to the onset of spatiotem- 
poral chaos. The main conceptual advance in this initial foray was the demonstra- 
tion that the high-dimensional (16-64 mode Galerkin truncations) dynamics of this 
dissipative flow can be reduced to an approximately 1 -dimensional Poincare return 
map s —> f(s), by choosing the unstable manifold of the shortest periodic orbit as 
the intrinsic curvilinear coordinate from which to measure near recurrences. For 
the first time for any nonlinear PDE, some 1,000 unstable periodic orbits were 
determined numerically. What was novel about this work? First, dynamics on a 
strange attractor embedded in a high-dimensional space was essentially reduced 
to 1 -dimensional dynamics. Second, the solutions found provided both a quali- 
tative description and highly accurate quantitative predictions for the given PDE 
with the given boundary conditions and system parameter values. 

How is it possible that the theory originally developed for low dimensional 
dynamical systems can work in the oo -dimensional PDE state spaces? For dis- 
sipative flows the number of unstable, expanding directions is often finite and 
even low-dimensional; perturbations along the oo of contracting directions heal 
themselves, and play only a minor role in cycle weights - hence the long-time dy- 
namics is effectively finite dimensional. For a more precise statement, see Ginelli 
et al. [80]. 

The 1996 project went as far as one could with methods and computation re- 
sources available, until 2002, when new variational methods were introduced |81, 
82, 83]. Considerably more unstable, higher-dimensional regimes have become 
accessible [8 ]. Of course, nobody really cares about Kuramoto-Sivashinsky. It is 
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only a model; it was not until the full Navier-Stokes calculations of Eckhardt, Ker- 
swell and collaborators [85, 86, 87] that the fluid dynamics community started to 
appreciate that the dynamical (as opposed to statistical) analysis of wall-bounded 
flows is now feasible [88]. 



A.6 Gruppenpest 

How many Tylenols should I take with this?... (never took 
group theory, still need to be convinced that there is any 
use to this beyond mind-numbing formalizations.) 
— Fabian Waleffe, forced to read chapter 9. 

If you are not fan of chapter 9 "World in a mirror," and its elaborations, you 
are not alone. Or, at least, you were not alone in the 1930s. That is when the 
articles by two young mathematical physicists, Eugene Wigner and Johann von 
Neumann [8 C )], and Wigner's 1931 Gruppentheorie [90] started Die Gruppenpest 
that plagues us to this very day. 

According to John Baez [91], the American physicist John Slater, inventor of 
the 'Slater determinant,' is famous for having dismissed groups as unnecessary to 
physics. He wrote: 

"It was at this point that Wigner, Hund, Heitler, and Weyl entered the picture 
with their 'Gruppenpest:' the pest of the group theory [actually, the correct trans- 
lation is 'the group plague'] ... The authors of the 'Gruppenpest' wrote papers 
which were incomprehensible to those like me who had not studied group the- 
ory... The practical consequences appeared to be negligible, but everyone felt that 
to be in the mainstream one had to learn about it. I had what I can only describe 
as a feeling of outrage at the turn which the subject had taken ... it was obvious 
that a great many other physicists were disgusted as I had been with the group- 
theoretical approach to the problem. As I heard later, there were remarks made 
such as 'Slater has slain the 'Gruppenpest". I believe that no other piece of work 
I have done was so universally popular." 

A. John Coleman writes in Groups and Physics - Dogmatic Opinions of a 
Senior Citizen [92]: "The mathematical elegance and profundity of Weyl's book 
[Theory of Groups and QM] was somewhat traumatic for the English-speaking 
physics community. In the preface of the second edition in 1930, after a visit to 
the USA, Weyl wrote, "It has been rumored that the 'group pest' is gradually being 
cut out of quantum physics. This is certainly not true in so far as the rotation and 
Lorentz groups are concerned; ..." In the autobiography of J. C. Slater, published 
in 1975, the famous MIT physicist described the "feeling of outrage" he and other 
physicists felt at the incursion of group theory into physics at the hands of Wigner, 
Weyl et al. In 1935, when Condon and Shortley published their highly influential 
treatise on the "Theory of Atomic Spectra", Slater was widely heralded as having 
"slain the Gruppenpest". Pages 10 and 11 of Condon and Shortley's treatise are 
fascinating reading in this context. They devote three paragraphs to the role of 
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group theory in their book. First they say, "We manage to get along without 
it." This is followed by a lovely anecdote. In 1928 Dirac gave a seminar, at 
the end of which Weyl protested that Dirac had said he would make no use of 
group theory but that in fact most of his arguments were applications of group 
theory. Dirac replied, "I said that I would obtain the results without previous 
knowledge of group theory!" Mackey, in the article referred to previously, argues 
that what Slater and Condon and Shortley did was to rename the generators of the 
Lie algebra of SO(3) as "angular momenta" and create the feeling that what they 
were doing was physics and not esoteric mathematics." 

From AIP Wigner interview: AIP: "In that circle of people you were working 
with in Berlin, was there much interest in group theory at this time?" WIGNER: 
"No. On the opposite. Schrodinger coined the expression, 'Gruppenpest' must 
be abolished." "It is interesting, and representative of the relations between math- 
ematics and physics, that Wigner's paper was originally submitted to a Springer 
physics journal. It was rejected, and Wigner was seeking a physics journal that 
might take it when von Neumann told him not to worry, he would get it into the 
Annals of Mathematics. Wigner was happy to accept his offer [93]." 



A.7 Death of the Old Quantum Theory 

In 1913 Otto Stern and Max Theodor Felix von Laue went 
up for a walk up the Uetliberg. On the top they sat down 
and talked about physics. In particular they talked about 
the new atom model of Bohr. There and then they made 
the 'Uetli Schwur:' If that crazy model of Bohr turned out 
to be right, then they would leave physics. It did and they 
didn't. 

— A. Pais, Inward Bound: of Matter and Forces in 
the Physical World 

One afternoon in May 1991 , Dieter Wintgen is sitting in his office at the Niels Bohr 
Institute beaming with the unparalleled glee of a boy who has just committed a 
major mischief. The starting words of the manuscript he has just penned are 

The failure of the Copenhagen School to obtain a reasonable . . . 

Wintgen was 34 years old at the time, a scruffy kind of guy, always wearing san- 
dals and holed out jeans, the German flavor of a 90's left winger and mountain 
climber. He worked around the clock with his students Gregor Tanner and Klaus 
Richter to complete the work that Bohr himself would have loved to have seen 
done back in 1916: a 'planetary' calculation of the helium spectrum. 

Never mind that the 'Copenhagen School' refers not to the old quantum the- 
ory, but to something else. The old quantum theory was no theory at all; it was a 
set of rules bringing some order to a set of phenomena which defied logic of clas- 
sical theory. The electrons were supposed to describe planetary orbits around the 
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nucleus; their wave aspects were yet to be discovered. The foundations seemed 
obscure, but Bohr's answer for the once-ionized helium to hydrogen ratio was 
correct to five significant figures and hard to ignore. The old quantum theory 
marched on, until by 1924 it reached an impasse: the helium spectrum and the 
Zeeman effect were its death knell. 

Since the late 1890's it had been known that the helium spectrum consists of 
the orthohelium and parahelium lines. In 1915 Bohr suggested that the two kinds 
of helium lines might be associated with two distinct shapes of orbits (a suggestion 
that turned out to be wrong). In 1916 he got Kramers to work on the problem, and 
he wrote to Rutherford, "I have used all my spare time in the last months to make 
a serious attempt to solve the problem of ordinary helium spectrum ... I think 
really that at last I have a clue to the problem." To other colleagues he wrote that 
"the theory was worked out in the fall of 1916" and of having obtained a "partial 
agreement with the measurements." Nevertheless, the Bohr-Sommerfeld theory, 
while by and large successful for hydrogen, was a disaster for neutral helium. 
Heroic efforts of the young generation, including Kramers and Heisenberg, were 
of no avail. 

For a while Heisenberg thought that he had the ionization potential for he- 
lium, which he had obtained by a simple perturbative scheme. He wrote enthu- 
siastic letters to Sommerfeld and was drawn into a collaboration with Max Born 
to compute the spectrum of helium using Born's systematic perturbative scheme. 
To a first approximation, they reproduced the earlier calculations. The next level 
of corrections turned out to be larger than the computed effect. The concluding 
paragraph of Max Born's classic "Vorlesungen iiber Atommechanik" from 1925 
sums it up in a somber tone [94]: 

(...) the systematic application of the principles of the quantum theory 
(...) gives results in agreement with experiment only in those cases where 
the motion of a single electron is considered; it fails even in the treatment 
of the motion of the two electrons in the helium atom. 

This is not surprising, for the principles used are not really consistent. 
(...) A complete systematic transformation of the classical mechanics into 
a discontinuous mechanics is the goal towards which the quantum theory 
strives. 

That year Heisenberg suffered a bout of hay fever, and the old quantum theory 
was dead. In 1926 he gave the first quantitative explanation of the helium spec- 
trum. He used wave mechanics, electron spin and the Pauli exclusion principle, 
none of which belonged to the old quantum theory. As a result, planetary orbits 
of electrons were cast away for nearly half a century. 

Why did Pauli and Heisenberg fail with the helium atom? It was not the fault 
of the old quantum mechanics, but rather it reflected their lack of understanding of 
the subtleties of classical mechanics. Today we know what they missed in 1913- 
24, the role of conjugate points (topological indices) along classical trajectories 
was not accounted for, and they had no idea of the importance of periodic orbits 
in nonintegrable systems. 
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Since then the calculation for helium using the methods of the old quantum 
mechanics has been fixed. Leopold and Percival [95] added the topological indices 
in 1980, and in 1991 Wintgen and collaborators [54, 55] understood the role of 
periodic orbits. Dieter had good reasons to gloat; while the rest of us were prepar- 
ing to sharpen our pencils and supercomputers in order to approach the dreaded 
3-body problem, they just went ahead and did it. What it took-and much else-is 
described in this book. 

One is also free to ponder what quantum theory would look like today if all this 
was worked out in 1917. In 1994 Predrag Cvitanovic gave a talk in Seattle about 
helium and cycle expansions to-inter alia-Hans Bethe, who loved it so much that 
after the talk he pulled Predrag aside and they trotted over to Hans' secret place: 
the best lunch on campus (Business School). Predrag asked: "Would quantum 
mechanics look different if in 1917 Bohr and Kramers et al. figured out how to 
use the helium classical 3-body dynamics to quantize helium?" 

Bethe was very annoyed. He responded with an exasperated look - in Bethe 
Deutschinglish (if you have ever talked to him, you can do the voice over your- 
self): 

"It would not matter at all!" 
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Commentary 

Remark A.l Notion of global foliations. For each paper cited in dynamical systems 
literature, there are many results that went into its development. As an example, take the 
notion of global foliations that we attribute to Smale. As far as we can trace the idea, it 
goes back to Rene Thom; local foliations were already used by Hadamard. Smale attended 
a seminar of Thom in 1958 or 1959. In that seminar Thom was explaining his notion of 
transversality. One of Thorn's disciples introduced Smale to Brazilian mathematician 
Peixoto. Peixoto (who had learned the results of the Andronov-Pontryagin school from 
Lefschetz) was the closest Smale had ever come until then to the Andronov-Pontryagin 
school. It was from Peixoto that Smale learned about structural stability, a notion that got 
him enthusiastic about dynamical systems, as it blended well with his topological back- 
ground. It was from discussions with Peixoto that Smale got the problems in dynamical 
systems that lead him to his 1960 paper on Morse inequalities. The next year Smale pub- 
lished his result on the hyperbolic structure of the non-wandering set. Smale was not the 
first to consider a hyperbolic point, Poincare had already done that; but Smale was the 
first to introduce a global hyperbolic structure. By 1960 Smale was already lecturing on 
the horseshoe as a structurally stable dynamical system with an infinity of periodic points 
and promoting his global viewpoint. (R. Mainieri) 



Remark A.2 Levels of ergodicity. In the mid 1970's A. Katok and Ya.B. Pesin tried 
to use geometry to establish positive Lyapunov exponents. A. Katok and J.-M. Strelcyn 
carried out the program and developed a theory of general dynamical systems with sin- 
gularities. They studied uniformly hyperbolic systems (as strong as Anosov's), but with 
sets of singularities. Under iterations a dense set of points hits the singularities. Even 
more important are the points that never hit the singularity set. In order to establish some 
control over how they approach the set, one looks at trajectories that approach the set by 
some given e", or faster. 

Ya.G. Sinai, L. Bunimovich and N.I. Chernov studied the geometry of billiards in a 
very detailed way. A. Katok and Ya.B. Pesin's idea was much more robust: look at the 
discontinuity set, take an e neighborhood around it. Given that the Lebesgue measure is 
e a and the stability grows not faster than (distance)". A. Katok and J.-M. Strelcyn proved 
that the Lyapunov exponent is non-zero. 

In mid 1980's Ya.B. Pesin studied the dissipative case. Now the problem has no 
invariant Lebesgue measure. Assuming uniform hyperbolicity, with singularities, and 
tying together Lebesgue measure and discontinuities, and given that the stability grows 
not faster than (distance)", Ya.B. Pesin proved that the Lyapunov exponent is non-zero, 
and that SRB measure exists. He also proved that the Lorenz, Lozi and Byelikh attractors 
satisfy these conditions. 

In the systems that are uniformly hyperbolic, all trouble is in differentials. For the 
Henon attractor, already the differentials are nonhyperbolic. The points do not separate 
uniformly, but the analogue of the singularity set can be obtained by excising the regions 
that do not separate. Hence there are 3 levels of ergodic systems: 

1 . Anosov flow 
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2. Anosov flow + singularity set: For the Hamiltonian systems the general case is 
studied by A. Katok and J.-M. Strelcyn, and the billiards case by Ya.G. Sinai and 
L. Bunimovich. The dissipative case is studied by Ya.B. Pesin. 

3. Henon case: The first proof was given by M. Benedicks and L. Carleson [ 96, 97, 
98]. A more readable proof is given in M. Benedicks and L.-S. Young [ 99]. 

(based on Ya.B. Pesin's comments) 

Remark A.3 Einstein did it? The first hint that chaos is afoot in quantum mechanics 
was given in a note by A. Einstein [ 100]. The total discussion is a one sentence remark. 
Einstein being Einstein, this one sentence has been deemed sufficient to give him the 
credit for being the pioneer of quantum chaos [ 13, 101]. We asked about the paper two 
people from that era, Sir Rudolf Peierls and Abraham Pais; neither had any recollection 
of the 1917 article. However, Theo Geisel has unearthed a reference that shows that 
in early 20s Born did have a study group meeting in his house that studied Poincare's 
Mechanique Celeste [67]. In 1954 Fritz Reiche, who had previously followed Einstein 
as professor of physics in Breslau (now Wroclaw, Poland), pointed out to J.B. Keller 
that Keller's geometrical semiclassical quantization was anticipated by the long forgotten 
paper by A. Einstein [100]. In this way an important paper written by the physicist who 
at the time was the president of German Physical Society, and the most famous scientist 
of his time, came to be referred to for the first time by Keller [?], 41 years later. But 
before Ian Percival included the topological phase, and Wintgen and students recycled the 
Helium atom, knowing Mechanique Celeste was not enough to complete Bohr's original 
program. 

Remark A.4 Berry-Keating conjecture. A very appealing proposal in the context 
of semiclassical quantization is due to M. Berry and J. Keating [ 102]. The idea is to im- 
prove cycle expansions by imposing unitarity as a functional equation ansatz. The cycle 
expansions that they use are the same as the original ones described above [30], but the 
philosophy is quite different; the claim is that the optimal estimate for low eigenvalues of 
classically chaotic quantum systems is obtained by taking the real part of the cycle expan- 
sion of the semiclassical zeta function, cut off at the appropriate cycle length. M. Sieber, 
G. Tanner and D. Wintgen, and P. Dahlqvist find that their numerical results support this 
claim; F. Christiansen and P. Cvitanovic do not find any evidence in their numerical re- 
sults. The usual Riemann-Siegel formulas exploit the self-duality of the Riemann and 
other zeta functions, but there is no evidence of such symmetry for generic Hamiltonian 
flows. Also from the point of hyperbolic dynamics discussed above, proposal in its cur- 
rent form belongs to the category of crude cycle expansions; the cycles are cut off by a 
single external criterion, such as the maximal cycle time, with no regard for the topology 
and the curvature corrections. While the functional equation conjecture is not in its final 
form yet, it is very intriguing and fruitful research inspiration. 

The real life challenge are generic dynamical flows, which fit neither of extreme ide- 
alized settings, Smale horseshoe on one end, and the Riemann zet function on the other. 

Remark A.5 Sources. The tale of appendix A. 7, aside from a few personal recollec- 
tions, is in large part lifted from Abraham Pais' accounts of the demise of the old quantum 
theory [103, 104], as well as Jammer's account [105]. In August 1994 Dieter Wintgen 
died in a climbing accident in the Swiss Alps. 
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Appendix B 
Go straight 



AHamiltonian system is said to be integrable if one can find a change of 
coordinates to an action-angle coordinate frame where the phase-space 
dynamics is described by motion on circles, one circle for each degree 
of freedom. In the same spirit, a natural description of a hyperbolic, unstable 
flow would be attained if one found a change of coordinates into a frame where 
the stable/unstable manifolds are straight lines, and the flow is along hyperbolas. 
Achieving this globally for anything but a handful of contrived examples is a pipe 
dream. Nevertheless, as we shall now show, we can make some headway on 
straightening out the flow locally. 

There is much more to this story than what we touch upon here: other tricks 
and methods to construct regularizations, what kind of singularities could be reg- 
ularized, etc.. Even though such nonlinear coordinate transformations are very 
important, especially in celestial mechanics, we shall not use them much in what 
follows, so you can safely skip this chapter on the first reading. Except, perhaps, 
you might like transformations that turn a Keplerian ellipse into a harmonic oscil- 
lator (example B.2) and regularize the 2-body Coulomb collisions (sect. B.2) in 
classical helium. 



B.l Rectification of flows 



A profitable way to exploit invariance of dynamics under smooth conjugacies is 
to use it to pick out the simplest possible representative of an equivalence class. 
These are just words, as we have no clue how to pick such 'canonical' represen- 
tations, but for smooth flows we can always do it locally and for sufficiently short 
time, by appealing to the rectification theorem, a fundamental theorem of ordi- 
nary differential equations. The theorem tells us that a solution exists (at least for 
a short time interval) and what it looks like. The rectification theorem holds in the 
neighborhood of points of the vector field v(x) that are not singular, that is, ev- 
erywhere except for the equilibrium points (2.9), and points at which v is infinite. 
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According to the theorem, in a small neighborhood of a non-singular point there 
exists a change of coordinates y = h(x) such that x - v(x) in the new, canonical 
coordinates takes form 



yi = yi 
yd = i, 



yd-\ 



(B.l) 



with unit velocity flow along yj, and no flow along any of the remaining directions. 
This is an example of a one-parameter Lie group of transformations, with finite 
time r action 

y'i = yi, i = l,2,...,d-\ 

y' d = yd + T. 



exercise 9.8 
exercise B.1 



Example B.l Harmonic oscillator, rectified: As a simple example of global 
rectification of a flow consider the harmonic oscillator 



9 



(B.2) 



The trajectories x(t) = (q(t), p(t)) circle around the origin, so a fair guess is that the 
system would have a simpler representation in polar coordinates y = (r, 9): 



Y l (r,9) = rcosd 



q — h 

p = h 2 l (r,ff) = rsin 



(B.3) 



The Jacobian matrix, dhi/dxj, of the transformation is 



h' 



cos 6 sin# 
sin 0 cos 8 



(B.4) 



resulting in (2.23) of rectified form 



( cos 9 sin 0 
sinfl cos 9 



exercise 5.1 



(B.5) 



In the new coordinates the radial coordinate r is constant, and the angular coordinate 
9 wraps around a cylinder with constant angular velocity. There is a subtle point in this 
change of coordinates: the domain of the map hr x is not the plane R 2 , but rather the 
plane minus the origin. We mapped a plane into a cylinder, and coordinate transforma- 
tions should not change the topology of the space in which the dynamics takes place; 
the coordinate transformation is not defined on the equilibrium point x = (0, 0), or r = 0. 
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Figure B.l: Coordinates for the helium three body 
problem in the plane. 



Figure B.2: Collinear helium, with the two electrons 
on opposite sides of the nucleus. 




B.2 Collinear helium 

(G. Tanner) 

So far much has been said about 1 -dimensional maps, game of pinball and other 
curious but rather idealized dynamical systems. If you have become impatient and 
started wondering what good are the methods learned so far in solving real life 
physical problems, good news are here. We will apply here concepts of nonlinear 
dynamics to nothing less than the helium, a dreaded three -body Coulomb problem. 

Can we really jump from three static disks directly to three charged particles 
moving under the influence of their mutually attracting or repelling forces? It 
turns out, we can, but we have to do it with care. The full problem is indeed 
not accessible in all its detail, but we are able to analyze a somewhat simpler 
subsystem-collinear helium. This system plays an important role in the classical 
and quantum dynamics of the full three-body problem. 

The classical helium system consists of two electrons of mass 11% and charge 
—e moving about a positively charged nucleus of mass ny ie and charge +2e. 

The helium electron-nucleus mass ratio mh e lm e = 1836 is so large that we 
may work in the infinite nucleus mass approximation rn/ ie = 00, fixing the nucleus 
at the origin. Finite nucleus mass effects can be taken into account without any 
substantial difficulty. We are now left with two electrons moving in three spatial 
dimensions around the origin. The total angular momentum of the combined elec- 
tron system is still conserved. In the special case of angular momentum L - 0, the 
electrons move in a fixed plane containing the nucleus. The three body problem 
can then be written in terms of three independent coordinates only, the electron- 
nucleus distances r\ and r2 and the inter-electron angle 0, see figure B.l. 

This looks like something we can lay our hands on; the problem has been 
reduced to three degrees of freedom, six phase-space coordinates in all, and the 
total energy is conserved. But let us go one step further; the electrons are attracted 
by the nucleus but repelled by each other. They will tend to stay as far away from 
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each other as possible, preferably on opposite sides of the nucleus. It is thus worth 
having a closer look at the situation where the three particles are all on a line with 
the nucleus being somewhere between the two electrons. If we, in addition, let the 
electrons have momenta pointing towards the nucleus as in figureB.2, then there 
is no force acting on the electrons perpendicular to the common interparticle axis. 
That is, if we start the classical system on the dynamical subspace 0 = tt,4® = 0, 
the three particles will remain in this collinear configuration for all times. 



B.2.1 Scaling 

In what follows we will restrict the dynamics to this collinear subspace. It is a 
system of two degrees of freedom with the Hamiltonian 

l/ 9 9 \ 2e 2 2e 2 e 2 

H=x-(p\ + p 2 2) + ^—=E, (B.6) 

2m e v i n r 2 n + r 2 

where E is the total energy. As the dynamics is restricted to the fixed energy shell, 
the four phase-space coordinates are not independent; the energy shell dependence 
can be made explicit by writing 



{n,r 2 ,p\,P2) -> (ri(E),r 2 (E), pi(E), p 2 (E)) . 

We will first consider the dependence of the dynamics on the energy E. A 
simple analysis of potential versus kinetic energy tells us that if the energy is 
positive both electrons can escape to — > oo, i = 1,2. More interestingly, a 
single electron can still escape even if E is negative, carrying away an unlimited 
amount of kinetic energy, as the total energy of the remaining inner electron has no 
lower bound. Not only that, but one electron will escape eventually for almost all 
starting conditions. The overall dynamics thus depends critically on whether E > 
0 or E < 0. But how does the dynamics change otherwise with varying energy? 
Fortunately, not at all. Helium dynamics remains invariant under a change of 
energy up to a simple scaling transformation; a solution of the equations of motion 
at a fixed energy Eq = - 1 can be transformed into a solution at an arbitrary energy 
E < 0 by scaling the coordinates as 



r,(E) = 



(-£) 



n, 



?i(E) = ^-m e E pu i= 1,2, 



together with a time transformation t(E) - ^mpi-Ey^ 12 t. We include the 
electron mass and charge in the scaling transformation in order to obtain a non- 
dimensionalized Hamiltonian of the form 



1 



2 2 r\ r 2 r\ + r 2 



-1 



(B.7) 
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The case of negative energies chosen here is the most interesting one for us. It 
exhibits chaos, unstable periodic orbits and is responsible for the bound states and 
resonances of the quantum problem. 



B.2.2 Regularization of two-body collisions 

Next, we have a closer look at the singularities in the Hamiltonian (B.7). When- 
ever two bodies come close to each other, accelerations become large, numerical 
routines require lots of small steps, and numerical precision suffers. No numerical 
routine will get us through the singularity itself, and in collinear helium electrons 
have no option but to collide with the nucleus. Hence a regularization of the dif- 
ferential equations of motions is a necessary prerequisite to any numerical work 
on such problems, both in celestial mechanics (where a spaceship executes close 
approaches both at the start and its destination) and in quantum mechanics (where 
much of semiclassical physics is dominated by returning classical orbits that probe 
the quantum wave function at the nucleus). 

There is a fundamental difference between two-body collisions r\ = 0 or r2 = 
0, and the triple collision r\ = r2 = 0. Two-body collisions can be regularized, 
with the singularities in equations of motion removed by a suitable coordinate 
transformation together with a time transformation preserving the Hamiltonian 
structure of the equations. Such regularization is not possible for the triple colli- 
sion, and solutions of the differential equations can not be continued through the 
singularity at the origin. As we shall see, the chaos in collinear helium originates 
from this singularity of triple collisions. 

A regularization of the two-body collisions is achieved by means of the Kust- 
aanheimo-Stiefel (KS) transformation, which consists of a coordinate dependent 
time transformation which stretches the time scale near the origin, and a canonical 
transformation of the phase-space coordinates. In order to motivate the method, 
we apply it first to the 1 -dimensional Kepler problem 

1 , 2 

H=-p 2 --=E. (B.8) 



Example B.2 Keplerian ellipse, rectified: To warm up, consider the E - 0 case, 
starting at x = 0 at t = 0. Even though the equations of motion are singular at the initial 
point, we can immediately integrate 




by means of separation of variables 

yfxdx = 2dt, x = (3t)K (B.9) 

and observe that the solution is not singular. The aim of regularization is to compensate 
for the infinite acceleration at the origin by introducing a fictitious time, in terms of which 
the passage through the origin is smooth. 
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A time transformation dt = f(q, p)dr for a system described by a Hamiltonian 
H(q, p) = E leaves the Hamiltonian structure of the equations of motion unaltered, if 
the Hamiltonian itself is transformed into H(q,p) = f(q,p)(H(q,p) - E). For the 1- 
dimensional Coulomb problem with (B.8) we choose the time transformation dt - xdr 
which lifts the \x\ — > 0 singularity in (B.8) and leads to a new Hamiltonian 

< H=^xp 2 -2-Ex=0. (B.10) 

The solution (B.9) is now parameterized by the fictitous time dr through a pair of equa- 
tions 

X = T 2 , t=-7 3 . 

3 

The equations of motion are, however, still singular as x -» 0: 

d 2 x 1 dx 

— = -+xE. 

dr- 2x dr 

Appearance of the square root in (B.9) now suggests a canonical transformation of 
form 

* = Q 2 , P = ^q (B.H) 
which maps the Kepler problem into that of a harmonic oscillator with Hamiltonian 

H(Q,P)= \p 2 -EQ 2 = 2, (B.12) 

o 

with all singularities completely removed. 



We now apply this method to collinear helium. The basic idea is that one seeks 
a higher-dimensional generalization of the 'square root removal' trick 5^. 11), by 
introducing a new vector Q with property r = |<2p . In this simple 1 -dimensional 
example the KS transformation can be implemented by 

n = Q\, r 2 = Q 2 2 , Pi = £r, P^ = TFT (B " 13) 

and reparameterization of time by dr = dt/r^. The singular behavior in the 
original momenta at r\ or r 2 = 0 is again compensated by stretching the time 
scale at these points. The Hamiltonian structure of the equations of motions with 
respect to the new time t is conserved, if we consider the Hamiltonian 

H ko = l(Q 2 2 Pj + Q\P\) - 2R 2 n + Q\Q\(-E + l/Rj 2 ) = 0 (B.14) 
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a) 



Figure B.3: (a) A typical trajectory in the [r\, r 2 ] 
plane; the trajectory enters here along the r\ axis 
and escapes to infinity along the r 2 axis; (b) 
Poincare map (?2=0) for collinear helium. Strong 
chaos prevails for small r\ near the nucleus. 




b) 



with R\2 = (<2? + G?) 1 ^ 2 ' an ^ we wn l ta ^ e E = -I in what follows. The equations 



-1 ^2 

of motion now have the form 



Pi = 2gi 

Pi = 2Q 2 



2 4-4 + l 

8 Ux { R\ 2 



Q^- A pm 



Qi = - A PiQ\. 



(B.15) 



Individual electron-nucleus collisions at r\ = Q \ = 0 or r 2 = Q\ = 0 no longer 
pose a problem to a numerical integration routine. The equations (B. 15) are sin- 
gular only at the triple collision R\ 2 = 0, i.e., when both electrons hit the nucleus 
at the same time. 

The new coordinates and the Hamiltonian (B.14) are very useful when cal- 
culating trajectories for collinear helium; they are, however, less intuitive as a 
visualization of the three-body dynamics. We will therefore refer to the old coor- 
dinates n, r 2 when discussing the dynamics and the periodic orbits. 

To summarize, we have brought a 3-body problem into a form where the 
2-body collisions have been transformed away, and the phase-space trajectories 
computable numerically. To appreciate the full beauty of what has been attained, 
you have to fast-forward to quantum chaos part of ChaosBook.org; we are 
already 'almost' ready to quantize helium by semiclassical methods. 



fast track: 
chapter 5, p. 99 



B.3 Rectification of maps 



In sect. B.l we argued that nonlinear coordinate transformations can be profitably 
employed to simplify the representation of a flow. We shall now apply the same 
idea to nonlinear maps, and determine a smooth nonlinear change of coordinates 
that flattens out the vicinity of a fixed point and makes the map linear in an open 
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neighborhood. In its simplest form the idea can be implemented only for an iso- 
lated nondegenerate fixed point (otherwise one needs the normal form expansion 
around the point), and only in a finite neighborhood of a point, as the conjugating 
function in general has a finite radius of convergence. In sect. B. 4 we will extend 
the method to periodic orbits. 

B.3.1 Rectification of a fixed point in one dimension 



Consider a 1 -dimensional map x n+ i = f{x n ) with a fixed point at x = 0, with 
stability A = /'(0). If |A| + 1, one can determine the power series for a smooth 
conjugation h(x) centered at the fixed point, h(0) = 0, that flattens out the neigh- 
borhood of the fixed point 



and replaces the nonlinear map f{x) by a linear map % + \ - Ay n . 

To compute the conjugation h we use the functional equation If 1 (Ax) = 
f(h~ l (x)) and the expansions 



Equating the coefficients of j«* on both sides of the functional equation yields 

hk order by order as a function of fi, fo, If h(x) is a conjugation, so is any 

scaling h(bx) of the function for a real number b. Hence the value of H(0) is not 
determined by the functional equation (B. 16); it is convenient to set h'(0) = 1. 

The algebra is not particularly illuminating and best left to computers. In any 
case, for the time being we will not use much beyond the first, linear term in these 
expansions. 

Here we have assumed | A| + 1 . If the fixed point has vanishing k-l derivatives, 
the conjugacy is to the kth normal form. 

In multiple dimensions, A is replaced by the Jacobian matrix, and one has to 
check that the eigenvalues M are non-resonant, that is, there is no integer linear 
relation between the Floquet exponents (5.7). 



B.4 Rectification of a periodic orbit 

In sect. B.3. 1 we have constructed the conjugation function for a fixed point. Here 
we turn to the problem of constructing it for periodic orbits. Each point around the 



exercise B.3 



f{x) = hr l {Ah{x)) 



(B.16) 



f{x) = Ax + x 2 f 2 + x 3 f 3 + ... 
h~ l (x) = x + x 2 h 2 + x 3 /i 3 + . . . . 



(B.17) 
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cycle has a differently distorted neighborhood, with differing second and higher 
order derivatives, so we need to compute a different conjugation function at 
each periodic point x a . We expand the map / around each periodic point along 
the cycle, 

ya(0) = fa(#) ~ = <f>fa,l + <f> 2 fa,2 + ■■■ (B.18) 

where x a is a point on the cycle, f a (<f>) = f(x a + 4>) is centered on the periodic 
orbit, and the index k in f a ^ refers to the kth order in the expansion (B.17). 

For a periodic orbit the conjugation formula (B.16) generalizes to 

fa(<f>) = h- l +l (f;,(0)h a (4>)) , a = 1,2, • • • , n , 

point by point. The conjugationg functions 1\, are obtained in the same way as 
before, by equating coefficients of the expansion (B.17), and assuming that the 
cycle Floquet multiplier A = Y\'aZof'( x a) is not marginal, |A| + 1. The explicit 
expressions for h a in terms of / are obtained by iterating around the whole cycle, 

f{x a + </>) = h~\AhM) + x a . (B.19) 

evaluated at each periodic point a. Again we have the freedom to set H.(0) = 1 for 

F F 6 &v ' remark B. 2 

all a. 

B.4.1 Repeats of cycles 

We have traded our initial nonlinear map / for a (locally) linear map Ay and an 
equally complicated conjugation function h. What is gained by rewriting the map 
/ in terms of the conjugacy function hi Once the neighborhood of a fixed point 
is linearized, the iterates of / are trivialized; from the conjugation formula (B.17) 
one can compute the derivatives of a function composed with itself r times: 

f(x) = h-\A r h(x)). 

One can already discern the form of the expansion for an arbitrary iterate; the an- 
swer will depend on the conjugacy function h(x) computed for a single application 
of mapping /, and all the dependence on the iterate number will be carried by fac- 
tors that are polynomial functions of A r , a considerable simplification. The beauty 
of the idea is difficult to gauge at this stage-an appreciation only sets in when one 
starts computing perturbative corrections, whether in celestial mechanics (where 
the method was born), or quantum or stochastic corrections to 'semiclassical' ap- 
proximations. 

in depth: 

appendix C.4, p. 817 
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Resume 

The dynamical system (At, f) is invariant under the group of all smooth conjuga- 
cies 

(M,f) -> {M',g) = {h{M),hofoh- 1 ). 

This invariance can be used to (i) find a simplified representation for the flow and 
(ii) identify a set of invariants, numbers computed within a particular choice of 
(At, /), but invariant under all At — > h(M) smooth conjugacies. 

The 2D-dimensional phase space of an integrable Hamiltonian system of D 
degrees of freedom is fully foliated by D-tori. In the same spirit, for a uniformly 
hyperbolic, chaotic dynamical system, one would like to transform to a coordinate 
frame in which the stable and unstable manifolds form a set of transversally in- 
tersecting hyper-planes, with the flow everywhere locally hyperbolic. That cannot 
be achieved in general: Fully globally integrable and fully globally chaotic flows 
are a very small subset of all possible flows, a 'set of measure zero' in the world 
of all dynamical systems. 

What we really care about is developing invariant notions for a given dynam- 
ical system. The totality of smooth one-to-one nonlinear coordinate transforma- 
tions h that map all trajectories of a given dynamical system (At,/) onto all tra- 
jectories of dynamical systems (Af ,g*) gives us a huge equivalence class, much 
larger than the equivalence classes familiar from the theory of linear transforma- 
tions. In the theory of Lie groups, the full invariant specification of an object is 
given by a finite set of Casimir invariants. What a good full set of invariants for a 
group of general nonlinear smooth conjugacies might be is not known, but the set 
of all periodic orbits and their Floquet multipliers turns out to be a good start. 



Commentary 

Remark B.l Rectification of flows. See Section 2.2.5 of ref. [10] for a pedagogical 
introduction to smooth coordinate reparameterizations. Explicit examples of transfor- 
mations into canonical coordinates for a group of scalings and a group of rotations are 
worked out. 

Remark B.2 Rectification of maps. The methods outlined above are standard in 
the analysis of fixed points and the construction of normal forms for bifurcations, see for 
example refs. [26, 34, 2, 3, 4, 5, 6, 7, 11]. The geometry underlying such methods is 
elegant, and we enjoyed reading, for example, Percival and Richards [ 8], chaps. 2 and 4 
of Ozorio de Almeida's monograph [9], and, as always, Arnol'd [1]. 

Recursive formulas for the evaluation of derivatives needed to evaluate (B.17) are 
given, for example, in Appendix A of ref. [9]. Section 10.6 of ref. [11] describes in 
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detail the smooth conjugacy that relates the Ulam map (11.5) to the tent map (1 1.4). For 
'negative Schwartzian derivatives,' families of conjugacies of Ulam-type maps, associated 
Lyapunov exponents, continuous measures and further pointers to literature, see ref. [12]. 



Remark B.3 A resonance condition. In the hyperbolic case there is a resonance 
condition that must be satisfied: none of the Floquet exponents may be related by ratios 
of integers. That is, if A Pi i, A p £, . . . , A p j are the Floquet multipliers of the Jacobian 
matrix, then they are in resonance if there exist integers n \ , . . . , n ( / such that 

(A^i) ni (A p , 2 r---(A M ) n " = 1. 

If there is resonance, one may get corrections to the basic conjugation formulas in the 
form of monomials in the variables of the map. (R. Mainieri) 



[ 



Exercises 

B. 1 . Harmonic oscillator in polar coordinates: Given a 
harmonic oscillator (B.2) that follows p = -qandq = p, 
use (B.4) to rewrite the system in polar coordinates (B.3) 
and find equations for r and 6. 

1. Show that the 1 -dimensional state space of the 
rewritten system is the quotient space A4/SO(2). 

2. Construct a Poincare section of the quotiented 
flow. 

B.2. Coordinate transformations. Changing coordinates 
is conceptually simple, but can become confusing when 
carried out in detail. The difficulty arises from con- 
fusing functional relationships, such as x(t) = h (y(t)) 
with numerical relationships, such as w(y) = h'(x)v(x). 
Working through an example will clear this up. 

(a) The differential equation in M is x = {2jc i , JC2 } 
and the change of coordinates from M to M' is 
h(x\,X2) = {2xi +X2,x\ —X2\. Solve for x{t). Find 

h-\ 

(b) Show that in the transformed space M' , the differ- 
ential equation is 



d 


y\ 


l 


5yi + 2j2 


dt 




~ 3 


y\ + 4y 2 



Solve this system. Does it match the solution in 
the M. space? 

B.3. Linearization for maps. Let / : C — > C be a map 

from the complex numbers into themselves, analytic at 
the origin with a fixed point. By manipulating power se- 
ries, find the first few terms of the map h that conjugates 
/ to az, that is, 

f(z) = h-\ah(z)). 

There are conditions on the derivative of / at the origin 
to assure that the conjugation is always possible. For- 
mulate these conditions by examining the series 

(difficulty: medium) (R. Mainieri) 

B.4. Ulam and tent maps. Show that the smooth conju- 
gacy (2.20) 

g(yo) = hofoh^iyo) 
y = h{x) = sin 2 (7rx/2) , 

conjugates the tent map f(x) = 1 — 2\x - 1/2| into the 
Ulam map g(y) - 4y(l-y) . (continued as exercise 13.1) 
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Linear stability 



Mopping up operations are the activities that engage most 
scientists throughout their careers. 



— Thomas Kuhn, The Structure of Scientific Revolu- 
tions 



The subject of linear algebra generates innumerable tomes of its own, and is 
way beyond what we can exhaustively cover. Here we recapitulate a few 
essential concepts that ChaosBook relies on. The punch line is Eq. (C.24): 

Hamilton-Cayley equation ]~[(M - Ail) = 0 associates with each distinct root 
Ai of a matrix M a projection onto ith vector subspace 



C.l Linear algebra 

In this section we collect a few basic definitions. The reader might prefer going 
straight to sect. C.l. 

Vector space. A set V of elements x, y, z, . . . is called a vector (or linear) space 
over a field F if 

(a) vector addition "+" is defined in V such that V is an abelian group under 
addition, with identity element 0; 

(b) the set is closed with respect to scalar multiplication and vector addition 




a(x + y) = ax + ay , 



a,b e F, x, y € V 
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(a + b)x = ax + bx 
a(bx) = (ab)x 
lx = x, ( 



0x = 0. 



(C.l) 



Here the field F is either R, the field of reals numbers, or C, the field of complex 
numbers. Given a subset Vb c V, the set of all linear combinations of elements of 
Vq, or the span of Vq, is also a vector space. 

A basis, {e^\ • • • , e ( ^} is any linearly independent subset of V whose span is V. 
The number of basis elements d is the dimension of the vector space V. 

Dual space, dual basis. Under a general linear transformation g e GL(n, F), the 
row of basis vectors transforms by right multiplication as cP = E/tCg -1 )' 7 * e > 
and the column of ^'s transforms by left multiplication as x" - gx. Under 
left multiplication the column (row transposed) of basis vectors fy) transforms 
as ey) = (g')j k e(k), where the dual rep = (g~') r is the transpose of the inverse 
of g. This observation motivates introduction of a dual representation space V, 
the space on which GL(n, F) acts via the dual rep g^ . 

Definition. If V is a vector representation space, then the dual space V is the set 
of all linear forms on V over the field F. 

If |e (1) , • • • , e (rf) } is a basis of V, then V is spanned by the dual basis {e(i), • • • , e^}, 
the set of d linear forms e^) such that 



where 5 k is the Kronecker symbol, = 1 if j - k, and zero otherwise. The 
components of dual representation space vectors y e V will here be distinguished 
by upper indices 



For GL(n,¥) no complex conjugation is implied by the * notation; that interpre- 
tation applies only to unitary subgroups U(n) c GL(n, C). In the index notation, 
g can be distinguished from g^ by keeping track of the relative ordering of the 
indices, 



e (i) • e 




(v',v- v"). 



(C2) 



They transform under GL(n, F) as 



y fl = (g T )V- 



(C.3) 



(g): 



ga 



b 



(C.4) 



a 
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Algebra. A set of r elements t a of a vector space T forms an algebra if, in 
addition to the vector addition and scalar multiplication, 

(a) the set is closed with respect to multiplication T • T —> T, so that for any 
two elements t a , tp e T, the product ta • tp also belongs to T: 

r-l 

ta-fc = £W VeC; (C.5) 

7=0 

(b) the multiplication operation is distributive: 

(tor + tys) • ty = t a ■ ty + tp ■ ty 
ta ' (t/3 + ty) = t a • tp + t a • ty . 

The set of numbers x a p y are called the structure constants. They form a matrix 
rep of the algebra, 

(ta)/ = V , (C.6) 

whose dimension is the dimension r of the algebra itself. 

Depending on what further assumptions one makes on the multiplication, one 
obtains different types of algebras. For example, if the multiplication is associative 

(ta • tp) • ty = ta • (tp ■ ty) , 

the algebra is associative. Typical examples of products are the matrix product 



da ■ tpf a = (t a ) b a (tp)l , ta £ V ® V , (C.7) 



and the Lie product 



da ■ tpf a = (t a ) b a (tpf b - (t a ) h c (tp) a b , ta e V ® V (C.8) 



which defines a Lie algebra. 

C.2 Eigenvalues and eigenvectors 

Eigenvalues of a [dxd] matrix M are the roots of its characteristic polynomial 
det (M - ^1) - F[(^ - A) - 0 . (C.9) 
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Given a nonsingular matrix M, with all A, + 0, acting on <i-dimensional vectors 
x, we would like to determine eigenvectors e® of M on which M acts by scalar 
multiplication by eigenvalue A, 

Me® = ^e®. (CIO) 



If Aj + Aj, e w and e (7) are linearly independent. There are at most d distinct 
eigenvalues and eigenspaces, which we assume have been computed by some 
method, and ordered by their real parts, Re Aj > Re Ai+\ . 



If all eigenvalues are distinct e ( ^ are d linearly independent vectors which can 
be used as a (non-orthogonal) basis for any J-dimensional vector xel 1 

x = Xi e (1) + x 2 e (2) + • • • + x d e (d) . (CM) 



From (C. 10) it follows that 

matrix (M - Ajl) annihilates e®, the product of all such factors annihilates any 
vector, and the matrix M satisfies its characteristic equation (C.9), 

d 

"[(M-^1) = 0. (C.12) 

!=1 



This humble fact has a name: the Hamilton-Cayley theorem. If we delete one term 
from this product, we find that the remainder projects x onto the corresponding 
eigenspace: 



~[(M - Ajl)x - Y\(Ai - Aj)xfiF> . 



Dividing through by the (Ai - Aj) factors yields the projection operators 

i—r M - Ajl 

'• 11^— <C13) 

J*' 3 
which are orthogonal and complete: 

r 

V i Vj = 5 ij Vj, (no sum on j), Yj V=1 (C14) 

!=1 
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It follows from the characteristic equation (C.12) that Aj is the eigenvalue of M on 
P, subspace: 

M P, = AjPi (no sum on i) . (C. 15) 

Using M = M1 and completeness relation (C.14) we can rewrite M as 

M = A l V l +A 2 V 2 + --- + A d V d . (C.16) 

Any matrix function /(M) takes the scalar value f(A) on the P, subspace, /(M) P, = 
f(Ai) P, , and is thus easily evaluated through its spectral decomposition 

/(M) = ^/(^)Pi. (C.17) 



This, of course, is the reason why anyone but a fool works with irreducible reps: 
they reduce matrix (AKA "operator") evaluations to manipulations with numbers. 

By (C.10) every column of P; is proportional to a right eigenvector e'\ and 
its every row to a left eigenvector e^. In general, neither set is orthogonal, but by 
the idempotence condition (C.14), they are mutually orthogonal, 



e© -e 0) = c8\. (C.18) 

The non-zero constant c is convention dependent and not worth fixing, unless you 
feel nostalgic about Clebsch-Gordan coefficients. We shall set c = 1 . Then it is 
convenient to collect all left and right eigenvectors into a single matrix as follows. 



Fundamental matrix. The set of solutions x(i) = /(jco)jco f° r a system of ho- 
mogeneous linear differential equations x(t) = A(t)x(t) of order 1 and dimension 
d forms a J-dimensional vector space. A basis {4 l \t), . . . ,e (rf) (?)} for this vector 
space is called a. fundamental system. Every solution x(t) can be written as 



c(f) = Jc/e®(t). 



The [dxd] matrix F7. 1 = er whose columns are the right eigenvectors of / 

FitT 1 = (e (1) (0, . . . ,e (rf) (0) , F(t) T - (e ( i)(0, . . . ,e (rf) (0) (C19) 
is the inverse of a fundamental matrix. 
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Jacobian matrix. The Jacobian matrix /(xq) is the linear approximation to a 
differentiable function/ (xo), describing the orientation of a tangent plane to the 
function at a given point and the amount of local rotation and shearing caused 
by the transformation. The inverse of the Jacobian matrix of a function is the 
Jacobian matrix of the inverse function. If / is a map from <i-dimensional space 
to itself, the Jacobian matrix is a square matrix, whose determinant we refer to as 
the 'Jacobian.' 

The Jacobian matrix can be written as transformation from basis at time ft to 
the basis at time t\ , 



Then the matrix form of (C.18) is F(?)F(?) 1 = 1, i.e., for zero time the Jacobian 
matrix is the identity. 



Example C.l Fundamental matrix. If A is constant in time, the system (4.2) is 
autonomous, and the solution is 

x(t) = e A 'x(0), 

where exp(A t) is defined by the Taylor series forexp(x). As the system is linear, the sum 
of any two solutions is also a solution. Therefore, given d independent initial conditions, 
xi(0), xz(0), . . . x d (Q) we can write the solution for an arbitrary initial condition based on 
its projection on to this set, 

x(t) =F(*)F(0r 1 Jc(0) = e At , 

where F(f) = (x\(t), x 2 (t), x d (t)) is a fundamental matrix of the system. (J. Halcrow) 



Example C.2 Complex eigenvalues. As M has only real entries, it will in general 
have either real eigenvalues, or complex conjugate pairs of eigenvalues. That is not 
surprising, but also the corresponding eigenvectors can be either real or complex. All 
coordinates used in defining a dynamical flow are real numbers, so what is the meaning 
of a complex eigenvector? 

If A k ,A k+i eigenvalues that lie within a diagonal [2x2] sub-block M' c M 
form a complex conjugate pair, {A k ,A k+l } = {/j. + ico,fi - ito], the corresponding com- 
plex eigenvectors can be replaced by their real and imaginary parts, {e <k \e (k+l) } -> 
{Ree w ,Ime w }. In this 2-dimensional real representation, M' — > N, the block N is a 
sum of the rescalingxidentity and the generator of SO(2) rotations 



j ti - to (xo) = mw(to) 



(C.20) 



exercise C.1 



exercise C.1 




Trajectories ofx-Nx, given by x(t) = J' 



x(0), where 




(C.21) 
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spiral in/out around (x,y) = (0,0), see figure 4.3, with the rotation period T and the 
radial expansion /contraction multiplier along the e 0) eigen-direction per a turn of the 
spiral: 

T = 2nlu>, AroOM = e T » . (C.22) 

We learn that the typical turnover time scale in the neighborhood of the equilibrium 
(x,y) = (0, 0) is of order * T (and not, let us say, 1000 T, or 10~ 2 T). 



Degenerate eigenvalues. While for a matrix with generic real elements all eigen- 
values are distinct with probability 1, that is not true in presence of symmetries, 
or spacial parameter values (bifurcation points). What can one say about situation 
where d 0 eigenvalues are degenerate, = Aj = Ai+\ = • • • = /t i+< 4_i? Hamilton- 
Cayley (C.12) now takes form 



~[(M-A a l) d ° =0, Yj d <* = d - (C23) 

a=\ a 

We distinguish two cases: 



M can be brought to diagonal form. The characteristic equation (C.23) can be 
replaced by the minimal polynomial, 



f](M-^l) = 0, (C.24) 

where the product includes each distinct eigenvalue only once. Matrix M acts 
multiplicatively 



Mt {a ' k) = Ait (a ' k) , (C.25) 

on a d a -dimensional subspace spanned by a linearly independent set of basis 
eigenvectors \4 a,Y) ,^ a,T) , • • • , e^'^}. This is the easy case whose discussion we 
continue in appendix 1.2.1. Luckily, if the degeneracy is due to a finite or compact 
symmetry group, relevant M matrices can always be brought to such Hermitian, 
diagonalizable form. 



M can only be brought to upper-triangular, Jordan form. This is the messy 
case, so we only illustrate the key idea in exampleC.3. 
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Example C.3 Decomposition of 2-dimensional vector spaces: Enumeration of 
every possible kind of linear algebra eigenvalue I eigenvector combination is beyond 
what we can reasonably undertake here. However, enumerating solutions for the sim- 
plest case, a general [2x2] non-singular matrix 

M = ( M U M n ) 

takes us a long way toward developing intuition about arbitrary finite-dimensional ma- 
trices. The eigenvalues 

A h2 = ^trM± - V(trM) 2 -4detM (C.26) 

are the roots of the characteristic (secular) equation (C.9): 

det(M-^l) = (Ai-A)(A 2 -A) 

= ^ 2 -trM^ + detM = 0. 



Distinct eigenvalues case has already been described in full generality. The left/right 
eigenvectors are the rows/columns of projection operators 

M-A 2 1 M-Ail 

Pi = — , Pi = — , A X ±A 2 . (C.27) 

A\ — A2 A2 — Ai 



Degenerate eigenvalues. If A l - A 2 - A, we distinguish two cases: (a) M can be 
brought to diagonal form. This is the easy case whose discussion in any dimension we 
continue in appendix 1.2. 1. (b) M can be brought to Jordan form, with zeros everywhere 
except for the diagonal, and some 1's directly above it; for a [2x2] matrix the Jordan 
form is 



M 



Hoi), "-(i)- *"-(?) 



v (2) helps span the 2-dimensional space, (M - A) 2 \ {2) = 0, but is not an eigenvector, as 
Mv (2) = A\ {2) + e (1) . For every such Jordan [d 0 xd a ] block there is only one eigenvector 
per block. Noting that 



M'" = 



A'" mA'"- 1 
0 A m 



we see that instead of acting multiplicatively on R 2 , Jacobian matrix J' = exp(fM) 



(M "I _ JA 



V 



u + tv\ 

(C.28) 



picks up a power-low correction. That spells trouble (logarithmic term \nt if we bring 
the extra term into the exponent). 
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Example C.4 Projection operator decomposition in 2 dimensions: Let's illus- 
trate how the distinct eigenvalues case works with the [2x2] matrix 




Its eigenvalues [A\, A 2 } = {5, 1} are the roots of (C.26): 

det (M - Al) = A 2 - 6 A + 5 = (5 - A)( 1 - A) = 0 . 

That M satisfies its secular equation (Hamilton-Cayley theorem) can be verified by 
explicit calculation: 

(t l)'- 6 (t 2) +5 (i ?H° o)- 

Associated with each root A, is the projection operator (C.27) 

Pl = i (M " 1)= i(3 I) (C29) 
P 2 = l(M-5.1)= l -(} 3 (C.30) 

Matrices P, are orthonormal and complete, The dimension of the ith subspace is given 
by di = trP, ; in case at hand both subspaces are 1 -dimensional. From the charac- 
teristic equation it follows that P, satisfies the eigenvalue equation MP, = A,-Pj . Two 
consequences are immediate. First, we can easily evaluate any function ofM by spec- 
tral decomposition, for example 

H'-s.i-cs'-w+ci-ioft-Gg; [mi). 

Second, as P, satisfies the eigenvalue equation, its every column is a right eigenvector, 
and every row a left eigenvector. Picking first row/column we get the eigenvectors: 

M - { (\),(_\) 
{e (1) ,e (2) } - {(3 1) ,( 1 -1)}, 

with overall scale arbitrary.The matrix is not hermitian , so {e 0) } do not form an orthog- 
onal basis. The left-right eigenvector dot products e (/) • e w , however, are orthogonal 
as in (C. 18), by inspection. 



Example C.5 Computing matrix exponentials. If A is diagonal (the system is 
uncoupled), then e' A is given by 



(A\t 



exp 



A 2 t 



A d t) 



If A is diagonalizable, A = FDF 1 , where D is the diagonal matrix of the eigen- 
values of A and F is the matrix of corresponding eigenvectors, the result is simple: 
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Table C.l: The first 27 least stable Floquet exponents A = /j + iw of equilibrium EQ 5 for plane 
Couette flow, Re = 400. The exponents are ordered by the decreasing real part. The two zero expo- 
nents, to the numerical precision of our computation, arise from the two translational symmetries. 
For details, see ref. [1]. 



j 


0) 


. .0") 

EQ5 


•51*2*3 


1,2 


0.07212161 


0.04074989 


s s s 


3 


0.06209526 




SAA 


4 


0.06162059 




ASA 


5,6 


0.02073075 


0.07355143 


s s s 


7 


0.009925378 




SAA 


8,9 


0.009654012 


0.04551274 


AAS 


10,11 


0.009600794 


0.2302166 


SAA 


12,13 


1.460798e-06 


1.542103e-06 


- - A 


14,15 


-0.0001343539 


0.231129 


AAS 


16 


-0.006178861 




ASA 


17,18 


-0.007785718 


0.1372092 


AAS 


19 


-0.01064716 




SAA 


20,21 


-0.01220116 


0.2774336 


s s s 


22,23 


-0.01539667 


0.2775381 


SAA 


24,25 


-0.03451081 


0.08674062 


ASA 


26,27 


-0.03719139 


0.215319 


SAA 



A" = (FDF ^iFDF 1 ) . . . (FDF 1 ) = FD"F l . Inserting this into the Taylor series for 
e x gives e At = Fe Dt F-\ 

But A may not have d linearly independant eigenvectors, making F singular and 
forcing us to take a different route. To illustrate this, consider [2x2] matrices. For any 
linear system in R 2 , there is a similarity transformation 

B = U^AU, 

where the columns of U consist of the generalized eigenvectors of A such that B has 
one of the following forms: 




These three cases, called normal forms, correspond to A having ( 1 ) distinct real eigen- 
values, (2) degenerate real eigenvalues, or (3) a complex pair of eigenvalues. It follows 
that 

Et_{e M 0\ e B<_ e A,(l t\ Bt _ t (cosbt -sinfcA 

\0 ef')' e M0 IT [sinbt cosbt)' 

ande A ' = Ue B 'U~ l . What we have done is classify all '[2*2] matrices as belonging to one 
of three classes of geometrical transformations. The first case is scaling, the second 
is a shear, and the third is a combination of rotation and scaling. The generalization of 
these normal forms to R d is called the Jordan normal form. (J. Halcrow) 



appcndStability - 23jan2012 



ChaosBook.org version!4.4. 1, Apr 21 2013 



APPENDIX C. LINEAR STABILITY 



816 



Figure C.l: Eigenvalues of the plane Couette flow 
equilibrium EQ$, plotted according to their isotropy 

groups: • + + +, the S -invariant subspace, ► H , 

■4 — i — , and A h where ± symbols stand for 

symmetric/antisymmetric under symmetry operation 
Si, s 2 , and s 3 respectively, defined in ref. [2]. For ta- 
bles of numerical values of stability eigenvalues see 
Channelflow.org. 



C.3 Eigenspectra: what to make out of them? 

Well Mack the Finger said to Louie the King 
I got forty red white and blue shoe strings 
And a thousand telephones that don't ring 
Do you know where I can get rid of these things? 
— Bob Dylan, Highway 61 Revisited 

Table C.l, taken from ref. [1], is an example of how to tabulate the leading 
Floquet eigenvalues of the stability matrix of an equilibrium or relative equilib- 
rium. The isotropy subgroup of the corresponding eigenfunction should be 

indicated. If the isotropy is trivial, G^q = {e}, it is omitted from the table. The 
isotropy subgroup Geq of the solution itself needs to be noted, and for relative 
equilibrium (10.30) the velocity c along the group orbit. In addition, if the least 
stable (i.e., the most unstable) eigenvalue is complex, it is helpful to state the 
period of the spiral-out motion (or spiral-in, if stable), Teq = 2/r/a>£g. 

Table C.l, taken from ref. [3], is an example of how to tabulate the leading 
Floquet exponents of the monodromy matrix of an periodic orbit or relative pe- 
riodic orbit. For a periodic orbit one states the period T p , A p = T\ A Pie , and the 
isotropy group G p of the orbit; for a relative periodic orbit (10.35) one states in 
addition the shift parameters (p = {<pi,(f>z,- " 4>n)- A p , the product of expanding 
Floquet multipliers (5.9) is useful, as l/|A p | is the geometric weight of cycle p 
in a cycle expansion (remember that each complex eigenvalue contributes twice). 
We often do care about cr p - A p j/\A p j\ e {+1, -1}, the sign of the jth Floquet 
multiplier, or, if A p j is complex, its phase T p or p . 

Surveying this multitude of equilibrium and Floquet exponents is aided by a 
plot of the complex exponent plane (ji,a>). An example are the eigenvalues of 
equilibrium EQg from ref. [2], plotted in figure C.l. To decide how many of 
the these are "physical" in the PDE case (where number of exponents is always 
infinite, in principle), it is useful to look at the (j',// ;) ) plot. However, intelli- 
gent choice of the 7-axis units can be tricky for high-dimensional problems. For 
Kuramoto-Sivashinsky system the correct choice are the wave-numbers which, 
due to the 0(2) symmetry, come in pairs. For plane Couette flow the good choice 
is not known as yet; one needs to group 0(2) x 0(2) wave-numbers, as well as 
take care of the wall-normal node counting. 



4 

A 

► A 

• ► 


EQ 8 

► 

► ► A • • 


► A 
A 

•4 


4 m 

► 



-0.05 0 0.05 0.1 
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Table C.2: The first 13 least stable Floquet exponents A = p ± ico of periodic orbit p = P59.77 for 
plane Couette flow, Re = 400, together with the symmetries of corresponding eigenvectors. The 
eigenvalues are ordered by the decreasing real part. The one zero eigenvalue, to the numerical pre- 
cision of our computation, arises from the spanwise translational SO(2) symmetry of this periodic 
orbit. For details, see ref. [3]. 



j 


u p 


ft 


. .(;■) 


°p 


1,2 




0.07212161 


0.04074989 


D, 


3 


1 


0.06209526 




9 


4 


-1 


0.06162059 






5,6 




0.02073075 


0.07355143 




7 


-1 


0.009925378 






8,9 




0.009654012 


0.04551274 




10,11 




0.009600794 


0.2302166 





C.4 Stability of Hamiltonian flows 



(M.J. Feigenbaum and P. Cvitanovic) 

The symplectic structure of Hamilton's equations buys us much more than the 
incompressibility, or the phase space volume conservation alluded to in sect.7.1. 
The evolution equations for any p, q dependent quantity Q = Q(q, p) are given by 
(16.31). 

In terms of the Poisson brackets, the time-evolution equation for Q = Q(q, p) 
is given by (16.33). We now recast the symplectic condition (7.9) in a form con- 
venient for using the symplectic constraints on M. Writing x{t) = i = [p',q'] 
and the Jacobian matrix and its inverse 



dq dp 
d£ d£ 
dq dp 



M = \ || % , Mr 





I dq 


dq \ 


1 








% 


X 




V dq' 


dp' ) 



(C.31) 



we can spell out the symplectic invariance condition (7-9): 



dqi dqj dqi dqj 



dq[dp[_dp^dq[ 
dpidpj dpidpj 

dq t dp j dqt dp j 



= 0 

= dij. (C.32) 



From (7.26) we obtain 

djnjj^ dp i = d _i dq L = Jj) d_p L = _d_P) (C33) 
dq'j dpi ' dp'j dq t ' dp'j dp { ' dq'j dq { ' 
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Taken together, (C.33) and (C.32) imply that the flow conserves the {p,q} Poisson 
brackets 



, , _ dqj dqj dqj dqj _ n 
{qi > q j ] = d P ' kd q' k -dp' k H =0 

{ Pi ,Pj} = 0, {pi,qj} = Sij, (C.34) 



i.e., the transformations induced by a Hamiltonian flow are canonical, preserving 
the form of the equations of motion. The first two relations are symmetric under 
i, j interchange and yield D(D - l)/2 constraints each; the last relation yields T3 
constraints. Hence only (2D) 2 - 2D(D - l)/2 - D 2 = d(2D + 1) elements of M 
are linearly independent, as it behooves group elements of the symplectic group 
S p(2D). 



C.5 Monodromy matrix for Hamiltonian flows 

(G. Tanner) 

It is not the Jacobian matrix J of the flow (4.5), but the monodromy matrix M, 
which enters the trace formula. This matrix gives the time dependence of a dis- 
placement perpendicular to the flow on the energy manifold. Indeed, we discover 
some trivial parts in the Jacobian matrix J. An initial displacement in the direc- 
tion of the flow x = a)VH(x) transfers according to 6x(t) = Xf(t)6t with 5t time 
independent. The projection of any displacement on 8x on VH(x) is constant, i.e., 
VH(x(t))Sx(t) = 6E. We get the equations of motion for the monodromy matrix 
directly choosing a suitable local coordinate system on the orbit x(t) in form of 
the (non singular) transformation U(x(t)): 

J{x{t)) = IT 1 MO) J(x(t)) U«0)) (C.35) 

These lead to 



J = LJ 

with L = U _1 (LU-U) (C.36) 



Note that the properties a) - c) are only fulfilled for / and L if U itself is symplec- 
tic. 

Choosing xe = VH(t)/\VH(t)\ 2 and x t as local coordinates uncovers the two 
trivial eigenvalues 1 of the transformed matrix in (C.35) at any time t. Setting 
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U , Xy , . . . , Xnj_ 2) §ivcs 



7 = 



^ 1 * * 

0 1 0 

0 * 

1 0 * 



0 



M 



L = 



( 0 * * ... * ^ 
0 0 0 ... 0 

0 * 



1 



I 0 * 



(C.37) 



The matrix M is now the monodromy matrix and the equation of motion are given 
by 



M = 1M. 



(C.38) 



The vectors x\ , . . . , *2c/-2 must span the space perpendicular to the flow on the 
energy manifold. 

For a system with two degrees of freedom, the matrix U(t) can be written 
down explicitly, i.e., 



U(f) = (x t ,xi,x E ,x 2 ) = 



x —y -u/cf -v/q 

y x -v/q 2 ii/q 2 

u v x/q 2 —jlq 2 

v -u y/q 2 x/q 2 



(C.39) 



with x T = (x,y;u, v) and q = |V//| = \x\. The matrix U is non singular and 
symplectic at every phase space point x, except the equilibrium points x = 0. The 
matrix elements for 1 are given (C.41). One distinguishes 4 classes of eigenvalues 
of M. 



• stable or elliptic, if A = e ±OTV and v e]0, 1[. 

• marginal, if A = ± 1 . 

• hyperbolic, inverse hyperbolic, if A = e ±A , A = —e ±A . 

• loxodromic, if A = g*^*'" with ju and u real. This is the most general case, 
possible only in systems with 3 or more degree of freedoms. 

For 2 degrees of freedom, i.e., M is a [2x2] matrix, the eigenvalues are determined 
by 



A = 



tr(M)± Vtr(M) 2 -4 



(C.40) 



i.e., tr(M) = 2 separates stable and unstable behavior. 
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The 1 matrix elements for the local transformation (C.39) are 

In-- [(hi - h 2 -hl + h 2 ,)(h xu - hyy) + 2(h x h y - h u h v )(h xv + h yu ) 
*? 

-(h x h u + h y h v )(h xx + hyy - h uu - h n ,)] 
I12 = — [(hi + h 2 v ){h yy + h uu ) + {h 2 y + h 2 u )(h xx + h m ) 

-2(h x h u + h y h v ){h xu + h yv ) - 2(h x h y - hJi^Qi^ - h uv )] 
hi = ~(h 2 x + h])(h uu + h vv ) - (hi + h 2 v ){h xx + h yy ) 

+2(h x h u - h y h v )(h xu - hyy) + 2(h x h v + hyh^Qi^, + h yu ) 
1 22 = -T n , (C.41) 

with hi, hij is the derivative of the Hamiltonian H with respect to the phase space 
coordinates and q = \VH\ 2 . 
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Exercises 



C. 1 . Real representation of complex eigenvalues. (Ver- 
ification of example C.2.) Ak,Ak+i eigenvalues form a 
complex conjugate pair, {Ak,Ak+\) — \p- + icj,/J - icj). 
Show that 

(a) corresponding projection operators are complex 
conjugates of each other, 

P = p, , p* = p M , 

where we denote Pa by P for notational brevity. 

(b) P can be written as 

p = i(R + «Q), 



where R = Pa + Pa+i and Q are matrices with real 
elements. 

(d) ■ ■ •+AjfcPjt+AJPjfc + i +■ ■ ■ complex eigenvalue pair in 
the spectral decomposition (C.16) is now replaced 
by a real [2x2] matrix 

jj ~ Ll> \( r n 



< T)(q) 



or whatever you find the clearest way to write this 
real representation. 
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Appendix D 
Finding cycles 



(C. Chandre) 

D.l Newton-Raphson method 
D.l.l Contraction rate 

Consider a J-dimensional map x' - fix) with an unstable fixed point x*. The 
Newton-Raphson algorithm is obtained by iterating the following map 

x' =g(x) = x-(J(x)-\y l (f(x)-x). 

The linearization of g near x* leads to 

x* + e' = + e - (/(x*) - l)" 1 (/(x*) + J(x*)e - x* - e) + O (||e|| 2 ) , 
where e = x - x*. Therefore, 
x' - x* = O ((x - x*) 2 ) . 

After n steps and if the initial guess xq is close to x t , the error decreases 
super-exponentially 

g n (x 0 ) - x* - O ((*o - x*) 2 ") . 
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D.1.2 Computation of the inverse 

The Newton-Raphson method for finding «-cycles of cf-dimensional mappings 
using the multi-shooting method reduces to the following equation 



( 1 -Df(x n ) \ 


( 5 i 1 






-Df( Xl ) 1 

1 


<5 2 




F 2 


-Df(x n -i) 1 


, 8n ) 




<. F n , 



(D.l) 



where Df{x) is the [d x d] Jacobian matrix of the map evaluated at the point x, 
and 5 m - x' m - x m and F m = x m - f{x m ^{) are J-dimensional vectors. By some 
starightforward algebra, the vectors 6, n are expressed as functions of the vectors 
F ■ 

1 in- 



k=l 



U=l 



(D.2) 



for m - 1 , . . . , n, where fi^m = Df(x m )Df(x m - 1) • • ■ Df{x^) for k <m and fy,m - 1 
for k > m. Therefore, finding ^-cycles by a Newton-Raphson method with multi- 
ple shooting requires the inversing of a [dxd] matrix l-Df(x l )Df(x n -i)- ■ ■ Df{x\). 



D.2 Hybrid Newton-Raphson / relaxation method 



Consider a <i-dimensional map x" = f(x) with an unstable fixed point x* . 
The transformed map is the following one: 



*' = g(x) = * + yC(f(x) - x), 

where y > 0 and C is a d x d invertible constant matrix. We note that x* is also a 
fixed point of g. Consider the stability matrix at the fixed point x, 



A - d J- 
Ag ~ dx 



\+yC{A f -\). 



The matrix C is constructed such that the eigenvalues of Ag are of modulus less 
than one. Assume that Af is diagonalizable: In the basis of diagonalization, the 
matrix writes: 



4 = l+rC(A/-l), 
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Figure D.l: Illustration of the 
tion. The original section y = 0 
x-f(x) for the Newton iteration 
is y = 0.7. 



2.5 



optimal Poincare sec- 
yields a large distance 
. A much better choice 




-1.5 ' ' ' ' 1 ' 1 

0 0.2 0.4 0.6 0.8 1 1.2 



where A/ is diagonal with elements fij. We restrict the set of matrices C to diag- 
onal matrices with Cu = e, where e,- = ±1. Thus A g is diagonal with eigenvalues 
ji = 1 + yeiQii - 1). The choice of y and e,- is such that \y,-\ < 1. It is easy to see 
that if Re(/i ( ) < 1 one has to choose q — 1, and if ReijUj) > 1, e\ = -1. If /I is 
chosen such that 



. 2|Re(Ai,-)- 1| 
0 < 7 < mm — ; — j — 



i=i,...,d 

all the eigenvalues of A g have modulus less that one. The contraction rate at the 
fixed point for the map g is then max, |1 + ye,(/i,- - 1)|. If Re(jU,) = 1, it is not 
possible to stabilize x* by the set of matrices yC. 

From the construction of C, we see that 2 1 choices of matrices are possible. For 
example, for 2-dimensional systems, these matrices are 



C e 



10\ /-10\ /l 0 W-l 0 
OlJ'l 0 lj'lo-l j'l 0 -1 



For 2-dimensional dissipative maps, the eigenvalues satisfy Re(jUi)Re(jt/2) < det Df < 
1. The case (Re(/ii) > l,Re(//2) > 1) which is stabilized by ("q 1 has to be dis- 
carded. The minimal set is reduced to three matrices. 



D.2.1 Newton method with optimal section 

(F. Christiansen) 

In some systems it might be hard to find a good starting guess for a fixed point. 
This can happen, for example, if the topology and/or the symbolic dynamics of 
the flow is not well understood. By changing the Poincare section one might get 
a better initial guess in the sense that x and f(x) are closer together. We illustrate 
this in figure D.l. The figure shows a Poincare section, y = 0, an initial guess x, 
the corresponding f(x) and pieces of the trajectory near these two points. 

If Newton iteration does not converge for the initial guess x we might have to 
work very hard to find a better guess, particularly if this is in a high-dimensional 
system (high-dimensional in this context might mean a Hamiltonian system with 3 
or more degrees of freedom). Clearly, we could easily obtain a much better guess 
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by simply shifting the Poincare section to y = 0.7 where the distance x - f(x) 
would be much smaller. Naturally, one cannot so easily determine by inspection 
the best section for a higher dimensional system. Rather, the way to proceed is 
as follows: We want to have a minimal distance between our initial guess x and 
its image f(x). We therefore integrate the flow looking for a minimum in the 
distance d(i) - \f{x) - x\. d{t) is now a minimum with respect to variations in 
f\x), but not necessarily with respect to x. We therefore integrate x either forward 
or backward in time. Doing this minimizes d with respect to x, but now it is no 
longer minimal with respect to f(x). We therefore repeat the steps, alternating 
between correcting x and f(x). In most cases this process converges quite rapidly. 
The result is a trajectory for which the vector (f(x) - x) connecting the two end 
points is perpendicular to the flow at both points. We can now define a Poincare 
section as the hyper-plane that goes through x and is normal to the flow at x, 
(x' - x) ■ v(x) = 0. 

The image f(x) lies in the section. This section is optimal in the sense that 
a close return on the section is a local minimum of the distance between x and 
f'(x). More important, the part of the stability matrix that describes linearization 
perpendicular to the flow is exactly the stability of the flow in the section when 
f(x) is close to x. With this method, the Poincare section changes with each New- 
ton iteration. Should we later want to put the fixed point on a specific Poincare 
section, it will only be a matter of moving along the trajectory. 
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Symbolic dynamics techniques 



The kneading theory for unimodal mappings is developed in sect. E. 1 . The 
prime factorization for dynamical itineraries of sect. E. 2 illustrates the sense 
in which prime cycles are "prime" - the product structure of zeta functions 
is a consequence of the unique factorization property of symbol sequences. 



E.l Topological zeta functions for infinite subshifts 

(P. Dahlqvist) 

The transition graph methods outlined in chapter 1 1 are well suited for 
symbolic dynamics of finite subshift type. A sequence of well defined rules leads 
to the answer, the topological zeta function, which turns out to be a polynomial. 
For infinite subshifts one would have to go through an infinite sequence of graph 
constructions and it is of course very difficult to make any asymptotic statements 
about the outcome. Luckily, for some simple systems the goal can be reached by 
much simpler means. This is the case for unimodal maps. 

We will restrict our attention to the topological zeta function for unimodal 
maps with one external parameter f\(x) = Ag(x). As usual, symbolic dynamics is 
introduced by mapping a time series . . . X{-\XiXi+\ . . . onto a sequence of symbols 
. . . Si-iSiSi+i . . . where 

s i — 0 Xi ^ Xq 
Sj — C xi — x c 

Si = 1 Xi > x c (E.l) 

and x c is the critical point of the map (i.e., maximum of g). In addition to the usual 
binary alphabet we have added a symbol C for the critical point. The kneading 
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/(C) 


C(z)/(!-z) 


1(C) 


CUz)/(l-z) 


1C 




1001C 




101C 




1001 11C 




1011101C 




1001 1C 




H°°(l) 


rr°° (1 - z 2"\ 


1001 IOC 




10111C 




100C 




1011111C 




1000 IOC 




101°° 


(1 -2z 2 )/(l +z) 


10001C 




10111111C 




10001 1C 




101111C 




1000C 




1011C 




100001C 




101101C 




10000C 




IOC 


d-z-z 2 ) 


100000C 




10010C 




10°° 


(l-2z)/(l-z) 


100101C 









Table E.l: All ordered kneading sequences up to length seven, as well as some longer kneading 
sequences. Harmonic extension H°°(l) is defined below. 



sequence is the itinerary of the critical point (11.13). The crucial observation 
is that no periodic orbit can have a topological coordinate (see sect.E.1.1) beyond 
that of the kneading sequence. The kneading sequence thus inserts a border in 
the list of periodic orbits (ordered according to maximal topological coordinate), 
cycles up to this limit are allowed, all beyond are pruned. All unimodal maps 
(obeying some further constraints) with the same kneading sequence thus have the 
same set of periodic orbitsand the same topological zeta function. The topological 
coordinate of the kneading sequence increases with increasing A. 

The kneading sequence can be of one of three types 



1. It maps to the critical point again, after n iterations. If so, we adopt the 
convention to terminate the kneading sequence with a C, and refer to the 
kneading sequence as finite. 

2. Preperiodic, i.e., it is infinite but with a periodic tail. 

3. Aperiodic. 



As an archetype unimodal map we will choose the tent map 



. Ax xe [0,1/2] 

*»m = \ M1 _ x) xe (1/2,1] • W 



where the parameter A e (1,2]. The topological entropy is h = log A. This 
follows from the fact any trajectory of the map is bounded, the escape rate is 
strictly zero, and so the dynamical zeta function 

i/f<z)= n('-i^i)=n('-(f)'>>«»pWA) 
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has its leading zero at z = 1 ■ 

The set of periodic points of the tent map is countable. A consequence of this 
fact is that the set of parameter values for which the kneading sequence (1 1. 13) is 
periodic or preperiodic are countable and thus of measure zero and consequently 
the kneading sequence is aperiodic for almost all A. For general unimodal maps 
the corresponding statement is that the kneading sequence is aperiodic for almost 
all topological entropies. 

For a given periodic kneading sequence of period n,K A = PC = 
s\S2 ... s n -\C there is a simple expansion for the topological zeta function. Then 
the expanded zeta function is a polynomial of degree n 



and ao = 1 . 

Aperiodic and preperiodic kneading sequences are accounted for by simply 
replacing n by oo. 

Example. Consider as an example the kneading sequence K\ = IOC. From 
(E.3) we get the topological zeta function l/4, p (z) - (1 _ z)(l — z — z ), see 
table E. 1 . This can also be realized by redefining the alphabet. The only forbidden 
subsequence is 100. All allowed periodic orbits, except 0, can can be built from 
a alphabet with letters 10 and I. We write this alphabet as (10, J; 0}, yielding the 
topological zeta function l/£ 0 p(z) = (1 - z)(l - z - z 2 ). The leading zero is the 
inverse golden mean zo = ( V5 - l)/2. 

Example. As another example we consider the preperiodic kneading se- 
quence K/^ = 101°°. From (E.3) we get the topological zeta function l/£ 0 p(z) = 
(1 - z)(l - 2z 2 )/(l + z), see table E.l. This can again be realized by redefin- 
ing the alphabet. There are now an infinite number of forbidden subsequences, 
namely 101 2 "0 where n > 0. These pruning rules are respected by the alphabet 
{ 01 2 " +1 ; 1,0}, yielding the topological zeta function above. The pole in the zeta 
function Q} p {z) is a consequence of the infinite alphabet. 

An important consequence of (E.3) is that the sequence {a,} has a periodic tail 
if and only if the kneading sequence has one (however, their period may differ 
by a factor of two). We know already that the kneading sequence is aperiodic for 
almost all A. 

The analytic structure of the function represented by the infinite series £ aiZi 
with unity as radius of convergence, depends on whether the tail of {q) is periodic 
or not. If the period of the tail is N we can write 




(E.3) 



l/£to P (z)=p(z) + q(z)(l +z N + z 



) = P(z) + 



q{z) 
1 -z' 



chapter/dahlqvist.tex 30nov2001 



ChaosBook.org versionl4.4.1, Apr21 2013 



APPENDIX E. SYMBOLIC DYNAMICS TECHNIQUES 



830 



for some polynomials p(z) and q{z). The result is a set of poles spread out along 
the unit circle. This applies to the preperiodic case. An aperiodic sequence of 
coefficients would formally correspond to infinite N and it is natural to assume 
that the singularities will fill the unit circle. There is indeed a theorem ensuring 
that this is the case [57], provided the afs can only take on a finite number of 
values. The unit circle becomes a natural boundary, already apparent in a finite 
polynomial approximations to the topological zeta function, as in figure 15.2. A 
function with a natural boundary lacks an analytic continuation outside it. 

To conclude: The topological zeta function 1 /£ op for unimodal maps has the 
unit circle as a natural boundary for almost all topological entropies and for the 
tent map (E.2), for almost all A. 

Let us now focus on the relation between the analytic structure of the topolo- 
gical zeta function and the number of periodic orbits, or rather (15.8), the number 
N n of fixed points of f(x). The trace formula is (see sect. 15.4) 

N n = tiT" = ^-.£ dzz~ n ^- log & 



2m X dz b " ,op 



where y r is a (circular) contour encircling the origin z = 0 in clockwise direction. 

r 1 

■> top 



Residue calculus turns this into a sum over zeros zo and poles z p of f (c 1 



Z0 :r<| Z0 |<R z„:r<\z P \<R J ™ 1 



and a contribution from a large circle jr. For meromorphic topological zeta func- 
tions one may let R — » oo with vanishing contribution from jr, and N n will be a 
sum of exponentials. 

The leading zero is associated with the topological entropy, as discussed in 
chapter 15. 

We have also seen that for preperiodic kneading there will be poles on the unit 
circle. 

To appreciate the role of natural boundaries we will consider a (very) special 
example. Cascades of period doublings is a central concept for the description of 
unimodal maps. This motivates a close study of the function 



S(z) = f[(l-z 2 ") . (E.4) 

This function will appear again when we derive (E.3). 

The expansion of H(z) begins as E(z) = I - z - Z 2 + z 3 - z 4 + z 5 The radius 

of convergence is obviously unity. The simple rule governing the expansion will 
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effectively prohibit any periodicity among the coefficients making the unit circle 
a natural boundary. 

It is easy to see that H(z) = 0 if z - exp(2nm/2 l ) for any integer m and n. 
(Strictly speaking we mean that E(z) — > 0 when z —> exp(27rwi/2') from inside). 
Consequently, zeros are dense on the unit circle. One can also show that singular 
points are dense on the unit circle, for instance |E(z)| — > oo when z — > exp(27rm/3 I ) 
for any integer m and n. 

As an example, the topological zeta function at the accumulation point of 
the first Feigenbaum cascade is Q, l p (z) = (1 - z)3(z). Then N n - 2 /+1 if n = 
2 1 , otherwise N n = 0. The growth rate in the number of cycles is anything but 
exponential. It is clear that N n cannot be a sum of exponentials, the contour jr 
cannot be pushed away to infinity, R is restricted to R < 1 and N n is entirely 
determined by f which picks up its contribution from the natural boundary. 

We have so far studied the analytic structure for some special cases and we 
know that the unit circle is a natural boundary for almost all A. But how does 
it look out there in the complex plane for some typical parameter values? To 
explore that we will imagine a journey from the origin z = 0 out towards the unit 
circle. While traveling we let the parameter A change slowly. The trip will have a 
distinct science fiction flavor. The first zero we encounter is the one connected to 
the topological entropy. Obviously it moves smoothly and slowly. When we move 
outward to the unit circle we encounter zeros in increasing densities. The closer 
to the unit circle they are, the wilder and stranger they move. They move from 
and back to the horizon, where they are created and destroyed through bizarre 
bifurcations. For some special values of the parameter the unit circle suddenly gets 
transparent and and we get (infinitely) short glimpses of another world beyond the 
horizon. 

We end this section by deriving eqs (E.5) and (E.6). The impenetrable prose 
is hopefully explained by the accompanying tables. 

We know one thing from chapter 1 1 , namely for that finite kneading sequence 
of length n the topological polynomial is of degree n. The graph contains a node 
which is connected to itself only via the symbol 0. This implies that a factor 
(1 - z) may be factored out and £ t op(z) = (1 - z)2Z"=q a iZ l - The problem is to find 
the coefficients a,-. 

The ordered list of (finite) kneading sequences tableE.l and the ordered list of 
periodic orbits (on maximal form) are intimately related. In tableE.2 we indicate 
how they are nested during a period doubling cascade. Every finite kneading 
sequence PC is bracketed by two periodic orbits, PI and P0. We have PI < PC < 
P0 if P contains an odd number of l's, and P0 < PC < PI otherwise. From 
now on we will assume that P contains an odd number of l's. The other case 
can be worked out in complete analogy. The first and second harmonic of PC 
are displayed in table E.2. The periodic orbit PI (and the corresponding infinite 
kneading sequence) is sometimes referred to as the antiharmonic extension of PC 
(denoted A°°(P)) and the accumulation point of the cascade is called the harmonic 
extension of PC [ ] (denoted H°°(P)). 
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periodic orbits 


finite kneading sequences 


PI = A°°(P) 


PC 


To 


POPC 


POPl 


P0P1P0PC 


i 


i 


H°°(P) 


H°°(P) 



Table E.2: Relation between periodic orbits and finite kneading sequences in a harmonic cascade. 
The string P is assumed to contain an odd number of 1 's. 

A central result is the fact that a period doubling cascade of PC is not in- 
terfered by any other sequence. Another way to express this is that a kneading 
sequence PC and its harmonic are adjacent in the list of kneading sequences to 
any order. 



1(C) 


Q L p (z)/(l-z) 


Pi = 100C 
//"(P,) = 10001001100... 


l-z-z 1 -^ 

l-z-'^-z'-zUz'+z^ + z 1 -z 8 ... 


P' = 10001C 


1 7 3 4 5 

l - z. - z - z - z + z° 


A°°(P 2 ) = 1000110001... 
P 2 = 1000C 


l-z-z l -z i -z* + z>-z b -z / -z«... 

i 2 3 4 

l - z- z - z - Z 



Table E.3: Example of a step in the iterative construction of the list of kneading sequences PC. 



Table E.3 illustrates another central result in the combinatorics of kneading 
sequences. We suppose that P\C and P 2 C are neighbors in the list of order 5 
(meaning that the shortest finite kneading sequence PC between P\C and P 2 C is 
longer than 5.) The important result is that P (of length n' = 6) has to coincide 
with the first n' - 1 letters of both H°°(P\) and A°°(P 2 ). This is exemplified in 
the left column of table E.3. This fact makes it possible to generate the list of 
kneading sequences in an iterative way. 

The zeta function at the accumulation point /P°(Pi) is 

C P \{zW) , (E.5) 

and just before A°°(P 2 ) 

CpI(z)/(1-z" 2 ) . (E.6) 

A short calculation shows that this is exactly what one would obtain by applying 
(E.3) to the antiharmonic and harmonic extensions directly, provided that it applies 
to £p\{z) and £p 2 '(z)- This is the key observation. 

Recall now the product representation of the zeta function = _ 
z Hp ). We will now make use of the fact that the zeta function associated with 
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P'C is a polynomial of order ri . There is no periodic orbit of length shorter than 
n' + 1 between H°°{Pi) and A°°(P 2 ). It thus follows that the coefficients of this 
polynomial coincides with those of (E.5) and (E.6), see Table E.3. We can thus 
conclude that our rule can be applied directly to PC. 

This can be used as an induction step in proving that the rule can be applied 
to every finite and infinite kneading sequences. 

Remark E.l How to prove things. The explicit relation between the kneading se- 
quence and the coefficients of the topological zeta function is not commonly seen in the 
literature. The result can proven by combining some theorems of Milnor and Thurston 
[14]. That approach is hardly instructive in the present context. Our derivation was in- 
spired by Metropolis, Stein and Stein classical paper [8]. For further detail, consult [14]. 

E.l.l Periodic orbits of unimodal maps 

A periodic point {cycle point) x k belonging to a cycle of period n is a real solution 



The nth iterate of a unimodal map has at most Z 1 monotone segments, and there- 
fore there will be 2" or fewer periodic points of length n. Similarly, the backward 
and the forward Smale horseshoes intersect at most 2 1 times, and therefore there 
will be 2" or fewer periodic points of length n. A periodic orbit of length n cor- 
responds to an infinite repetition of a length n = n p symbol string, customarily 
indicated by a line over the string: 

S p = (S1S2S3 ... s n )°° = sis 2 s 3 ...s n . 

As all itineraries are infinite, we shall adopt convention that a finite string itinerary 
S p = S1S2S3 . . . s n stands for infinite repetition of a finite block, and routinely omit 
the overline. xq, its cyclic permutation 

SkSk+\ .. .s n s\ ... Sk-\ corresponds to the point x^-\ in the same cycle. A cycle p 
is called prime if its itinerary S cannot be written as a repetition of a shorter block 



Each cycle p is a set of n p rational-valued full tent map periodic points y. It 
follows from (1 1.9) that if the repeating string S1S2 ■ ■ ■ s n contains an odd number 
"l"s, the string of well ordered symbols w\M>2 ■ ■ - W2 n has to be of the double 
length before it repeats itself. The cycle-point y is a geometrical sum which we 
can rewrite as the fraction 



of 



/"(**) = f(f(. ..f(x k ) ...)) = x k , k = 0, 1,2, .. . ,n - 1 . 



(E.7) 



S'. 




(E.8) 
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Using this we can calculate the y(S) for all short cycles. For orbits up to length 5 
this is done in table 11.1. 

Here we give explicit formulas for the topological coordinate of a periodic 
point, given its itinerary. For the purpose of what follows it is convenient to com- 
pactify the itineraries by replacing the binary alphabet $ = {0, 1 } by the infinite 
alphabet 

{a l ,a 2 ,a 3 ,a 4 ,- ■■;()} = {1, 10, 100, 1000, . . . ;6}. (E.9) 

In this notation the itinerary S = aiajatai ■ ■ • and the corresponding topological 
coordinate (1 1.9) are related by y(S) = .l'O^l^O' • • •. For example: 

S - 111011101001000... = a\a\a 2 a\a\a 2 a^aA, . . . 
7(5) - .101101001110000... = .1 1 0 1 1 2 0 1 1 1 0 2 1 3 0 4 . . . 

Cycle points whose itineraries start with w\ - w% - . . . = vv; = 0, Wj+i = 1 remain 
on the left branch of the tent map for i iterations, and satisfy y(0 ...OS) = y(S)/2. 

Periodic points correspond to rational values of y, but we have to distinguish 
even and odd cycles. The even (odd) cycles contain even (odd) number of q in 
the repeating block, with periodic points given by 



f J£l.l'"0/...l* even 
y(a;a j ■ ■ ■ aide) = { , , (E.10) 

3 [ 2^(i + 2«x.i'o;---io odd 



where n = i + j H 1- k + £ is the cycle period. The maximal value periodic point 

is given by the cyclic permutation of S with the largest q as the first symbol, 
followed by the smallest available aj as the next symbol, and so on. For example: 



f(l) -- 


= r(«i) 


-- .10101... 


- .10 


= 2/3 


7(10) = 


= y(a 2 ) 


: .1 2 0 2 ... 


- .1100 


= 4/5 


7(100) = 


= rfe) 


: .1 3 0 3 ... 


- .111000 


= 8/9 


7(101) -- 


- y{a 2 a\) = 


: .^O 1 ... 


- .no 


= 6/7 



An example of a cycle where only the third symbol determines the maximal value 
periodic point is 



7(1101110) = y(a 2 a ia2 aiai) = .11011010010010 - 100/129. 
Maximal values of all cycles up to length 5 are given in table!? 
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E.2 Prime factorization for dynamical itineraries 

The Mobius function is not only a number-theoretic function, but can be 
used to manipulate ordered sets of noncommuting objects such as symbol strings. 
Let P = {p\,p2,P3, • • •} be an ordered set of prime strings, and 

j e N, ki e Z+, be the set of all strings n obtained by the ordered concatenation of 
the "primes" p,. By construction, every string n has a unique prime factorization. 
We say that a string has a divisor d if it contains d as a substring, and define the 
string division n/d as n with the substring d deleted. Now we can do things like 
this: defining t n := tp\t^ 2 ■ ■ ■ t k J. we can write the inverse dynamical zeta function 
(20.2) as 

p n 

and, if we care (we do in the case of the Riemann zeta function), the dynamical 
zeta function as . 

p y n 

A striking aspect of this formula is its resemblance to the factorization of nat- 
ural numbers into primes: the relation of the cycle expansion (E. 1 1) to the product 
over prime cycles is analogous to the Riemann zeta (exercise 19.10) represented 
as a sum over natural numbers vs. its Euler product representation. 

We now implement this factorization explicitly by decomposing recursively 
binary strings into ordered concatenations of prime strings. There are 2 strings of 
length 1, both prime: p\ = 0, p% - 1- There are 4 strings of length 2: 00, 01, 
11, 10. The first three are ordered concatenations of primes: 00 = p\, 01 = p\Pi, 
11 = p^', by ordered concatenations we mean that P1P2 is legal, but P2P1 is not. 
The remaining string is the only prime of length 2, p$ = 10. Proceeding by 
discarding the strings which are concatenations of shorter primes p\ [ p 1 ^ " ' Pj J > 
with primes lexically ordered, we generate the standard list of primes, in agree- 
ment with table 15.1: 0, 1, 10, 101, 100, 1000, 1001, 1011, 10000, 10001, 
10010, 10011, 10110, 10111, 100000, 100001, 100010, 100011, 100110, 100111, 
101100, 101110, 101111, . . .. This factorization is illustrated in tableE.4. 




chapter/appendSymb.tex 23mar98 



ChaosBook.org versionl4.4. 1, Apr 21 2013 



APPENDIX E. SYMBOLIC DYNAMICS TECHNIQUES 



836 



factors 



Pi 

P2 



P\P2 

pi 

P3 



P\P2 
P\P\ 

P\ 

PlP3 
P2P3 
P4 
P5 



string 



00 
01 
11 
10 



000 
001 

on 
111 

010 
110 
100 
101 



factors 

pfi 
p\p\ 
p\p\ 

p\ 

P\P3 
P\P2P3 
P\P3 
P\ 

P\PA 

P2P4 

P\P5 

P2P5 

P6 

Pi 

PS 



string 



0000 
0001 
0011 
0111 
1111 
0010 
0110 
1110 
1010 
0100 
1100 
0101 
1101 
1000 
1001 
1011 



factors 


string 


n 

P[P2 


00000 


00001 


3 2 
P\P 2 

2 3 
P\P\ 


00011 


00111 


P\P\ 


01111 


Pi 

P]P3 
P\P2P3 


11111 


00010 


00110 


P\PiP3 


01110 


P 2 P3 


11110 


P\P\ 


01010 


P2P\ 


11010 


p\pa 


00100 


PIP2PA 


01100 


P\P4 


11100 


P3PA 


10100 



factors 


string 


P\P5 


00101 


P1P2P5 


01101 


P2P5 


11101 


P3P5 


10101 


P\P6 


01000 


P2P6 


11000 


Pi Pi 


01001 


P2Pl 


11001 


PlP& 


01011 


P2P% 


11011 


P9 


10000 


P10 


10001 


Pll 


10010 


Pl2 


10011 


Pl3 


10110 


Pu 


10111 



Table E.4: Factorization of all periodic points strings up to length 5 into ordered con- 
of prime strings p\ = 0, p 2 = 1, pi = 10, pa = 100, ... . 



k\ k 

catenations p { p 2 



pu = 10111. 



E.2.1 Prime factorization for spectral determinants 

Following sect. E.2, the spectral determinant cycle expansions is obtained 
by expanding F as a multinomial in prime cycle weights t p 

00 00 

p k=0 kik 2 k 3 -=0 

where the sum goes over all pseudo-cycles. In the above we have defined 



t k\ h * 3 - nc 0 .*, 4' • ( E -!3) 
1=1 

exercise 19.10 

A striking aspect of the spectral determinant cycle expansion is its resem- 
blance to the factorization of natural numbers into primes: as we already noted in 
sect. E.2, the relation of the cycle expansion (E.12) to the product formula (19.9) 
is analogous to the Riemann zeta represented as a sum over natural numbers vs. 
its Euler product representation. 

This is somewhat unexpected, as the cycle weights factorize exactly with re- 
spect to r repetitions of a prime cycle, t pp _ p = f p , but only approximately {shad- 
owing) with respect to subdividing a string into prime substrings, tp lPl « t pi t P2 . 
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The coefficients C p * have a simple form only in 1 -dimensional given by the 
Euler formula (23.5). In higher dimensions C p k can be evaluated by expanding 
(19.9), F{z) = U P Fp, where 



oo f r V 



F p = \ 



yi. Li yi_ 



Expanding and recollecting terms, and suppressing the p cycle label for the mo- 
ment, we obtain 



F p = ^C/, C k = irfc k ID k , 
r=l 

k d k 

D k = Jd r = \ ~](l-u r J 



(E.14) 



r =1 u=! r=l 



where evaluation of requires a certain amount of not too luminous algebra: 



co 

C3 



1 
1 



( d 



"[(1 + Ua) - Y\(l - Ua) 



\a=\ 



1 ( G?2<^3 



3! </? 



+ 2(fifi?2 - 3c?3 



~[(1 + 2m C( + 2«q + u 3 a ) 

a=l 

+2 [~[(1 - Mfl - u 2 a + ul) - 3 f](l - i£) 



0=1 



efc. . For example, for a general 2-dimensional map we have 



1 Ui+U 2l "1^2(1 + «l)(l + «2) + U\ + «2 3 

f„ — 1 ? h ? ? + . . . . (E.15) 

P D y D 2 D 3 v 



We discuss the convergence of such cycle expansions in sect.J.4. 

With r tj t 2 *„ defined as above, the prime factorization of symbol strings is 

P\ Po "~Pn 

unique in the sense that each symbol string can be written as a unique concatena- 
tion of prime strings, up to a convention on ordering of primes. This factorization 
is a nontrivial example of the utility of generalized Mobius inversion, sect.E.2. 

How is the factorization of sect. E.2 used in practice? Suppose we have com- 
puted (or perhaps even measured in an experiment) all prime cycles up to length 
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n, i.e., we have a list of t p 's and the corresponding Jacobian matrix eigenvalues 
A P; 2, • • • A p A cycle expansion of the Selberg product is obtained by gener- 
ating all strings in order of increasing length j allowed by the symbolic dynamics 
and constructing the multinomial 



(E.16) 



where n = s\S2 ■ ■ ■ sj, Si range over the alphabet, in the present case {0, 1}. Fac- 
torizing every string n = S1S2 • • • sj = p\ l p 1 ^ • • • pj as in table E.4, and sub- 
stituting t *, k 2 we obtain a multinomial approximation to F. For example, 

Pi Pi 

tooiooioioioi = Tomooioioioi = T ooi 2T oi 3 ' and T oi 3 ' T ooi 2 are known functions of 
the corresponding cycle eigenvalues. The zeros of F can now be easily determined 
by standard numerical methods. The fact that as far as the symbolic dynamics is 
concerned, the cycle expansion of a Selberg product is simply an average over all 
symbolic strings makes Selberg products rather pretty. 

To be more explicit, we illustrate the above by expressing binary strings as 
concatenations of prime factors. We start by computing N,„ the number of terms 
in the expansion (E. 12) of the total cycle length n. Setting C p kt k p = z n,,k in (E.12), 
we obtain 



CO co 1 



n„d -.-••) 



So the generating function for the number of terms in the Selberg product is the 
topological zeta function. For the complete binary dynamics we have Af, = 2" 
contributing terms of length n: 



^ = n,a-z^)"i-2z-§ 2V 



Hence the number of distinct terms in the expansion (E.12) is the same as the 
number of binary strings, and conversely, the set of binary strings of length n 
suffices to label all terms of the total cycle length n in the expansion E.12). 
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Counting itineraries 



F.l Counting curvatures 



One consequence of the finiteness of topological polynomials is that the con- 
tributions to curvatures at every order are even in number, half with posi- 
tive and half with negative sign. For instance, for complete binary labeling 

(20.7), 

C4 = -foOOl _ f 0011 _ f 0111 - toto\t\ 

+ fofooi + Won + kmh + huh ■ (F.l) 

We see that 2 3 terms contribute to C4, and exactly half of them appear with a 
negative sign - hence if all binary strings are admissible, this term vanishes in the 
counting expression. 

Such counting rules arise from the identity 

n(' + <,)=n^f <™ 

p p p 

Substituting t p = z" p and using (15.18) we obtain for unrestricted symbol dynam- 
ics with N letters 

p k=2 

The z" coefficient in the above expansion is the number of terms contributing to 
c„ curvature, so we find that for a complete symbolic dynamics of N symbols and 
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n > 1, the number of terms contributing to c„ is (N - l)N k 1 (of which half carry 
a minus sign). 

We find that for complete symbolic dynamics of N symbols and n > 1, the 
number of terms contributing to c n is (N - \)N n ~ x . So, superficially, not much 
is gained by going from periodic orbits trace sums which get N 1 contributions of 
n to the curvature expansions with N"(l - l/N). However, the point is not the 
number of the terms, but the cancelations between them. 



exercise F.4 



Exercises 



F.l. 



F.2. 



F.3. 



F.4. 



Lefschetz zeta function. Elucidate the relation 

betveen the topological zeta function and the Lefschetz 
zeta function. 

Counting the 3-disk pinball counterterms. Verify 
that the number of terms in the 3-disk pinball curvature 
expansion (20.42) is given by 



Substituting into the identity 



we obtain 



IK 1 -*) = 



1 - 3z 4 - 2z 6 
1 - 3z 2 - 2z 3 



= 1 + 3z 2 + 2z 3 + 



z 4 (6+ 12z + 2|)[(l +t P ) = 



I -t 0 



= 1 + to + t\ + 



2t 0 t 



= 1 + 3z 2 + 2z 3 + 6z 4 + 12z 5 



1 - 3z 2 - 2z 3 " 
20z 6 + 48z 7 + 84z 8 + 184z 9 +(E*) 1 + t 0 + h + 



l-/o 



This means that, for example, C6 has a total of 20 terms, 
in agreement with the explicit 3-disk cycle expansion 
(20.43). 

Cycle expansion denominators**. Prove that the 
denominator of q is indeed D^, as asserted (E.14). 

Counting subsets of cycles. The techniques de- 
veloped above can be generalized to counting subsets 
of cycles. Consider the simplest example of a dynami- 
cal system with a complete binary tree, a repeller map 
(1 1.4) with two straight branches, which we label 0 and 
1. Every cycle weight for such map factorizes, with a 
factor to for each 0, and factor t\ for each 1 in its sym- 
bol sning. The transition manix traces (15.7) collapse 
to tr{T k ) = (t 0 + t x ) k , and 1 /£ is simply 



Hence for n > 2 the number of terms in the expansion 
?! with k 0's and n — k l's in their symbol sequences is 
2 fcjj. This is the degeneracy of distinct cycle eigenval- 
ues in fig. ?! ; for systems with non-uniform hyperbolicity 
this degeneracy is lifted (see fig. ?!). 

In order to count the number of prime cycles in each 
such subset we denote with M„^ (n = 1,2, . . . ; k 

I } for ii I : k = 1, . . . ,n — 1 for n > 2) the number 
of prime n-cycles whose labels contain k zeros, use bi- 
nomial string counting and Mobius inversion and obtain 



M 



1,0 



= Mi i = 1 



nM„ 



= ^ y.(m) 



n/m 
k/m 



n>2,k= 1,. 



~[ (l - t p ) = l-to-h 



(F.4) 



where the sum is over all m which divide both n and k. 
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Implementing evolution 



G.l Koopmania 



T 



he Koopman operator action on a state space function a(x) is to replace it by 
its downstream value time t later, a{x) — > a(x(t)), evaluated at the trajectory 
point x{t): 

[**«](*) = a{f{x)) = f dy<K\x,y)a(y) 

J M 

<K\x,y) = 8(y-f(x)). (G.l) 



Given an initial density of representative points p(x), the average value of a(x) 
evolves as 

<«>(') = ■. : f dxa(f(x))p(x) = J— [ dx IK' a](x) p(x) 

\pm\jm \pm\Jm l j 

= ; ; I dx dy a(y) s(y - f(x)) p(x) . 

The 'propagator' 6(y - f{x)) can be interpreted as belonging to the Perron-Frobenius 
operator (16. 10), so the two operators are adjoint to each other, 

f dx[K t a](x)p{x)= f a(y) [£'p](y) . (G.2) 

This suggests an alternative point of view, which is to push dynamical effects 
into the density. In contrast to the Koopman operator which advances the trajec- 
tory by time t, the Perron-Frobenius operator depends on the trajectory point time 
t in the past 
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The Perron-Frobenius operators are non-normal, not self-adjoint operators, 
so their left and right eigenvectors differ. The right eigenvectors of a Perron- 
Frobenius operator are the left eigenvectors of the Koopman, and vice versa. 
While one might think of a Koopman operator as an 'inverse' of the Perron- 
Frobenius operator, the notion of adjoint is the right one, especially in settings 
where flow is not time-reversible, as is the case for dissipative PDEs (infinite di- 
mensional flows contracting forward in time) and for stochastic flows. 

The family of Koopman's operators {'K t } t€ $> forms a semigroup parameter- 
ized by time 

(a) 7C 0 = 1 

(b) 'K ,( K t ' = < K ,+f t, f > 0 (semigroup property) , 

with the generator of the semigroup, the generator of infinitesimal time transla- 
tions denned by 



J{ = lim - i'K' - l) . 
f->0+ t v ' 



(If the flow is finite-dimensional and invertible, is a generator of a group). The 
explicit form of follows from expanding dynamical evolution up to first order, 
as in (2.6): 

J(a{x) = lim - (a{f\x)) - a(xj) = v,(*)<9,a(x) . (G.3) 
t->o + t v ' 

Of course, that is nothing but the definition of the time derivative, so the equation 
of motion for a(x) is 

^j t -^ja{x) = 0. (G.4) 

appendix G.2 

The finite time Koopman operator (G. 1) can be formally expressed by exponenti- 
ating the time-evolution generator J{ as 

7C f = . (G.5) 

exercise G.1 

The generator looks very much like the generator of translations. Indeed, 
for a constant velocity field dynamical evolution is nothing but a translation by 
time x velocity: 



exercise 16.10 



e^^aix) - a(x + tv) . (G.6) 
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As we will not need to implement a computational formula for general 4 31 in 
what follows, we relegate making sense of such operators to appendix G.2. Here 
we limit ourselves to a brief remark about the notion of "spectrum" of a linear 
operator. 

The Koopman operator 7C acts multiplicatively in time, so it is reasonable to 
suppose that there exist constants M > 0, (3 > 0 such that ||'7C|| < Me t/3 for all 
t > 0. What does that mean? The operator norm is define in the same spirit in 
which we defined the matrix norms in sect. K.2: We are assuming that no value 
of HCpix) grows faster than exponentially for any choice of function p(x), so that 
the fastest possible growth can be bounded by 4$, a reasonable expectation in the 
light of the simplest example studied so far, the exact escape rate (17.39). If that 
is so, multiplying by e~'P we construct a new operator e~ t/3( K' = e^ - -® which 
decays exponentially for large t, \\4^~®\\ < M. We say that e'^'K' is an element 
of a bounded semigroup with generator - J31. Given this bound, it follows by 
the Laplace transform 



f 

Jo 



dt e-s'K* = — !— , Re s > {3 , (G.7) 
s - y\ 



that the resolvent operator (s - Jl) 1 is bounded ("resolvent" = able to cause 
separation into constituents) 



1 



s- ft 



r°° m 

< dte- st Me'P = 
Jo 



s-p 



appendix G.2 



section K.2 



If one is interested in the spectrum of e K, as we will be, the resolvent operator is a 
natural object to study. The main lesson of this brief aside is that for the continu- 
ous time flows the Laplace transform is the tool that brings down the generator in 
(16.29) into the resolvent form (17.33) and enables us to study its spectrum. 



G.2 Implementing evolution 

(R. Artuso and R Cvitanovic) 



We now come back to the semigroup of operators 7C. We have introduced 
the generator of the semigroup (16.27) as 



3K = 4*^ 
dt 



r=() 



If we now take the derivative at arbitrary times we get 



dt ) t]->o n 
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Vi(f(x)) — .A(x) 

OX; 



f=/'(x) 



which can be formally integrated like an ordinary differential equation yielding 



exercise G.1 



7C f = e m . 



(G.8) 



This guarantees that the Laplace transform manipulations in sect. 16.5 are correct. 
Though the formal expression of the semigroup (G.8) is quite simple one has to 
take care in implementing its action. If we express the exponential through the 
power series 



k=0 



(G.9) 



we encounter the problem that the infinitesimal generator (16.27) contains non- 
commuting pieces, i.e., there are i, j combinations for which the commutator does 
not satisfy 



d_ 

dxi 



= 0. 



To derive a more useful representation, we follow the strategy used for finite- 
dimensional matrix operators in sects. 4.3 and 4.4 and use the semigroup property 
to write 

t/Sr 
m=l 

as the starting point for a discretized approximation to the continuous time dy- 
namics, with time step 8t. Omitting terms from the second order onwards in the 
expansion of < 70 5t yields an error of order Oidi 2 ). This might be acceptable if the 
time step St is sufficiently small. In practice we write the Euler product 



t/Sr 



K' = I ] (l+<5r^l (m) ) + 0(6t 2 ) 



(G.10) 



m= 1 



where 



(^ (m) <A)(x) - v,cr* r (x))^ 



x=f m5r (x) 
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As far as the x dependence is concerned, £ T ^ [ acts as 







x x 




• — > ■ 




Xi 




xi + 5tv,-(jc) 


. X <1 . 




x d 



(G.ll) 



exercise 2.6 



We see that the product form (G.10) of the operator is nothing else but a pre- 
scription for finite time step integration of the equations of motion - in this case 
the simplest Euler type integrator which advances the trajectory by tfrxvelocity at 
each time step. 



G.2.1 A symplectic integrator 



The procedure we described above is only a starting point for more so- 
phisticated approximations. As an example on how to get a sharper bound on the 
error term consider the Hamiltonian flow 3K = S + C, S = Pi-^p, C - -djV(q)-^. 
Clearly the potential and the kinetic parts do not commute. We make sense of the 
formal solution (G.10) by splitting it into infinitesimal steps and keeping terms up 
to 6t 2 in 



exercise G.2 



<K St = K St + 2^(<5r) 3 [S + 2C, [S,C]] + 



(G.12) 



where 



q^&r _ e \5rS e 6TC e \5rS 



(G.13) 



The approximate infinitesimal Liouville operator , K"' 5t is of the form that now gen- 
erates evolution as a sequence of mappings induced by (16.30), a free flight by 
\8tS, scattering by drdViq'), followed again by \6tS free flight: 



e \6rS J q 



e- 



P 
I l' 



StB I 1 



-TP 



p' + 6TdV(q') 



q> - %p> 



(G.14) 



Collecting the terms we obtain an integration rule for this type of symplectic flow 
which is better than the straight Euler integration (G.ll) as it is accurate up to 



appendMeasure - 8may20I2 



ChaosBook.org versionI4.4.1, Apr 21 2013 



EXERCISES 



846 



order St 2 : 



(St) 2 



q n +l = q n - St p n — dV (q n - Stp„/2) 

Pn+i = p„ + STdV(q„-6Tp n /2) (G.15) 

The Jacobian matrix of one integration step is given by 



0 1/1 STdV(q') 1 MO 1 



Note that the billiard flow (8.11) is an example of such symplectic integrator. In 
that case the free flight is interrupted by instantaneous wall reflections, and can be 
integrated out. 



Commentary 



Remark G.l Koopman operators. The "Heisenberg picture" in dynamical systems 
theory has been introduced by Koopman and Von Neumann [ 1, 2], see also ref. [12]. 
Inspired by the contemporary advances in quantum mechanics, Koopman [ I ] observed in 
1931 that 7C is unitary on L 2 (p) Hilbert spaces. The Koopman operator is the classical 
analogue of the quantum evolution operator exp \ iHt/h) - the kernel of JL'iy, x) introduced 
in (16.16) (see also sect. 17.2) is the analogue of the Green's function discussed here in 
chapter 31. The relation between the spectrum of the Koopman operator and classical 
ergodicity was formalized by von Neumann [2]. We shall not use Hilbert spaces here 
and the operators that we shall study will not be unitary. For a discussion of the relation 
between the Perron-Frobenius operators and the Koopman operators for finite dimensional 
deterministic invertible flows, infinite dimensional contracting flows, and stochastic flows, 
see Lasota-Mackey [ 1 2] and Gaspard [8]. 



Remark G.2 Symplectic integration. The reviews [12] and [13] offer a good starting 
point for exploring the symplectic integrators literature. For a higher order integrators of 
type (G.l 3), check ref. [ 18]. 



Exercises 

G.l. Exponential form of semigroup elements. Check commute, K'Jl = by considering the action of 

that the Koopman operator and the evolution generator both operators on an arbitrary state space function a(x). 
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G.2. Non-commutativity. Check that the commutators in G.3. Symplectic leapfrog integrator. Implement (G. 15) 

(G.12) are not vanishing by showing that for 2-dimensional Hamiltonian flows; compare it with 

(o q \ Runge-Kutta integrator by integrating trajectories in 

V" V — . some (chaotic) Hamiltonian flow. 
dp dqj 
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Appendix H 



Transport of vector fields 



Man who says it cannot be done should not interrupt man 
doing it. 

— Sayings of Vattay Gabor 

In this appendix we show that the multidimensional Lyapunov exponents and 
relaxation exponents (dynamo rates) of vector fields can be expressed in terms 
of leading eigenvalues of appropriate evolution operators. 



H.l Evolution operator for Lyapunov exponents 



Lyapunov exponents were introduced and computed for 1 -dimensional 
maps in sect. 17.4. For higher-dimensional flows only the Jacobian matrices are 
multiplicative, not individual eigenvalues, and the construction of the evolution 
operator for evaluation of the Lyapunov spectra requires the extension of evolution 
equations to the flow in the tangent space. We now develop the requisite theory. 

Here we construct a multiplicative evolution operator (H.4) whose spectral 
determinant (H.8) yields the leading Lyapunov exponent of a <i-dimensional flow 
(and is entire for Axiom A flows). 

The key idea is to extend the dynamical system by the tangent space of the 
flow, suggested by the standard numerical methods for evaluation of Lyapunov 
exponents: start at xq with an initial infinitesimal tangent space vector in the d- 
dimensional tangent space ?7(0) e TM X , and let the flow transport it along the 
trajectory x(t) = f(x 0 ). 

The dynamics in the tangent bundle (x, 5x) e TM is governed by the system 
of equations of variations (4.2): 

x - v(x) , fj - A(x) r\ . 



849 



APPENDIX H. TRANSPORT OF VECTOR FIELDS 



850 



Here A(x) is (4.3), the stability matrix (velocity gradients matrix) of the flow. We 
write the solution as 



x(t)=f(x 0 ), 77(/) = /(x 0 )77o, (H.1) 

with the tangent space vector 77 transported by the Jacobian matrix J(xq) - dx(t)/dxo 
(4.5). 

As explained in sect. 4.1, the growth rate of this vector is multiplicative along 
the trajectory and can be represented as 77(f) = \r](t)\f\T](0)\ u{t) where u{t) is a 
"unit" vector in some norm |.||. For asymptotic times and for almost every initial 
(xq, ?/(0)), this factor converges to the leading eigenvalue of the linearized stability 
matrix of the flow. 

We implement this multiplicative evaluation of Floquet multipliers by adjoin- 
ing the J-dimensional transverse tangent space 77 e TA1 V ; 77(jc) • v(x) = 0 to the 
(d+l)-dimensional dynamical evolution space x € M c In order to deter- 

mine the length of the vector 77 we introduce a homogeneous differentiable scalar 
function g{rf) - \\r]\\. It has the property g(Arj) = |A|g(?7) for any A. An example 
is the projection of a vector to its dth component 



< m ' 
m 



= \m\ 



Any vector ?7(0) e TM X can now be represented by the product 77 = Am, where 
u is a "unit" vector in the sense that its norm is |//| = 1, and the factor 

AW u 0 ) = g(T](t)) = g(f(xo)u 0 ) (H.2) 



is the multiplicative "stretching" factor. 

Unlike the leading eigenvalue of the Jacobian the stretching factor is multi- 
plicative along the trajectory: 

A ? ' + '(^o, mo) - A'\x(t), u{t)) A f (^o, wo). 

exercise H.1 

The u evolution constrained to ET gtX , the space of unit transverse tangent vectors, 
is given by rescaling of (H.1): 

u = R\x, u) = J\x)u . (H.3) 

A\x, u) 
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Eqs. (H. 1), (H.2) and (H.3) enable us to define a multiplicative evolution operator 
on the extended space U x ET gtX 



, , , I , , \ S(u' — R'(x, u)) 
L\x', u';x, u) = Six - fix)) K -i-r- , (H.4) 



where /? is a variable. 

To evaluate the expectation value of log |A f (x, u)\ which is the Lyapunov ex- 
ponent we again have to take the proper derivative of the leading eigenvalue of 
(H.4). In order to derive the trace formula for the operator (H.4) we need to eval- 
uate Tr£ ? = J dxdu£l{u,x;u,x). The J dx integral yields a weighted sum over 
prime periodic orbits p and their repetitions r: 



~ 6(t-rT p ) 
Tr£ f = 2j T ph I det(l- MQ \ ^ 



6(u - R T » r (x p , «)) 

uu = — — 

|AV(x p ,b)|^- 1 



A P,r = du rp ~7Z~, i (H5) 



where M p is the prime cycle p transverse stability matrix. As we shall see below, 
Ap ;f . is intrinsic to cycle p, and independent of any particular periodic point x,. 

We note next that if the trajectory fix) is periodic with period T, the tangent 
space contains d periodic solutions 

e m (x(T + 0) = e (0 (je(f)) , i - 1, ...,d, 

corresponding to the d unit eigenvectors {tP-*,e (2 \ • • • , e^} of the transverse sta- 
bility matrix, with "stretching" factors (H.2) given by its eigenvalues 

M p (x)e ( '\x) = A p j e ( '\x) , i - 1, d. (no summation on i) 

The J du integral in (H.5) picks up contributions from these periodic solutions. In 
order to compute the stability of the ith eigen-direction solution, it is convenient to 
expand the variation around the eigenvector in the stability matrix eigenbasis 
5u - Yj Sue . The variation of the map (H.3) at a complete period t = T is then 
given by 



T , n M5u Me w /<9g(e®) 

5R T (e U) ) = — s \ MSu 

g(MeW) g(Me^) 2 \ du 

- ZteU-0*g!lW ' (H.6, 
ifi a pJ \ du k ) 
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The 5ui component does not contribute to this sum since g{& l) + <fw,e®) - 1 +dui 
implies dg(e®)/dui = 1. Indeed, infinitesimal variations 8u must satisfy 



g(u + 5u) = g(u) = 1 



, dg{u) 
5u c — — = 0, 
due 



so the allowed variations are of form 



6u = Y J ( 



duk 



c k , \c k \ <SC 1 , 



and in the neighborhood of the eigenvector the J du integral can be expressed 
as 



| du - I Y\ dck ■ 



Inserting these variations into the J du integral we obtain 
f du S(e (i) + 6u-R T (e (i) )-6R T (e (i) ) + ...) 



dc k 8{{\ -A k /Aj)c k + ...) 



Afe/Ail ' 



and the J du trace (H.5) becomes 



v Z ]A r «-i r[|i_A: ,/a: 



i'=1 1 P,i ' k+i 



P^ A pJ 



(H.7) 



The corresponding spectral determinant is obtained by observing that the Laplace 
transform of the trace (18.23) is a logarithmic derivative Tr£(s) = -4- log F(s) 
of the spectral determinant: 



F{fi, s) = exp 



-X 

V p,r 



e sT p r 



r | det (1 - Mi) 



T\ I n r » ' 



(H.8) 



This determinant is the central result of this section. Its zeros correspond to the 
eigenvalues of the evolution operator (H.4), and can be evaluated by the cycle 
expansion methods. 
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The leading zero of (H.8) is called "pressure" (or free energy) 



P(fi) = SOW)- 



(H.9) 



The average Lyapunov exponent is then given by the first derivative of the pressure 
at/3 - 1: 



A = P'(l). 



(H.10) 



The simplest application of (H.8) is to 2-dimensional hyperbolic Hamiltonian 
maps. The Floquet multipliers are related by Ai = I/A2 = A, and the spectral 
determinant is given by 



F(J3,z) 

ApM 



r\ A>„ I (1-1 /A') 2 



5 -z 

I A- \^ 

1 - l/Al r ' 1 - I /A 2 / 



1 



+ 



(H.11) 



The dynamics (H.3) can be restricted to a u unit eigenvector neighborhood 
corresponding to the largest eigenvalue of the Jacobi matrix. On this neighbor- 
hood the largest eigenvalue of the Jacobi matrix is the only fixed point, and the 
spectral determinant obtained by keeping only the largest term the A,,,, sum in 
(H.7) is also entire. 

In case of maps it is practical to introduce the logarithm of the leading zero 
and to call it "pressure" 

P(yS) = log £()(£>. 

The average of the Lyapunov exponent of the map is then given by the first deriva- 
tive of the pressure at /3 = 1 : 

I = P'(1). 

By factorizing the determinant (H. 11) into products of zeta functions we can 
conclude that the leading zero of the (H.4) can also be recovered from the leading 
zeta function 



l/<r o 0e,z) = exp 



< rn \ 

r\A r „f 



(H.12) 



This zeta function plays a key role in thermodynamic applications, see chapterL. 
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H.2 Advection of vector fields by chaotic flows 

Fluid motions can move embedded vector fields around. An example is the mag- 
netic field of the Sun which is "frozen" in the fluid motion. A passively evolving 
vector field V is governed by an equation of the form 

d t \ + u- VV-V- Vu = 0, (H.13) 

where u(x, t) represents the velocity field of the fluid. The strength of the vector 
field can grow or decay during its time evolution. The amplification of the vector 
field in such a process is called the "dynamo effect." In a strongly chaotic fluid 
motion we can characterize the asymptotic behavior of the field with an exponent 

V(x, t) ~ Y(x)e vt , (H.14) 

where v is called the fast dynamo rate. The goal of this section is to show that 
periodic orbit theory can be developed for such a highly non-trivial system as 
well. 

We can write the solution of (H.13) formally, as shown by Cauchy. Let x(t,a) 
be the position of the fluid particle that was at the point a at t - 0. Then the field 
evolves according to 

V(x,0 - J(a,0V(a,0) , (H.15) 

where J(a, t) = d(x) /3(a) is the Jacobian matrix of the transformation that moves 
the fluid into itself x = x(a, t). 

We write x = /'(a), where f is the flow that maps the initial positions of the 
fluid particles into their positions at time t. Its inverse, a = /" f (x), maps particles 
at time t and position x back to their initial positions. Then we can write p. 15) 

V i (x,t) = Y j fd 3 a£ t ij (x,aWM,0) , (H.16) 



with 



dx 

4/x,a) = 5(a-r f (x))^ . (H.17) 



For large times, the effect of £! is dominated by its leading eigenvalue, e K ° ( with 
Re(vo) > Re(vj), i = 1,2, 3, .... In this way the transfer operator furnishes the fast 
dynamo rate, v := vq. 
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The trace of the transfer operator is the sum over all periodic orbit contribu- 
tions, with each cycle weighted by its intrinsic stability 



trM: 



fit det(l-M/) 



■8{t - rT p ). 



(H.18) 



We can construct the corresponding spectral determinant as usual 



F(s) = exp 



1 

EZ 1 



trM: 



fj^i r det(l-M/) 



(H.19) 



Note that in this formuli we have omitted a term arising from the Jacobian trans- 
formation along the orbit which would give 1 + tr M p in the numerator rather 
than just the trace of M' p . Since the extra term corresponds to advection along the 
orbit, and this does not evolve the magnetic field, we have chosen to ignore it. It 
is also interesting to note that the negative powers of the Jacobian occur in the 
denominator, since we have J^ f in (H.17). 

In order to simplify F(s), we factor the denominator cycle stability determi- 
nants into products of expanding and contracting eigenvalues. For a 3-dimensional 
fluid flow with cycles possessing one expanding eigenvalue Ap (with \A p \ > 1), 
and one contracting eigenvalue A p (with \A p \ < 1) the determinant may be ex- 
panded as follows: 



|det (l - M~/)\ 1 = |(1 - A/)(l - A-/T 1 = IV' 2 Z A P JrA P ■ (H - 20) 
With this decomposition we can rewrite the exponent in (H. 19) as 



' l(A p + A p )e srT » 



oo oo 1 

Z X Z - {\*p\^ P J 4 eSTp J (^ +A P > (H-21) 



p r=l r det (l - M/)| ^ p^ 0 frf r 



which has the form of the expansion of a logarithm: 

^ ^ [log(l - e sT »\A p \A l p - j A k p ) + log (l - e sT "\A p \A~ p j A p +k )} . (H.22) 

P j,k 

The spectral determinant is therefore of the form, 



F(s) = F e (s)F c (s) , 



(H.23) 
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where 



F e (s) = \ , (H.24) 

P j,k=0 



Fc(s) = \ \(l-tf ] A p ), (H.25) 

P j,k=0 



with 



A k 

tf> = e sT »\A p \^r . (H.26) 



The two factors present in F(s) correspond to the expanding and contracting ex- 
ponents. (Had we not neglected a term in (H.19), there would be a third factor 
corresponding to the translation.) 

For 2-dimensional Hamiltonian volume preserving systems, A - 1/A and 
(H.24) reduces to 

oo / \k+l s f 

'•»=nn('-^rj • '-rb • 

With <T p = Ay, /| Ap|, the Hamiltonian zeta function (the j = k = 0 part of the 
product (H.25)) is given by 

HQyn{s) = Y\(l-(T p e sT "). (H.28) 



This is a curious formula — the zeta function depends only on the return times, 
not on the eigenvalues of the cycles. Furthermore, the identity, 



A+ 1/A 



|(1 - A)(l - 1/A)| |(1 - A)(l - 1/A)| ' 

when substituted into (H.23), leads to a relation between the vector and scalar 
advection spectral determinants: 



F dyn (s) = Fl(s)IQ yn {s) . (H.29) 



The spectral determinants in this equation are entire for hyperbolic (axiom A) 
systems, since both of them correspond to multiplicative operators. 
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In the case of a flow governed by a map, we can adapt the formulas p.27) 
and (H.28) for the dynamo determinants by simply making the substitution 



e sT " , (H.30) 



where n p is the integer order of the cycle. Then we find the spectral determinant 
F e (z) given by equation (H.27) but with 

fi P 

* = Pg (H31) 

for the weights, and 

lK d yn{z) = n p (\-cT p z n ") (H.32) 

for the zeta-function 

For maps with finite Markov partition the inverse zeta function (H.32) reduces 
to a polynomial for z since curvature terms in the cycle expansion vanish. For ex- 
ample, for maps with complete binary partition, and with the fixed point stabilities 
of opposite signs, the cycle expansion reduces to 

IKdynis) = 1. (H.33) 

For such maps the dynamo spectral determinant is simply the square of the scalar 
advection spectral determinant, and therefore all its zeros are double. In other 
words, for flows governed by such discrete maps, the fast dynamo rate equals the 
scalar advection rate. 

In contrast, for 3-dimensional flows, the dynamo effect is distinct from the 
scalar advection. For example, for flows with finite symbolic dynamical gram- 
mars, (H.29) implies that the dynamo zeta function is a ratio of two entire deter- 
minants: 



\Kdyn{s) = F dyn {s)IFl{s). (H.34) 



This relation implies that for flows the zeta function has double poles at the zeros 
of the scalar advection spectral determinant, with zeros of the dynamo spectral 
determinant no longer coinciding with the zeros of the scalar advection spectral 
determinant; Usually the leading zero of the dynamo spectral determinant is larger 
than the scalar advection rate, and the rate of decay of the magnetic field is no 
longer governed by the scalar advection. 



exercise H.2 
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Remark H.l Lyapunov exponents. Sect. H.l is based on ref. [1], 

Remark H.2 Dynamo zeta. The dynamo zeta (H.32) has been introduced by Aurell 
and Gilbert [3] and reviewed in ref. [4]. Our exposition follows ref. [22]. 



[ 



Exercises 

for x > 0. Show that the dynamo zeta is 

l/& yn O) = l-e sT -+e sT ". (H.36) 

Show also that the escape rate is the leading zero of 

l/( 0 (s) = l-e sT «/a-e sT »/b. (H.37) 

Calculate the dynamo and the escape rates analytically 
if b — a 2 and Tt, = 2T a . Do the calculation for the case 
when you reverse the signs of the slopes of the map. 
What is the difference? 
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Discrete symmetries of dynamics 



Basic group-theoretic notions are recapitulated here: groups, irreducible rep- 
resentations, invariants. Our notation follows birdtracks.eu. 

The key result is the construction of projection operators from invariant ma- 
trices. The basic idea is simple: a hermitian matrix can be diagonalized. If this 
matrix is an invariant matrix, it decomposes the reps of the group into direct sums 
of lower-dimensional reps. Most of computations to follow implement the spectral 
decomposition 

M = I1P1 +A 2 P 2 + '-' + XrV r , 

which associates with each distinct root A, of invariant matrix M a projection 
operator (1.17): 



Sects. 1.3 and 1.4 develop Fourier analysis as an application of the general 
theory of invariance groups and their representations. 



We define group, representation, symmetry of a dynamical system, and invariance. 




1.1 Preliminaries and definitions 



(A. Wirzba and P. Cvitanovic) 
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Group axioms. A group G is a set of elements g\ , g 2 , g3, . . . for which compo- 
sition or group multiplication g2 o gj (which we often abbreviate as g2gi) of any 
two elements satisfies the following conditions: 

1. If gi,g 2 e G, then g 2 ° gi eG. 

2. The group multiplication is associative: gs o (g 2 ° gi) = (gi ° g 2 ) ° gi- 

3. The group G contains identity element e such that g o e = e o g = g for every 
element g e G. 

4. For every element g e G, there exists a unique /i == g -1 e G such that 
/log-go/j-e. 

A finite group is a group with a finite number of elements 

G = {e,g 2 , ...,g|G|), 

where \G\, the number of elements, is the order of the group. 

Example 1.1 Finite groups: Some finite groups that frequently arise in applica- 
tions: 

• C„ (also denoted Z n ): the cyclic group of order n. 

• D„ : the dihedral group of order In, rotations and reflections in plane that preserve 
a regular n-gon. 

• S„: the symmetric group of all permutations ofn symbols, order nl. 

Example 1.2 Lie groups: Some compact continuous groups that arise in dynam- 
ical systems applications: 

• S 1 (also denoted T l ): circle group of dimension 1. 

• T m - S 1 x S 1 ■ ■ ■ x S 1 : m-torus, of dimension m. 

• SO(2): rotations in the plane, dimension 1. Isomorphic toS 1 . 

• 0(2) = SO(2) x Di : group of rotations and reflections in the plane, of dimension 
1. 

• U(l): group of phase rotations in the complex plane, of dimension 1. Isomorphic 
to SO(2). 

• SO(3): rotation group of dimension 3. 

• SU(2): unitary group of dimension 3. Isomorphic to SO(3). 

• GL(n): general linear group of invertible matrix transformations, dimension n 2 . 

• SO(n): special orthogonal group of dimension n(n - l)/2. 

• 0(«) = SO(«) x Di : orthogonal group of dimension n(n - l)/2. 

• S p(n): symplectic group of dimension n(n + l)/2. 

• SU(n): special unitary group of dimension n 2 - 1 . 
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Example 1.3 Cyclic and dihedral groups: The cyclic group C„ c SO(2) of order n 
is generated by one element. For example, this element can be rotation through 2njn. 
The dihedral group D„ c 0(2), n>2, can be generated by two elements one at least 
of which must reverse orientation. For example, take cr corresponding to reflection in 
the x-axis. cr 2 = e; such operation cr is called an involution. C to rotation through 2n/n, 
then D n = (cr, C), and the defining relations are cr 1 = C" = e, (Co-) 2 = e. 

Groups are defined and classified as abstract objects by their multiplication 
tables (for finite groups) or Lie algebras (for Lie groups). What concerns us in 
applications is their action as groups of transformations on a given space, usually a 
vector space (see appendix C. 1), but sometimes an affine space, or a more general 
manifold M. 

Repeated index summation. Throughout this text, the repeated pairs of up- 
per/lower indices are always summed over 

n 

G a b x b = Y J G a h x b , (1.1) 

b=l 

unless explicitly stated otherwise. 

General linear transformations. Let GL(n,¥) be the group of general linear 
transformations, 

GL(n, ¥) = {g : F" -> F" | det (g) * 0} . (1.2) 

Under GL(n, F) a basis set of V is mapped into another basis set by multiplication 
with a [nxn] matrix g with entries in field F (F is either R or C), 

e /fl = eW- 

As the vector x is what it is, regardless of a particular choice of basis, under this 
transformation its coordinates must transform as 

x'a = ga b Xb ■ 

Standard rep. We shall refer to the set of [n x n] matrices g as a standard rep 
of GL(n,¥), and the space of all rc-tuples (jq, X2, ■ ■ ., x n ) T , Xj e F on which these 
matrices act as the standard representation space V. 

Under a general linear transformation g e GL(n, F), the row of basis vectors 
transforms by right multiplication as d = eg -1 , and the column of x a 's trans- 
forms by left multiplication as xf = gx. Under left multiplication the column 
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(row transposed) of basis vectors e 7 transforms as e' T = g^e T , where the dual 
rep g^ = (g~ l ) T is the transpose of the inverse of g. This observation motivates 
introduction of a dual representation space V, the space on which GL(n, F) acts 
via the dual rep g^. 



Dual space. If V is a vector representation space, then the dual space V is the 
set of all linear forms on V over the field F. 

If {e (1 \ • • • , e^} is a (right) basis of V, then V is spanned by the dual basis 
(left basis) {e(i), • • • , e^}, the set of n linear forms ey) such that 

where 5 h a is the Kronecker symbol, & a = 1 if a = b, and zero otherwise. The 
components of dual representation space vectors will here be distinguished by 
upper indices 

(v : .v- v"). (1.3) 

They transform under GL(n, F) as 

y' a = (g f ) b a y b . (1-4) 

For GL(n,¥) no complex conjugation is implied by the^ notation; that interpre- 
tation applies only to unitary subgroups of GL(n, C). g can be distinguished from 
g 1 ' by meticulously keeping track of the relative ordering of the indices, 



(gT a -» fa ■ (1-5) 



Defining space, dual space. In what follows V will always denote the defining 
n-dimensional complex vector representation space, that is to say the initial, "el- 
ementary multiplet" space within which we commence our deliberations. Along 
with the denning vector representation space V comes the dual ^-dimensional vec- 
tor representation space V. We shall denote the corresponding element of V by 
raising the index, as in (1.3), so the components of defining space vectors, resp. 
dual vectors, are distinguished by lower, resp. upper indices: 

x = (xi,X2, ...,x„) , xeV 

x = (x l ,x 2 ,...,x n ) , xeV. (1.6) 
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Defining rep. Let G be a group of transformations acting linearly on V, with the 
action of a group element g € G on a vector x e V given by an [nxn] matrix g 

x' a =g a b x b a,b - 1,2, ...,n. (1.7) 

We shall refer to g a b as the defining rep of the group G. The action of g e G on a 
vector q e V is given by the cfwa/ re/? [tixti] matrix g': 

x' a = x b {g^) b " = g" h x b . (1.8) 

In the applications considered here, the group G will almost always be assumed 
to be a subgroup of the unitary group, in which case g~ l = g^, and ' indicates 
hermitian conjugation: 

(g\ h = (8b a r = 8 h a. (1-9) 



Hermitian conjugation is effected by complex conjugation and index transpo- 
sition: Complex conjugation interchanges upper and lower indices; transposition 
reverses their order. A matrix is hermitian if its elements satisfy 

(M f f b = M a b . (1. 10) 



For a hermitian matrix there is no need to keep track of the relative ordering of 
indices, as M b a = (M f )b a - M a b . 



Invariant vectors. The vector q e V is an invariant vector if for any transfor- 
mation g e G 

q = gq. (1.11) 
If a bilinear form M(x, y) - xf'M^yi, is invariant for all g e G, the matrix 



M a b = g a c g b d M c d (1.12) 



is an invariant matrix. Multiplying with g b and using the unitary condition (1.9), 
we find that the invariant matrices commute with all transformations g e G: 

[g,M]=0. (1.13) 
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Invariants. We shall refer to an invariant relation between p vectors in V and 
q vectors in V, which can be written as a homogeneous polynomial in terms of 
vector components, such as 

H(x, y, I, r, s) = h ab cde x b y a s e r d z c , (1. 14) 

as an invariant in V q ® V p (repeated indices, as always, summed over). In this 
example, the coefficients are components of invariant tensor h e V 6 ® V 2 . 

Matrix representation of a group. Let us now map the abstract group G homeo- 
morphically on a group of matrices D(G) acting on the vector space V, i.e., in such 
a way that the group properties, especially the group multiplication, are preserved: 

1. Any g € G is mapped to a matrix D(g) e D(G). 

2. The group product g2 ° g\ e G is mapped onto the matrix product Dfe ° 
gi) = D(g 2 )D(g 1 ). 

3. The associativity is preserved: D(g 3 o (g 2 o gl )) = D(g 3 )(D(g 2 )D(gi)) = 
(D(g 3 )(D(g 2 ))D(g 1 ). 

4. The identity element e e G is mapped onto the unit matrix D(e) = 1 and 
the inverse element g" 1 € G is mapped onto the inverse matrix D(g _1 ) = 
[D(g)]" 1 = D- ! (g). 

We call this matrix group D(G) a linear or matrix representation of the group G 
in the representation space V. We emphasize here 'linear' in order to distinguish 
the matrix representations from other representations that do not have to be linear, 
in general. Throughout this appendix we only consider linear representations. 

If the dimensionality of V is d, we say the representation is an d-dimensional 
representation. We will often abbreviate the notation by writing matrices D(g) e 
D(G) as g, i.e., x' = gx corresponds to the matrix operation ^ = Yfj-\ ^(8)ij x j- 

Character of a representation. The character of Xaig) of a (i-dimensional rep- 
resentation D(g) of the group element g € G is defined as trace 

d 

Xa (g) = trD(g) = ^ii(8)- 
i=i 

Note that x(e) = d, since D,y(e) = 6jj for 1 < i, j < d. 
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Faithful representations, factor group. If the mapping G on D(G) is an iso- 
morphism, the representation is said to be faithful. In this case the order of the 
group of matrices D(G) is equal to the order \G\ of the group. In general, how- 
ever, there will be several elements h € G that will be mapped on the unit matrix 
D(/j) = 1. This property can be used to define a subgroup H c G of the group 
G consisting of all elements h e G that are mapped to the unit matrix of a given 
representation. Then the representation is a faithful representation of the factor 
group G/H. 

Equivalent representations, equivalence classes. A representation of a group 
is by no means unique. If the basis in the J-dimensional vector space V is changed, 
the matrices D(g) have to be replaced by their transformations 0(g), with the new 
matrices T)'(g) and the old matrices D(g) are related by an equivalence transfor- 
mation through a non-singular matrix C 



The group of matrices D'(g) form a representation D'(G) equivalent to the rep- 
resentation D(G) of the group G. The equivalent representations have the same 
structure, although the matrices look different. Because of the cylic nature of the 
trace the character of equivalent representations is the same 



Regular representation of a finite group. The regular representation of a group 
is a special representation that is defined as follows: Combine the elements of a 
finite group into a vector {g\,g2, ■ ■ ■ ,g\c\)- Multiplication by any element g v per- 
mutes {gi,g2, . . . ,g\c\] entries. We can represent the element g v by the permu- 
tation it induces on the components of vector {g\,gi, ■ ■ ■ ,g\c\)- Thus for i, j = 
1, . . . , \G\, we define the regular representation 



In the regular representation the diagonal elements of all matrices are zero except 
for the identity element g v = e with g v gi = gj. So in the regular representation the 
character is given by 



D'(g) = CD(g)C 



7f 



Xig) = V'ite) = trD '(s) = trfCDC^C- 1 ) . 



i=i 



Dyfey) 



<5 ji, if g v gi = 8k with li = l,...,\G\, 
0 otherwise . 
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1.2 Invariants and reducibility 



What follows is a bit dry, so we start with a motivational quote from Hermann 
Weyl on the "so-called first main theorem of invariant theory": 

"All invariants are expressible in terms of a finite number among them. We 
cannot claim its validity for every group G; rather, it will be our chief task to 
investigate for each particular group whether a finite integrity basis exists or not; 
the answer, to be sure, will turn out affirmative in the most important cases." 

It is easy to show that any rep of a finite group can be brought to unitary 
form, and the same is true of all compact Lie groups. Hence, in what follows, we 
specialize to unitary and hermitian matrices. 



1.2.1 Projection operators 

For M a hermitian matrix, there exists a diagonalizing unitary matrix C such that 
( M ... 0 



0 



A 



0 



0 







*2 


0 .. 


0 


CMC 1 " - 


0 


0 










0 




A 2 




0 




0 





(1.15) 



Here A[ + Aj are the r distinct roots of the minimal characteristic (or secular) 
polynomial 



~[(M-^1) = 0. 



(1.16) 



In the matrix C(M - A 2 1)C^ the eigenvalues corresponding to A 2 are replaced 
by zeroes: 



Ai - A 2 



A\ - A 2 



0 






0 



/I3 - A 2 



/I3 - A 2 
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and so on, so the product over all factors (M - /fel)(M - /I3I) ... , with exception 
of the (M - A\l) factor, has nonzero entries only in the subspace associated with 



c y\(M - = Y\&i - A i> 



1 o o 
o 1 o 

0 0 1 



Thus we can associate with each distinct root Ai a projection operator P,, 



i—r M - Ajl 



J*l 



Ai - Ai 



(1.17) 



which acts as identity on the ith subspace, and zero elsewhere. For example, the 
projection operator onto the A\ subspace is 



/ 1 



Pi = C f 



(1.18) 



0 / 



The diagonalization matrix C is deployed in the above only as a pedagogical de- 
vice. The whole point of the projector operator formalism is that we never need 
to carry such explicit diagonalization; all we need are whatever invariant matrices 
M we find convenient, the algebraic relations they satisfy, and orthonormality and 
completeness of P,-: The matrices P; are orthogonal 



P,Py = SijPj , (no sum on j) , 



(1-19) 



and satisfy the completeness relation 



i=\ 



(1.20) 



As tr(CP,C^) = trP,-, the dimension of the ith subspace is given by 



di = trP,-. 



(1.21) 
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It follows from the characteristic equation (1.16) and the form of the projection 
operator (1.17) that Aj is the eigenvalue of M on p subspace: 

MP ; = AiVi , (no sum on i) . (1.22) 

Hence, any matrix polynomial /(M) takes the scalar value /(^) on the P, sub- 
space 

/(M)P; = fWPi . (1.23) 

This, of course, is the reason why one wants to work with irreducible reps: they 
reduce matrices and "operators" to pure numbers. 



1.2.2 Irreducible representations 

Suppose there exist several linearly independent invariant [dxd] hermitian matrices 
Mi,M2, . . ., and that we have used Mi to decompose the J-dimensional vector 
space V = Vi © V% © •• •. Can M2,M3, ... be used to further decompose Vp. 
Further decomposition is possible if, and only if, the invariant matrices commute: 



[Mi,M 2 ]=0, (1.24) 

or, equivalently, if projection operators P ; constructed from M 2 commute with 
projection operators p constructed from Mi , 

P,P 7 = V-Pi . (1.25) 

Usually the simplest choices of independent invariant matrices do not com- 
mute. In that case, the projection operators p constructed from Mi can be used 
to project commuting pieces of M 2 : 



M® = P,-M 2 P ; - , (no sum on i) 



That M® commutes with Mi follows from the orthogonality of p: 



[M w , Mi ] = ^ Aj[Mf , Pj] - 0 . (1.26) 



Now the characteristic equation for M® (if nontrivial) can be used to decompose 
Vi subspace. 
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An invariant matrix M induces a decomposition only if its diagonalized form 
(1.15) has more than one distinct eigenvalue; otherwise it is proportional to the 
unit matrix and commutes trivially with all group elements. A rep is said to be 
irreducible if all invariant matrices that can be constructed are proportional to the 
unit matrix. 

According to (1.13), an invariant matrix M commutes with group transforma- 
tions [G, M] = 0. Projection operators (1.17) constructed from M are polynomials 
in M, so they also commute with all g € Q: 

[G,Pi] = 0 (1.27) 

Hence, a [dxd] matrix rep can be written as a direct sum of [4x<f,] matrix reps: 

G = 1G1 = J] P ; GP y = PGP < = Z G ' ■ (L28) 

i,j i i 

In the diagonalized rep (1.18), the matrix g has a block diagonal form: 
gi 0 0 



CgC^ 



0 g 2 0 

0 o '•. 



^C'g/Q. (1.29) 



The rep g,- acts only on the ^-dimensional subspace V; consisting of vectors pg, 
q e V. In this way an invariant [dxd] hermitian matrix M with r distinct eigenval- 
ues induces a decomposition of a J-dimensional vector space V into a direct sum 
of ^-dimensional vector subspaces Vf. 



V ^ Vi©V 2 ©...©Vr. (1.30) 



1.3 Lattice derivatives 



In order to set up continuum field-theoretic equations which describe the evolution 
of spatial variations of fields, we need to define lattice derivatives. 

Consider a smooth function <p{x) evaluated on an infinite J-dimensional lattice 



(p e = (p{x), x = at = lattice point , £eZ d , (1.31) 



where a is the lattice spacing. Each set of values of <p{x) (a vector (ft) is a 
possible lattice configuration. Assume the lattice is hyper-cubic, and let e 
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\h\,fi2, • • • , hd) be the unit lattice cell vectors pointing along the d positive direc- 
tions. The lattice derivative is then 



<p(x + ahu) - (f>(x) <$>Uhn ~ <Pf 



(1-32) 



Anything else with the correct a —> 0 limit would do, but this is the simplest 
choice. We can rewrite the lattice derivative as a linear operator, by introducing 
the stepping operator in the direction ji 



(<7>) = CW W ■ 



(1.33) 



As cr will play a central role in what follows, it pays to understand what it does. 

In computer dicretizations, the lattice will be a finite <i-dimensional hyper- 
cubic lattice 



x = at- lattice point , I e (Z/Nf , 



(1.34) 



where a is the lattice spacing and there are points in all. For a hyper-cubic 
lattice the translations in different directions commute, cr^o-y = o-ycr^, so it is 
sufficient to understand the action of (1.33) on a 1-dimensional lattice. 

Let us write down cr for the 1-dimensional case in its full [NxN] matrix glory. 
Writing the finite lattice stepping operator (1.33) as an 'upper shift' matrix, 



^0 1 
0 1 
0 1 



cr = 



0 1 

0 



(1.35) 



is no good, as cr so defined is nilpotent, and after N steps nothing is left, = 0. 
A sensible way to approximate an infinite lattice by a finite one is to replace it by a 
lattice periodic in each h M direction. On a periodic lattice every point is equally far 
from the "boundary" N/2 steps away, the "surface" effects are equally negligible 
for all points, and the stepping operator acts as a cyclic permutation matrix 



^0 1 
0 1 
0 1 



cr 



0 1 

0. 



(1.36) 
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with ' 1 ' in the lower left corner assuring periodicity. 

Applied to the lattice configuration cp - (fa, fa, • • • , fay), the stepping operator 
translates the configuration by one site, cr<p = (fa, fa, ■ ■ ■ , fay, fa\). Its transpose 
translates the configuration the other way, so the transpose is also the inverse, 
cr~ ! - cr T . The partial lattice derivative (1.32) can now be written as a multiplica- 
tion by a matrix: 



d fJ fa = ^(cr ll -l) (j fa 



In the 1 -dimensional case the [NxN] matrix representation of the lattice deriva- 
tive is: 



d= - 

a 



<-\ 1 

-1 1 

-1 1 



1 



1 

-1 



(1.37) 



To belabor the obvious: On a finite lattice of N points a derivative is simply a 
finite [./VX./V] matrix. Continuum field theory is a world in which the lattice is so 
fine that it looks smooth to us. Whenever someone calls something an "operator," 
think "matrix." For finite-dimensional spaces a linear operator is a matrix; things 
get subtler for infinite-dimensional spaces. 



1.3.1 Lattice Laplacian 

In the continuum, integration by parts moves d around, <jf • d 2 <p —> -dfa 7 ■ dfa, on 
a lattice this amounts to a matrix transposition 



[(ay, - l) faf ■ [(a-, -\)fa} = <p T - (<r? - l) (a, - l) 



If you are wondering where the "integration by parts" minus sign is, it is there in 
discrete case at well. It comes from the identity 



a V / n 



(cr-1) = -cr-'d. 



The symmetric (self-adjoint) combination A = -tf d 



A = -^t « - 1) K - 1 ) = -^Z( 1 - \(< + *■#«>) (1.38) 

u=l u=l ' ' 
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is the lattice Laplacian. We shall show below that this Laplacian has the correct 
continuum limit. In the 1 -dimensional case the [NxN] matrix representation of 
the lattice Laplacian is: 



-2 
1 



-2 1 



1 \ 



(1-39) 



The lattice Laplacian measures the second variation of a field <fc across three 
neighboring sites: it is spatially non-local. You can easily check that it does what 
the second derivative is supposed to do by applying it to a parabola restricted to the 
lattice, (p[ = (p(a£), where <p{a€) is defined by the value of the continuum function 
<p(x) = x 2 at the lattice point X( = a£. 



1.3.2 Inverting the Laplacian 

Evaluation of perturbative corrections in (28.51) requires that we come to grips 
with the "free" or "bare" propagator M. While the the Laplacian is a simple 
difference operator (1.39), the propagator is a messier object. A way to compute 
is to start expanding the propagator M as a power series in the Laplacian 




(1.40) 



As A is a finite matrix, the expansion is convergent for sufficiently large m 2 . To 
get a feeling for what is involved in evaluating such series, evaluate A 2 in the 
1 -dimensional case: 





f 6 


-4 


1 




1 


-4\ 




-4 


6 


-4 1 






1 


1 


1 


-4 


6 -4 


1 








1 


1 


-4 '•• 




6 


1 

-4 




,-4 


1 




1 


-4 


6 J 



(1.41) 



What A 3 , A 4 , • • • contributions look like is now clear; as we include higher and 
higher powers of the Laplacian, the propagator matrix fills up; while the inverse 
propagator is differential operator connecting only the nearest neighbors, the prop- 
agator is integral, non-local operator, connecting every lattice site to any other lat- 
tice site. In statistical mechanics, M is the (bare) 2-point correlation. In quantum 
field theory, it is called a propagator. 
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These matrices can be evaluated as is, on the lattice, and sometime it is eval- 
uated this way, but in case at hand a wonderful simplification follows from the 
observation that the lattice action is translationally invariant. We will show how 
this works in sect. 1.4. 



1.4 Periodic lattices 

Our task now is to transform M into a form suitable to explicit evaluation. 

Consider the effect of a lattice translation <f> —> cr<p on the matrix polynomial 



As M is constructed from cr and its inverse, M~ and cr commute, and the 
function S [crcp] is invariant under translations, 



If a function (in this case, the function 5 [0]) defined on a vector space (in this case, 
the configuration (p) commutes with a linear operator cr, then the eigenvalues of cr 
can be used to decompose the <p vector space into invariant subspaces. For a hyper- 
cubic lattice the translations in different directions commute, o^cr y = crycr^, so it 
is sufficient to understand the spectrum of the 1 -dimensional stepping operator 
(1.36). To develop a feeling for how this reduction to invariant subspaces works in 
practice, let us continue humbly, by expanding the scope of our deliberations to a 
lattice consisting of 2 points. 

1.4.1 A 2-point lattice diagonalized 

The action of the stepping operator cr (1.36) on a 2-point lattice <p = ((p\,<p2) is to 
permute the two lattice sites 





(1.42) 




As exchange repeated twice brings us back to the original configuration, o 2 = 1, 
the characteristic polynomial of cr is 



(tr + l)(o--l) = 0, 
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with eigenvalues Aq - l,A\ - -1. Construct now the symmetrization, antisym- 
metrization projection operators 



Po 
Pi 



o~ — A\l 1 1/1 1 

-^ = 2 a + (r) = 2\l 1 



CT 



1 - 1 



-( 1 

2\-l 



(1.43) 
(1.44) 



Noting that P$ + P\ - 1, we can project the lattice configuration 0 onto the two 
eigenvectors of cr: 



0 = 10 - P o -0 + Pi • 

'01 \ (01 + 02) 

> 02 / V2 V2 

= 0o«O + 01"! • 



*(!)■ 



(01 - 02) 1 / 1 



a/2 V2 V" 1 



(1.45) 
(1.46) 



As PoPi = 0, the symmetric and the antisymmetric configurations transform sep- 
arately under any linear transformation constructed from cr and its powers. 

In this way the characteristic equation o 2 = 1 enables us to reduce the 2- 
dimensional lattice configuration to two 1 -dimensional ones, on which the value 
of the stepping operator cr is a number, A e {1,-1}, and the eigenvectors are % = 
4=(1, 1). n\ = -4=(1, -1). We have inserted V2 factors for convenience, in order 
that the eigenvectors be normalized unit vectors. As we shall now see, (0o, 0i) is 
the 2-site periodic lattice discrete Fourier transform of the field (0, 02)- 



1.5 Discrete Fourier transforms 



Let us generalize this reduction to a 1 -dimensional periodic lattice with ,/V sites. 

Each application of cr translates the lattice one step; in Af steps the lattice is 
back in the original configuration 

. o ° ° o 3 
o X° 2 
o h oi 

O O 0 

CT — 1 o o N -!, 

k° ° 
O O N " 2 

■ O O 

so the eigenvalues of cr are the ,/V distinct Af-th roots of unity 



N-l 



cr 



N 



k=0 



(T 



c l) -o, 



oj = e N . 



(1.47) 



As the eigenvalues are all distinct and N in number, the space is decomposed 
into ,/V 1-dimensional subspaces. The general theory (expounded in appendixl.2) 
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associates with the k-th eigenvalue of cr a projection operator that projects a con- 
figuration <f) onto k-th eigenvector of cr, 



CT 



J*k 



A k - Aj 



(1.48) 



A factor (cr - Ajt) kills the 7-th eigenvector <pj component of an arbitrary vector 
in expansion <p - • • • + <pj<pj + The above product kills everything but the 
eigen-direction (p k , and the factor Ylj^ k (Ak _ ^j) ensures that P k is normalized as 
a projection operator. The set of the projection operators is complete, 



(1.49) 



and orthonormal 



PkPj = SkjPk (no sum on k) . 



(1.50) 



Constructing explicit eigenvectors is usually not a the best way to fritter one's 
youth away, as choice of basis is largely arbitrary, and all of the content of the 
theory is in projection operators. However, in case at hand the eigenvectors are 
so simple that we can forget the general theory, and construct the solutions of the 
eigenvalue condition 



(Tifk 



OJ k (f k 



(1.51) 



by hand: 



f0 1 
0 1 
0 1 



0 1 
0 



\ 


r 1 1 




( 1 "1 












co k 




to k 




u? k 


-J* 1 


co 2k 




co 3k 


- OJ 


o? k 


/ 









The 1 / ^[N factor is chosen in order that <pk be normalized complex unit vectors 



<P k -fk 



1 N ~ l 
N ^ 



k=Q 



—=(1,cj k ,o) 2k ,---,co 



(no sum on k) 

(N-l)k^ _ 



(1.52) 
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The eigenvectors are orthonormal 



tfl ■ tpj = 5 kj , (1.53) 



as the explicit evaluation of ip^ • tpj yields the Kronecker delta function for a peri- 
odic lattice 




Sk i = nTj ^ {k ~ S)C ^TilY*^ ' • (1-54) 

f-0 



The sum is over the N unit vectors pointing at a uniform distribution of points on 
the complex unit circle; they cancel each other unless k = j (mod N), in which 
case each term in the sum equals 1. 

The projection operators can be expressed in terms of the eigenvectors (1.51), 
(1.52) as 



The completeness (1.49) follows from (1.54), and the orthonormality (1.50) from 
(1.53). 

fa, the projection of the <p configuration on the &-th subspace is given by 



(Pk ■ 4>)e = fa (<Pk)t , (no sum on k) 

l N-l 

fa = 4 ■ <P = ~j= J] e " fkC( !>? ( L56 ) 



We recognize fa as the discrete Fourier transform of fy. Hopefully rediscovering 
it this way helps you a little toward understanding why Fourier transforms are full 
of e lxp factors (they are eigenvalues of the generator of translations) and when 
are they the natural set of basis functions (only if the theory is translationally 
invariant). 



1.5.1 Fourier transform of the propagator 

Now insert the identity £ = 1 wherever profitable: 

m=1m1= Tj PkMPk ' = Z • M ■ 
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The matrix 

M W = {ffl ■ M • tpu) (1.57) 

is the Fourier space representation of M. According to (1.53) the matrix Uu = 
(<fk)e - ~^ e '~ kC * s a un i tar y matrix, so the Fourier transform is a linear, unitary 
transformation, UU^ = YjPk = 1, with Jacobian det U - 1. The form of the 
invariant function (1.42) does not change under (f> —> fa transformation, and from 
the formal point of view, it does not matter whether we compute in the Fourier 
space or in the configuration space that we started out with. For example, the 
trace of M is the trace in either representation 

trM = Y, M ci^YjYj {PkMPk ' )u 

C kk' e 

= ^ Yj&^k ' M ' W'Mff'l')* = 2 6kk 'Mkk> = trM . 

kk' C kk' 

From this it follows that trM" - trM", and from the tr In = lntr relation that 
det M = det M. In fact, any scalar combination of fas, /'s and couplings, such as 
the partition function Z[J], has exactly the same form in the configuration and the 
Fourier space. 

OK, a dizzying quantity of indices. But what's the payback? 



1.5.2 Lattice Laplacian diagonalized 

Now use the eigenvalue equation (1.51) to convert cr matrices into scalars. If M 
commutes with cr, then Qpl • M • (p^) = Mk^kk' , and the matrix M acts as a multipli- 
cation by the scalar on the kth subspace. For example, for the 1-dimensional 
version of the lattice Laplacian (1.38) the projection on the k-th subspace is 

= H c ° s (%")-'h <L58) 

In the k-th subspace the bare propagator is simply a number, and, in contrast to 
the mess generated by (1.40), there is nothing to inverting M~ l : 

((ft-M- av) = (Go)k(W - f , r , (1.59) 

^<-S^ =1 (cos(^)-l) 

where k = (ki, ■ ■ ■ , k^) is a J-dimensional vector in the A^-dimensional dual 
lattice. 
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Figure 1.1: Symmetries of four disks on a square. A 
fundamental domain indicated by the shaded wedge. 




Going back to the partition function (28.51) and sticking in the factors of 1 
into the bilinear part of the interaction, we replace the spatial Jg by its Fourier 
transform J^, and the spatial propagator {M\{> by the diagonalized Fourier trans- 
formed (Go)* 

J T -M-J = ^(J T ■ 9k)(<fil ■ M ■ <p w )(<pl -J) = Yj J ~l(G 0 ) k J k . (1.60) 

k,k' k 

1.6 C\ v factorization 

If an Af-disk arrangement has Cn symmetry, and the disk visitation sequence is 
given by disk labels {6162.63 . . .}, only the relative increments p, = e,+i - e,- mod N 
matter. Symmetries under reflections across axes increase the group to C^ v and 
add relations between symbols: {§} and {N - e,} differ only by a reflection. As a 
consequence of this reflection increments become decrements until the next reflec- 
tion and vice versa. Consider four equal disks placed on the vertices of a square 
(figure 1.1). The symmetry group consists of the identity e, the two reflections cr x , 
<r y across x, y axes, the two diagonal reflections cr^, 0-24, and the three rotations 
C4, C2 and by angles n/2, n and 3n/2. We start by exploiting the C4 subgroup 
symmetry in order to replace the absolute labels q e {1,2,3,4} by relative incre- 
ments pi e {1,2, 3}. By reflection across diagonals, an increment by 3 is equivalent 
to an increment by 1 and a reflection; this new symbol will be called J. Our con- 
vention will be to first perform the increment and then to change the orientation 
due to the reflection. As an example, consider the fundamental domain cycle 1 12. 
Taking the disk 1 —> disk 2 segment as the starting segment, this symbol string 
is mapped into the disk visitation sequence 1+12+13+21 . . . = 123, where the sub- 
script indicates the increments (or decrements) between neighboring symbols; the 
period of the cycle 1 12 is thus 3 in both the fundamental domain and the full space. 
Similarly, the cycle J_12 will be mapped into l+i2_i l_23-i2+i3+2l = 121323 
(note that the fundamental domain symbol J corresponds to a flip in orientation af- 
ter the second and fifth symbols); this time the period in the full space is twice that 
of the fundamental domain. In particular, the fundamental domain fixed points 
correspond to the following 4-disk cycles: 



appendSymm - 22sep2010 



ChaosBook.org version 14.4. 1 , Apr 212013 



APPENDIX I. DISCRETE SYMMETRIES OF DYNAMICS 



880 



Table LI: C4,, correspondence between the ternary fundamental domain prime cycles p 
and the full 4-disk {1,2,3,4} labeled cycles p, together with the C$ v transformation that 
maps the end point of the p cycle into an irreducible segment of the p cycle. For typo- 
graphical convenience, the symbol l_of sect. 1.6 has been replaced by 0, so that the ternary 
alphabet is {0, 1,2). The degeneracy of the p cycle is m p — %npjn p . Orbit 2 is the sole 
boundary orbit, invariant both under a rotation by ji and a reflection across a diagonal. 
The two pairs of cycles marked by (a) and (b) are related by time reversal, but cannot be 
mapped into each other by C\ v transformations. 



p 


P 


hp 


P 


P 


K 


0 


12 




0001 


12121414 


0"24 


1 


1234 


c 4 


0002 


12124343 


(Ty 


2 


1 3 


C2, 0"13 


0011 


1212 3434 


c 2 


01 


1214 


0"24 


0012 


1212414134342323 


c 3 


02 


1243 


(Ty 


0021 (a) 


1213414234312324 


c 3 

L - 4 


12 


12413423 


c 3 


0022 


1213 


e 


001 


121232343414 




0102 (a) 


1214232134324143 




002 


121343 


c 2 


0111 


12143234 


0"13 


011 


121434 


(Ty 


0112(&) 


12142123 


cr x 


012 


121323 


cr 13 


0121 (b) 


12132124 


cr x 


021 


124324 


cr 13 


0122 


12131413 


cr 24 


022 


124213 


<T X 


0211 


1243 2134 


cr x 


112 


123 


e 


0212 


1243 1423 


cr 24 


122 


124231342413 


c 4 


0221 


1242 1424 


0"24 








0222 


12424313 


(Ty 








1112 


1234234134124123 


c 4 








1122 


12313413 


c 2 








1222 


1242413134242313 


L 4 



4-disk reduced 

12 o 1 
1234 «-» 1 

13 <-> 2 



Conversions for all periodic orbits of reduced symbol period less than 5 are listed 
in table 1.1. 

This symbolic dynamics is closely related to the group-theoretic structure 
of the dynamics: the global 4-disk trajectory can be generated by mapping the 
fundamental domain trajectories onto the full 4-disk space by the accumulated 
product of the C 4v group elements g\ = C, g 2 = C 2 , gi = cr diag C = o- axis , 
where C is a rotation by n/2. In the 112 example worked out above, this yields 
gin = g2gig}_ = C 2 C(Taxis = Cdiag, listed in the last column of table 1.1. Our 
convention is to multiply group elements in the reverse order with respect to the 
symbol sequence. We need these group elements for our next step, the dynamical 
zeta function factorizations. 

The C\ v group has four 1 -dimensional representations, either symmetric (A\) 
or antisymmetric (A2) under both types of reflections, or symmetric under one and 
antisymmetric under the other (B\, B2), and a degenerate pair of 2-dimensional 
representations E. Substituting the Q v characters 
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Figure 1.2: Symmetries of four disks on a rectangle. 
A fundamental domain indicated by the shaded wedge. 






A\ A 2 Bi 


B 2 E 


e 




1 2 


c 2 




1 -2 


c 4 ,cl 




-1 0 


@~axes 




-1 0 






1 0 



into (2 1 . 1 5) we obtain: 



hp 






Ai 


A 2 


Bi 


B 2 


e: 


(1 


- tpf = 


(1 - tp) 


(1 - tp) 


(1 - tp) 


(1 - tp) 


C 2 : 


(1 


~¥ = 


(1 - t p ) 


(1 - tp) 


(1 - tp) 


(1 - f/>) 


C4, C^: 


(1 


-¥ = 
-|) 4 = 


(1 - tp) 


(1 - tp) 


(1 + tp) 


(1 + tp) 


@~axes ■ 


(1 


(1 - tp) 


(1 + tp) 


(1 - tj) 


(1 + tp) 


0~diag ■ 


(1 


(1 - tp) 


(1 + tp) 


(1 + t p ) 


(1 - f/d 



(1 - tp) 4 
(1 + tpf 
(1 + ^) 2 
(1 - & 



(1 



f\2 



5) 



The possible irreducible segment group elements hp are listed in the first column; 
o- axes denotes a reflection across either the x-axis or the y-axis, and Odi ag denotes a 
reflection across a diagonal (see figurel.l). In addition, degenerate pairs of bound- 
ary orbits can run along the symmetry lines in the full space, with the fundamental 
domain group theory weights h p = (C 2 + cr x )/2 (axes) and h p = (C 2 + o" 13 )/2 (di- 
agonals) respectively: 

Ai A 2 fii B 2 E 
axes: (1 - tp 2 - (1 - t p ){\ - 0t p ){\ - tp){\ - 0t p ){\ + t p ) 2 
diagonals: (1 - rj) 2 - (1 - t p )(l - Ot p )(l - Ot p )(l - t p ){\ + tp) 2 (1.61) 

(we have assumed that t p does not change sign under reflections across symmetry 
axes). For the 4-disk arrangement considered here only the diagonal orbits 13, 24 
occur; they correspond to the 2 fixed point in the fundamental domain. 

The A 1 subspace in C 4v cycle expansion is given by 



i/<r Al = (i-fo)(i-*i)(i-* 2 )(i-*oi)(i-te)(i-*i 2 ) 
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(1 - ?ooi)(l - '002X1 - '011XI - '012XI - - '022XI - '112) 

(1 - ?122)(1 - ?000l)(l - '0002)(1 _ ? 001l)(l ~ ?0012)(1 ~ '002l) ■ ■ ■ 

= l-t 0 -ti-t 2 - Coi - toh ) - ('02 ~ ht 2 ) - (hi - 'l' 2 ) 
~('001 _ 'o'oi) - ('002 - 'o'02) _ ('oil - 'l'oi) 
~('022 - ' 2 '02) - (?112 - 'l'l 2 ) ~ ('l22 - hhl) 

-{ton + '021 + fo'1'2 - tot 12 - 'i'o2 - '2'oi) • • • (1-62) 

(for typographical convenience, \ is replaced by 0 in the remainder of this sec- 
tion). For 1 -dimensional representations, the characters can be read off the symbol 
strings: XA2 (hp) - (-1)"°, *fi,(hp) = ^ 2 (h p ) = (-1)"» + "', where n 0 and 

n\ are the number of times symbols 0, 1 appear in the p symbol string. For Bi all 
t p with an odd total number of 0's and l's change sign: 

i/£b 2 = a + ?oXi + *ixi - t 2 )a - t 0 i)a + to 2 )a + tn) 

(1 + fooiXl - '002XI + '011XI - '012XI - '021XI + '022XI - '112) 
(1 + f m )(l - <000l)(l + '0002)(1 - '0011XI + '0012XI + '0021) ■ ■ ■ 
= l+t 0 + h-t 2 - («oi - fo'i) + (tea - toh) + (tn - ht 2 ) 

+('001 _ 'o'oi) - ('002 - 'o'02) + ('oil - 'l'oi) 
+('022 - ' 2 '02) - (?112 - 'l'l 2 ) + ('l22 _ '2*12) 

~('012 + '021 + 'o'l'2 - ttfn - 'l'02 - ? 2 '0l) • ■ • (1.63) 

The form of the remaining cycle expansions depends crucially on the special role 
played by the boundary orbits: by (1.61) the orbit t% does not contribute to A 2 and 
Bx, 

l/U 2 = (l + foXi - fiXi + foiXi + teXi - *i2) 

(1 - ? 0 ()l)(l - ?002)(1 + f01l)(l + '012X1 + telXl + '022)(1 - 'm) 
(1 - ?122)(1 + 'oOOlXl + '0002)(1 - 'oOllXl _ '0012)(1 - '002l) ■ ■ ■ 
= 1 +?0-'l +('01 -'0'l) + '02-'l2 

-('001 _ 'o'oi) - ('002 - 'o'02) + ('Oil - 'l'oi) 

+'022 - '122 - ('ll2 - 'l'l2) + ('012 + '021 - 'o'l2 - 'l'02) ■ ■ ■ (1-64) 

and 

= (1 -'o)(l +'i)(l + 'oi)(l -'02XI +'12) 

(1 + fooiXl - '002)(1 - '01l)(l + '012)(1 + '02l)(l - '022)(1 - 'll 2 ) 
(1 + '122X1 + '000l)(l - '0002X1 - 'OOllXl + '0012X1 + '002l) ■ ■ ■ 
- 1 - to + h + ('01 - toh) ~ t02 + '12 

+('001 _ 'o'oi) - ('002 - 'o'02) - ('Oil - 'l'oi) 

-'022 + '122 - ('112 - hhz) + (ton + '021 - 'o'i2 - 'i'o 2 ) ■ ■ ■ (1-65) 
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In the above we have assumed that *2 does not change sign under C^ v reflections. 
For the mixed-symmetry subspace E the curvature expansion is given by 

1/& = l+f2 + (-f0 2 +?l 2 ) + (2f002 -^0 2 -2?H2 + ?2?l 2 ) 

+(2*0011 _ 2?0022 + 2*2*002 - ?01 2 - *02 2 + 2*1122 ~ 2*2*112 
+*12 2 - *0 2 *1 2 ) + (2*00002 - 2*00112 + 2*2*0011 - 2*00121 - 2*00211 
+2*00222 - 2*2*0022 + 2*01012 + 2*01021 ~ 2*01102 ~ *2*01 2 + 2*02022 
-*2*02 2 + 2*11112 - 2*11222 + 2*2*1122 ~ 2*12122 + *2*12 2 ~ *2*0 2 *1 2 

+2*oo 2 (-*o 2 + *i 2 ) - 2* n2 (-*o 2 + *i 2 )) (1-66) 



A quick test of the £ = £4 1 £4 2 £bi£b2<Te factorization is afforded by the topo- 
logical polynomial; substituting t p - z Hp into the expansion yields 

= 1 - 3z , = - 1 . = 1/& = 1 + z , 

in agreement with (15.46). 



exercise 20.9 



1.7 C2 V factorization 



An aiTangement of four identical disks on the vertices of a rectangle has Q v sym- 
metry (figure I.2b). C% v consists of {e, cr x , o~ y , C2}, i.e., the reflections across the 
symmetry axes and a rotation by n. 

This system affords a rather easy visualization of the conversion of a 4-disk 
dynamics into a fundamental domain symbolic dynamics. An orbit leaving the 
fundamental domain through one of the axis may be folded back by a reflection 
on that axis; with these symmetry operations go = cr x and gi = cr y we asso- 
ciate labels 1 and 0, respectively. Orbits going to the diagonally opposed disk 
cross the boundaries of the fundamental domain twice; the product of these two 
reflections is just C2 = o- x <T y , to which we assign the label 2. For example, a 
ternary string 0010201... is converted into 12143123. . ., and the associated 
group-theory weight is given by . . . gigog2gogigogo- 

Short ternary cycles and the corresponding 4-disk cycles are listed in tablel.2. 
Note that already at length three there is a pair of cycles (012 = 143 and 021 = 142) 
related by time reversal, but not by any Ci v symmetries. 

The above is the complete description of the symbolic dynamics for 4 suf- 
ficiently separated equal disks placed at corners of a rectangle. However, if the 
fundamental domain requires further partitioning, the ternary description is in- 
sufficient. For example, in the stadium billiard fundamental domain one has to 
distinguish between bounces off the straight and the curved sections of the bil- 
liard wall; in that case five symbols suffice for constructing the covering symbolic 
dynamics. 



appendSymm - 22sep2010 



ChaosBook.org version 14.4. 1 , Apr 212013 



APPENDIX I. DISCRETE SYMMETRIES OF DYNAMICS 



884 



Table 1.2: Civ correspondence between the ternary {0, 1 , 2} fundamental domain prime cy- 
cles p and the full 4-disk { 1 ,2,3,4} cycles p, together with the C 2 V transformation that maps 
the end point of the p cycle into an irreducible segment of the p cycle. The degeneracy of 
the p cycle is m p = 4rip/n p . Note that the 012 and 021 cycles are related by time reversal, 
but cannot be mapped into each other by Civ transformations. The full space orbit listed 
here is generated from the symmetry reduced code by the rules given in sect. 1.7, starting 
from disk 1 . 



p 


P 


g 


P 


P 


g 


0 


14 


(Ty 


0001 


14143232 


c 2 


1 


1 2 




0002 


14142323 


cr x 


2 


13 


c 2 


0011 


1412 


e 


01 


14 32 


c 2 


0012 


14124143 


O-y 


02 


1423 




0021 


14134142 


O-y 


12 


1243 


(Ty 


0022 


1413 


e 


001 


141232 


cr x 


0102 


14324123 


O-y 


002 


141323 


C 2 


0111 


14343212 


c 2 


011 


143412 


(Ty 


0112 


14342343 


cr x 


012 


143 


e 


0121 


14312342 


o- x 


021 


142 


e 


0122 


14313213 


c 2 


022 


142413 


O-y 


0211 


14212312 


O-.v 


112 


121343 


c 2 


0212 


1421 3243 


c 2 


122 


124213 




0221 


14243242 


c 2 








0222 


14242313 


<T X 








1112 


12124343 


(Ty 








1122 


1213 


e 








1222 


12424313 


O-y 



The group C 2v has four 1 -dimensional representations, distinguished by their 
behavior under axis reflections. The A\ representation is symmetric with respect 
to both reflections; the A2 representation is antisymmetric with respect to both. 
The B\ and B 2 representations are symmetric under one and antisymmetric under 
the other reflection. The character table is 



c 2v 


Ai 


A 2 


Bi 


B 2 


e 


1 


1 


1 


1 


c 2 


1 


1 


-1 


-1 


cr x 


1 


-1 


1 


-1 


(Ty 


1 


-1 


-1 


1 



Substituted into the factorized determinant (21.14), the contributions of peri- 
odic orbits split as follows 



Sp 

e: (l-tpf = 

C 2 . (l-t 2 ) 2 = 

v x : (l-tp 2 = 

o- y : (l-tj) 2 = 



A 


1 


A 2 


(1- 


tp) 


(1 - tp) 


(1- 


tp) 


(1 - tp) 


(1- 


tp) 


(1 + tp) 


(1- 


tp) 


(1 + tp) 



B\ B 2 

(1-tp) (1-tp) 

a-tp) a-tp) 

(i - tp) (i + tp) 

(i + tp) (i - tp) 



Cycle expansions follow by substituting cycles and their group theory factors from 
table 1.2. For A\ all characters are +1, and the corresponding cycle expansion is 
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given in (1.62). Similarly, the totally antisymmetric subspace factorization Ai is 
given by (1.63), the B 2 factorization of C$ v . For B\ all t p with an odd total number 
of O's and 2's change sign: 

= (l+t 0 )(l-h)(l+t 2 )(l+toi)(l-to2)a+tn) 

(1 - t m )(l + + f01l)(l - ?012)(1 - ?02l)(l + ?022)(1 + hn) 

(1 - f 122 )(l + f000l)(l - ^0002)(1 - ?001l)(l + ?0012)(1 + fo()2l) ■ • ■ 

= 1 + t 0 - ti + t 2 + (t 0 i - hh) - (t 02 - t 0 t 2 ) + (tn - ht 2 ) 

_ (f001 _ ?0?0l) + (?002 - toto2) + (toil - Vol) 

+(?022 - tefta) + (hn - hh 2 ) - (hn - hhi) 

-(ton + t02\ + tot\t2 - tot 12 - t\to2 - t2to\) ■ ■ ■ (1-67) 

For B 2 all t p with an odd total number of l's and 2's change sign: 

l/6? 2 = (l-?0)(l+?l)(l+^2)(l+?0l)(l+?02)(l-?12) 

(1 + ?00l)(l + ?002)(1 - ?01l)(l - *012)(1 - ?02l)(l - ?022)(1 + hn) 
(1 + ?122>(1 + ?000l)(l + %02)(1 - ?001l)(l - ?0012)(1 ~ ?002l) ■ ■ ■ 



Note that all of the above cycle expansions group long orbits together with their 
pseudo-orbit shadows, so that the shadowing arguments for convergence still ap- 
ply. 

The topological polynomial factorizes as 



consistent with the 4-disk factorization (15.46). 

1.8 Henon map symmetries 

We note here a few simple symmetries of the Henon map (3.17). For HO the 
Henon map is reversible: the backward iteration of (3.18) is given by 



= 1 - to + h + h + (to\ - toh) + (f 0 2 - toh) - (t\ 2 - ht 2 ) 



+(?001 _ totoi) + (?002 - toto2) - (ton - t\to\) 
~(tQ22 - ?2?02) + (t\\2 ~ t\t\2) + (t\22 - hhl) 
~(to\2 + t02\ + tot\t2 - tot\2 - t\to2 ~ t 2 to\) • ■ 



(1.68) 




£a 2 (b, £b 2 



L _L _L 



x n -l = -t(1 ~ax 2 n -x n+ i). 



(1-69) 
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Hence the time reversal amounts to b — > \/b, a —> a/b 2 symmetry in the param- 
eter plane, together with x —> -x/b in the coordinate plane, and there is no need 
to explore the (a, b) parameter plane outside the strip b e {— 1, 1}. For b = -1 the 
map is orientation and area preserving , 



the backward and the forward iteration are the same, and the non-wandering set 
is symmetric across the x n+ \ = x n diagonal. This is one of the simplest models of 
a Poincare return map for a Hamiltonian flow. For the orientation reversing b = 1 
case we have 



and the non-wandering set is symmetric across the x, r +\ = -x n diagonal. 



Remark 1.1 Literature This material is covered in any introduction to linear alge- 
bra [1,2, 3] or group theory [16, 2]. The exposition given in sects. 1.2.1 and 1.2.2 is taken 
from refs. [23, 24, 5]. Who wrote this down first we do not know, but we like Harter's 
exposition [25, 26, 1 1] best. Harter's theory of class algebrasoffers a more elegant and 
systematic way of constructing the maximal set of commuting invariant manices M , than 
the sketch offered in this section. 

Remark 1.2 Labeling conventions While there is a variety of labeling conventions [ 19, 
25] for the reduced C4,, dynamics, we prefer the one introduced here because of its close 
relation to the group-theoretic structure of the dynamics: the global 4-disk trajectory can 
be generated by mapping the fundamental domain trajectories onto the full 4-disk space 
by the accumulated product of the C4,. group elements. 

Remark 1.3 C2 V symmetry Ci v is the symmetry of several systems studied in the 
literature, such as the stadium billiard [10], and the 2-dimensional anisonopic Kepler 
potential [3]. 




(1.70) 



x n -\ — 1 dX n + X, 1+ [ , 



(1.71) 



Commentary 



Exercises 
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e 


a 


b 


c 


d 


f 


e 


e 


a 


b 


c 


d 


f 


a 


a 


e 


d 


b 


f 


c 


b 


b 


d 


e 


f 


c 


a 


c 


c 


b 


f 


e 


a 


d 


d 


d 


f 


c 


a 


e 


b 


f 


f 


c 


a 


d 


b 


e 



1. Show that (9.29) has two equilibria: 



(ro,Zo) = (0,0), Go undefined 



(r u e u zi) = (^2b(p-l),n/4,p-(a.l2) 



1.2. 



1.3. 



describes a group. Or does it? (Hint: check whether this 
table satisfies the group axioms of appendix 1.1.) 

FromW.G. Harter[ll] 

Three coupled pendulums with a Co symmetry. 

Consider 3 pendulums in a row: the 2 outer ones of 
the same mass m and length I, the one midway of same 
length but different mass M, with the tip coupled to the 
tips of the outer ones with springs of stiffness k. Assume 
displacements are small, Xt/l «: 1. 

(a) Show that the acceleration matrix x = -a x is 







a + b 






—c 


. x 3 _ 




0 



—a 
2c + b 
—a 



0 

—c 
a + b 



where a = k/ml, c = k/Ml and b = g/l. 

(b) Check that [a, R] = 0, i.e., that the dynamics is 



invariant under C2 
outer pendulums, 



\e,R), where R interchanges the 



R 



0 0 1 

0 1 0 

1 0 0 



(c) Construct the corresponding projection operators P + 
and P_, and show that the 3-pendulum system decom- 
poses into a 1 -dimensional subspace, with eigenvalue 
(o/~') 2 = a + b, and a 2-dimensional subspace, with 
acceleration matrix (trust your own algebra, if it strays 
from what is stated here) 



»(+) 



a + b 
-V2c 



-y/2a 
c + b 



The exercise is simple enough that you can do it with- 
out using the symmetry, so: construct P+,P- first, use 
them to reduce a to irreps, then proceed with computing 
remaining eigenvalues of a. 

(d) Does anything interesting happen if M = ml 

The point of the above exercise is that almost always the 
symmetry reduction is only partial: a matrix representa- 
tion of dimension d gets reduced to a set of subspaces 
whose dimensions c/ (Q,) satisfy 2 d (a) = d. Beyond that, 
love many, trust few, and paddle your own canoe. 

FromW.G. Harter[ll] 

Lorenz system in polar coordinates: dynamics. 

(continuation of exercise 9.8) 



Verify numerically that the eigenvalues and eigen- 
vectors of the two equilibria are (we list here 
the precise numbers to help you check your pro- 
grams): 

EQi = (0, 12, 27) equilibrium: (and its C 1/2 - 
rotation EQ2) has one stable real eigenvalue 
A {1) = -13.854578, 

and the unstable complex conjugate pair 
^(2,3) = ^(2) ± i(j j2) = 0.093956 + /10.194505. 
The unstable eigenplane is defined by eigen- 
vectors 

Ree (2) = (-0.4955,-0.2010,-0.8450) 

Ime (2) = (0.5325,-0.8464,0) 

with period T = 2tt/w (2) = 0.6163306, 

radial expansion multiplier 

A r = exp(2^ (2) /w (2) ) = 1.059617, 

and the contracting multiplier 

A c = exp(2^ (1) /w {2) ) ~ 1.95686 x 10~ 4 

along the stable eigenvector of EQ 1 , 



»(3) 



(0.8557, -0.3298, -0.3988). 



EQo = (0, 0, 0) equilibrium: The stable eigen- 
vector e (1) = (0, 0, 1) of EQo, has contraction rate 
A&> =-b = -2.666 . . .. 
The other stable eigenvector is 
e (2) = (-0.244001,-0.969775,0), with contract- 
ing eigenvalue 

= —22.8277. The unstable eigenvector 
e< 3 > = (-0.653049, 0.757316,0) has eigenvalue 
A {3) = 11.8277. 



3. Plot the Lorenz strange attractor both in the 
Lorenz coordinates figure 2.5, and in the doubled- 
polar angle coordinates (9.24) for the Lorenz pa- 
rameter values cr — 10, £> = 8/3, p = 28. Topolog- 
ical^, does it resemble the Lorenz butterfly, the 
Rossler attractor, or neither? The Poincare sec- 
tion of the Lorenz flow fixed by the z-axis and the 
equilibrium in the doubled polar angle represen- 
tation, and the corresponding Poincare return map 
(s„, s n + 1) are plotted in figure 1 1.8. 

4. Construct the Poincare return map (s„, s„+i), 
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20 



^ 0 



-20 



-40 




-40 



where s is arc-length measured along the unstable 
manifold of EQq, lower Poincare section of fig- 
ure 11.8 (b). Elucidate its relation to the Poincare 
return map of figure 1 1.9. (plot by J. Halcrow) 

5. Show that if a periodic orbit of the polar represen- 
tation Lorenz is also periodic orbit of the Lorenz 
flow, their Floquet multipliers are the same. How 
do the Floquet multipliers of relative periodic or- 
bits of the representations relate to each other? 

6. What does the volume contraction formula (4.41) 
look like now? Interpret. 

1.4. Laplacian is a non-local operator. 

While the Laplacian is a simple tri-diagonal difference 
operator (1.39), its inverse (the "free" propagator of sta- 
tistical mechanics and quantum field theory) is a messier 
object. A way to compute is to start expanding propaga- 
tor as a power series in the Laplacian 



1 



t 2 l - A 



m 2 2_J rrfl" 



(1.73) 



As A is a finite matrix, the expansion is convergent for 
sufficiently large m 2 . To get a feeling for what is in- 
volved in evaluating such series, show that A 2 is: 





< 6 


-4 


1 


1 


-4\ 




-4 


6 


-4 1 






1 


1 


-4 


6 -4 1 










1 


-4 '■■ 














6 


-4 




,-4 


1 


1 


-4 


6 I 



1.5. 



What A 3 , A 4 , • • • contributions look like is now clear; as 
we include higher and higher powers of the Laplacian, 
the propagator matrix fills up; while the inverse propa- 
gator is differential operator connecting only the nearest 
neighbors, the propagator is integral operator, connect- 
ing every lattice site to any other lattice site. 

This matrix can be evaluated as is, on the lattice, and 
sometime it is evaluated this way, but in case at hand 
a wonderful simplification follows from the observation 
that the lattice action is translationally invariant, exer- 
cise 1.5. 



Lattice Laplacian diagonalized. Insert the iden- 
tity Yj P - 1 wherever you profitably can, and use 
the eigenvalue equation (1.5 1) to convert shift cr matrices 
into scalars. If M commutes with cr, then (ip^ ■ M ■ tp^) = 
M^dkk', and the matrix M acts as a multiplication by 
the scalar M (i) on the kth subspace. Show that for the 
1 -dimensional version of the lattice Laplacian (1.39) the 
projection on the kth subspace is 



.(1.74) 



1.6. 



t L 

((pi - A- iptf) = -r 



cos I — k \ - 1 

N 



Skk' 



(1.75) 



In the Mi subspace the propagator is simply a number, 
and, in contrast to the mess generated by (1.73), there is 
nothing to evaluating: 



4 



1 



m 2 l - A 



5kk> 



(may 



T (cos 2nk/N- 1) 



,(1.76) 



where k is a site in the A^-dimensional dual lattice, and 
a = L/N is the lattice spacing. 

Fix Predrag's lecture od Feb 5, 2008. Are the C 3 

frequencies on pp. 4,5 correct? If not, write the correct 
expression for the beat frequency. 
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Appendix J 



Convergence of spectral 
determinants 

J.l Curvature expansions: geometric picture 



If you has some experience with numerical estimates of fractal dimensions, you 
will note that the numerical convergence of cycle expansions for systems such 
as the 3-disk game of pinball, table 20.2, is very impressive; only three input 
numbers (the two fixed points 0, 1 and the 2-cycle 10) already yield the escape 
rate to 4 significant digits! We have omitted an infinity of unstable cycles; so why 
does approximating the dynamics by a finite number of cycles work so well? 

Looking at the cycle expansions simply as sums of unrelated contributions is 
not specially encouraging: the cycle expansion (20.2) is not absolutely convergent 
in the sense of Dirichlet series of appendix J.5, so what one makes of it depends 
on the way the terms are arranged. 

The simplest estimate of the error introduced by approximating smooth flow 
by periodic orbits is to think of the approximation as a tessellation of a smooth 
curve by piecewise linear tiles, figure 1.11. 

J. 1.1 Tessellation of a smooth flow by cycles 

One of the early high accuracy computations of n was due to Euler. Euler com- 
puted the circumference of the circle of unit radius by inscribing into it a regu- 
lar polygon with N sides; the error of such computation is proportional to 1 - 
cos(27r/7V) oc A^ 2 . In a periodic orbit tessellation of a smooth flow, we cover the 
phase space by e^'" tiles at the nth level of resolution, where h is the topological 
entropy, the growth rate of the number of tiles. Hence we expect the error in ap- 
proximating a smooth flow by ^ m linear segments to be exponentially small, of 
order N~ 2 oc e~ 2hn . 
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J.1.2 Shadowing and convergence of curvature expansions 

We have shown in chapter 15 that if the symbolic dynamics is defined by a finite 
grammar, a finite number of cycles, let us say the first k terms in the cycle expan- 
sion are necessary to correctly count the pieces of the Cantor set generated by the 
dynamical system. 

They are composed of products of non-intersecting loops on the transition 
graph, see (15.15). We refer to this set of non-intersecting loops as the. fundamen- 
tal cycles of the strange set. It is only after these terms have been included that 
the cycle expansion is expected to converge smoothly, i.e., only for n > k are the 
curvatures c n , a measure of the variation of the quality of a linearized covering of 
the dynamical Cantor set by the length n cycles, and expected to fall off rapidly 
with n. 

The rate of fall-off of the cycle expansion coefficients can be estimated by 
observing that for subshifts of finite type the contributions from longer orbits in 
curvature expansions such as (20.7) can always be grouped into shadowing com- 
binations of pseudo-cycles. For example, a cycle with itinerary ab= s^S2 • • • s n 
will appear in combination of form 

l/£= 1 (fab-tatb) , 

with ab shadowed by cycle a followed by cycle b, where a = sis%- • • s m , b = 
s m +\ • • • s n -\s n , and Sk labels the Markov partition M St (11.2) that the trajectory 
traverses at the kth return. If the two trajectories coincide in the first m symbols, 
at the mth return to a Poincare section they can land anywhere in the phase space 
M 

\f Ta (xa)-f Ta ~(x a j\*l, 

where we have assumed that the M is compact, and that the maximal possible 
separation across M is 0(1). Here x a is a point on the a cycle of period T a , and x a ,„ 
is a nearby point whose trajectory tracks the cycle a for the first m Poincare section 
returns completed at the time T a .... An estimate of the maximal separation of the 
initial points of the two neighboring trajectories is achieved by Taylor expanding 
around x a _ - x„ + Sx a ,„ 

f a (xa) ~ f' (.v,,..) * dfTa{x ° ) ■ 8x a ... = M a ■ 8x a ... , 

ox 

hence the hyperbolicity of the flow forces the initial points of neighboring trajec- 
tories that track each other for at least m consecutive symbols to lie exponentially 
close 

\Sx a J oc — . 
|Afl| 



appendConverg - I9nov2012 



ChaosBook.org versionl4.4. 1, Apr 21 2013 



APPENDIX J. CONVERGENCE OF SPECTRAL DETERMINANTS 



893 



Similarly, for any observable (17.1) integrated along the two nearby trajecto- 
ries 



A Ta ix a J * A T "{xi) + — 

ox 



■ Sx a „, , 



SO 



i T T I r o Const 

\A T °-{x a J-A T «{Xa)\oc a — , 

\A-a\ 



As the time of return is itself an integral along the trajectory, return times of nearby 
trajectories are exponentially close 

m , T a Const 
T a ... ~ T a \ ex a — , 
|A«I 

and so are the trajectory stabilities 

i T T I r fl Const 

\A Ta -{x a J-A T "{Xa)\^ a — , 

Substituting t a t one finds 



^ tatb - l _ e 'S(T a +T,,-T ab ) 
tab 



Kb 



Since with increasing m segments of ab come closer to a, the differences in action 
and the ratio of the eigenvalues converge exponentially with the eigenvalue of the 
orbit a, 

T a + T b - Tab ~ Const x K~ J , \A a A b /A ab \ « exp(-Const/A flZ) ) 



Expanding the exponentials one thus finds that this term in the cycle expansion is 
of the order of 



t a ib ~ tJaJ-lb ~ Const x taJbK, J ■ (J- 1) 

Even though the number of terms in a cycle expansion grows exponentially, the 
shadowing cancellations improve the convergence by an exponential factor com- 
pared to trace formulas, and extend the radius of convergence of the periodic orbit 
sums. Table J.l shows some examples of such compensations between long 
cycles and their pseudo-cycle shadows. 
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n 


ffli - t a tb 


Tab - (T a + T h ) 


log 


K* 




ab — a ■ b 


2 


-5.23465150784xl0 4 


4.85802927371x10- 




6.3xl0 2 


01-0-1 


3 


-7.96028600139xl0 6 


5.21713101432xl0 3 




9.8xl0 3 


001-0-01 


4 


-1.03326529874xl0 7 


5.29858199419xl0 4 




1.3xl0 3 


0001-0-001 


5 


-1.27481522016xl0 9 


5.35513574697xl0 5 




1.6xl0 4 


00001-0-0001 


6 


-1.52544704823x10" 


5.40999882625xl0 6 




1.8x10 s 


000001-0-00001 


2 


-5.23465150784xl0 4 


4.85802927371x10^ 




6.3x10- 


01-01 


3 


5.30414752996xl0 6 


-3.67093656690xl0 3 




7.7xl0 3 


011-01-1 


4 


-5.40934261680xl0 8 


3.14925761316xl0 4 




9.2xl0 4 


0111-011-1 


5 


4.99129508833x10 10 


-2.67292822795x10 s 




l.OxlO 4 


01111-0111-1 


6 


-4.39246000586x10 12 


2.270871 16266xl0 6 




1.0x10 s 


011111-01111-1 



Table J.l: Demonstration of shadowing in curvature combinations of cycle weights of 
form t a b - t a ti„ the 3-disk fundamental domain cycles at R : d = 6, table 29.3. The ratio 
A fl Ai/A fl 2, is approaching unity exponentially fast. 



It is crucial that the curvature expansion is grouped (and truncated) by topo- 
logically related cycles and pseudo-cycles; truncations that ignore topology, such 
as inclusion of all cycles with T p < T max , will contain orbits unmatched by shad- 
owed orbits, and exhibit a mediocre convergence compared with the curvature 
expansions. 

Note that the existence of a pole at z = 1 /c implies that the cycle expansions 
have a finite radius of convergence, and that analytic continuations will be required 
for extraction of the non-leading zeros of 1 /£. Preferably, one should work with 
cycle expansions of Selberg products, as discussed in sect. 20.2.2. 

J.1.3 No shadowing, poorer convergence 

Conversely, if the dynamics is not of a finite subshift type, there is no finite topo- 
logical polynomial, there are no "curvature" corrections, and the convergence of 
the cycle expansions will be poor. 



J.2 On importance of pruning 

If the grammar is not finite and there is no finite topological polynomial, there 
will be no "curvature" expansions, and the convergence will be poor. That is 
the generic case, and one strategy for dealing with it is to find a good sequence of 
approximate but finite grammars; for each approximate grammar cycle expansions 
yield exponentially accurate eigenvalues, with successive approximate grammars 
converging toward the desired infinite grammar system. 

When the dynamical system's symbolic dynamics does not have a finite gram- 
mar, and we are not able to arrange its cycle expansion into curvature combina- 
tions (20.7), the series is truncated as in sect. 20.6, by including all pseudo-cycles 
such that |A Pl • • • A Pk \ < \Ap\, where P is the most unstable prime cycle included 
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into truncation. The truncation error should then be of order 0(£ Tp Tp/\Ap\), with 
h the topological entropy, and d >Tp roughly the number of pseudo-cycles of stabil- 
ity « |Ap|. In this case the cycle averaging formulas do not converge significantly 
better than the approximations such as the trace formula (22.15). 

Numerical results (see for example the plots of the accuracy of the cycle ex- 
pansion truncations for the Henon map in ref. [3]) indicate that the truncation 
error of most averages tracks closely the fluctuations due to the irregular growth 
in the number of cycles. It is not known whether one can exploit the sum rules 
such as the mass flow conservation (20.17) to improve the accuracy of dynamical 
averaging. 



^■r The periodic orbit theory is learned in stages. At first glance, it seems 
totally impenetrable. After basic exercises are gone through, it seems totally triv- 
ial; all that seems to be at stake are elementary manipulations with traces, deter- 
minants, derivatives. But if start thinking about you will get a more and more 
uncomfortable feeling that from the mathematical point of view, this is a perilous 
enterprise indeed. In chapter 23 we shall explain which parts of this enterprise are 
really solid; here you give a fortaste of what objections a mathematician might 



Birkhoff's 1931 ergodic theorem states that the time average (17.4) exists al- 
most everywhere, and, if the flow is ergodic, it implies that (a(x)) = (a) is a 
constant for almost all x. The problem is that the above cycle averaging formulas 
implicitly rely on ergodic hypothesis: they are strictly correct only if the dynam- 
ical system is locally hyperbolic and globally mixing. If one takes a ft derivative 
of both sides 



J.3 Ma-the-matical caveats 



"Lo duca e io per quel cammino ascoso intrammo a ri- 
tornar nel chiaro monde; e sanza cura aver d'alcun riposa 
salimmo su, el primo e io secondo, tanto ch'i' vidi de le 
cose belle che porta '1 ciel, per un perutgio tondo." 



— Dante 



rise. 



pp(yV 



tw - 



Jm 



f dx8{y-f\x))^ A, ^ P p{x), 



and integrates over y 
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one obtains in the long time limit 



This is the expectation value (17.12) only if the time average (17.4) equals the 
space average (17.9), (a(x)) = (a), for all x except a subset x e M of zero 
measure; if the phase space is foliated into non-communicating subspaces A4 = 
M\ + A4z of finite measure such that f{M\) n M2 - 0 for all t, this fails. In 
other words, we have tacitly assumed metric indecomposability or transitivity. 
We have also glossed over the nature of the "phase space" At. For example, if the 
dynamical system is open, such as the 3-disk game of pinball, M in the expecta- 
tion value integral (17.24) is a Cantor set, the closure of the union of all periodic 
orbits. Alternatively, M can be considered continuous, but then the measure 
in (J.2) is highly singular. The beauty of the periodic orbit theory is that instead 
of using an arbitrary coordinatization of M it partitions the phase space by the in- 
trinsic topology of the dynamical flow and builds the correct measure from cycle 
invariants, the Floquet multipliers of periodic orbits. 

Were we to restrict the applications of the formalism only to systems which 
have been rigorously proven to be ergodic, we might as well fold up the shop 
right now. For example, even for something as simple as the Henon mapping we 
do not know whether the asymptotic time attractor is strange or periodic. Physics 



applications require a more pragmatic attitude. In the cycle expansions approach 
we construct the invariant set of the given dynamical system as a closure of the 
union of periodic orbits, and investigate how robust are the averages computed on 
this set. This turns out to depend very much on the observable being averaged 
over; dynamical averages exhibit "phase transitions", and the above cycle averag- 
ing formulas apply in the "hyperbolic phase" where the average is dominated by 
exponentially many exponentially small contributions, but fail in a phase domi- 
nated by few marginally stable orbits. Here the noise - always present, no matter 
how weak - helps us by erasing an infinity of small traps that the deterministic 
dynamics might fall into. 

Still, in spite of all the caveats, periodic orbit theory is a beautiful theory, 
and the cycle averaging formulas are the most elegant and powerful tool available 
today for evaluation of dynamical averages for low dimensional chaotic determin- 
istic systems. 



J.4 Estimate of the nth cumulant 

An immediate consequence of the exponential spacing of the eigenvalues is that 
the convergence of the Selberg product expansion (E. 12) as function of the topo- 
logical cycle length, F(z) = YmCnZ", is faster than exponential. Consider a d- 
dimensional map for which all Jacobian matrix eigenvalues are equal: Up = 
A p j = A Pt 2 = ■ ■ ■ = A p>( /. The Floquet multipliers are generally not isotropic; 




(J.2) 



exercise 6.4 
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however, to obtain qualitative bounds on the spectrum, we replace all Floquet 
multipliers with the least expanding one. In this case the p cycle contribution to 
the product (19.9) reduces to 

oo 

f p (z) = (i-t p u k ; +k ^) 

k v -k d =0 

CO 

"[(l - t p u p J 1k ; m k 

k=0 
oo m k I v 

= nz? y-4>,r 



In one dimension the expansion can be given in closed form 03.5), and the 
coefficients Q in (E.12) are given by 



d-l+k 
d- 1 



(k + d- 1)! 
k\(d-l)\ 



(J-3) 



M-ll 



r pk = (-D A 



n; =1 (i-^) 



(J-4) 



Hence the coefficients in the F(z) - C n z n expansion of the spectral determinant 
(20.14) fall off faster than exponentially, as \C n \ « u n(jl ~ ^' 2 . In contrast, the cycle 
expansions of dynamical zeta functions fall of "only" exponentially; in numerical 
applications, the difference is dramatic. 

In higher dimensions the expansions are not quite as compact. The leading 
power of u and its coefficient are easily evaluated by use of binomial expansions 
(J.3) of the (1 + tu c ) mk factors. More precisely, the leading if terms in t k coeffi- 
cients are of form 



"](! + 

k=0 



kyrik 



+ U mi + 2m2+...+/my A+m\+m2+.„+mj , 



+ 



+ . . 



+ U —< n " f + . . . 



Hence the coefficients in the F(z) expansion fall off faster than exponentially, as 
u" i+i ' d . The Selberg products are entire functions in any dimension, provided that 
the symbolic dynamics is a finite subshift, and all cycle eigenvalues are sufficiently 
bounded away from 1 . 

The case of particular interest in many applications are the 2-d Hamiltonian 
mappings; their symplectic structure implies that u p = A Pt \ = l/A Pi 2, and the 
Selberg product (19. 13) In this case the expansion corresponding to (23.5) is given 

3/2 

in exercise 23.4 and the coefficients fall off asymptotically as C n « u" . 
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J.5 Dirichlet series 



The most patient reader will thank me for compressing so 
much nonsense and falsehood into a few lines. 

— Gibbon 




A Dirichlet series of f(s) is defined as 



f(s) = ^ ajc 



(J-5) 



where s, a; are complex numbers, and {Aj\ is a monotonically increasing series 
of real numbers A\ < A% < • • • < Aj < • • : A classical example of a Dirichlet 
series is the Riemann zeta function for which aj = 1, Aj = In/ In the present 
context, a formal series over individual pseudo-cycles such as 00.2) ordered by 
increasing pseudo-cycle periods are often Dirichlet series. For example, for the 
pseudo-cycle weight (20.3), the Dirichlet series is obtained by ordering pseudo- 
cycles by increasing periods A n = T pi + T pi + . . . + T Pk , with the coefficients 

e P-(A n +A P2 +.„+A Pk ) 

On - "I 7~\ dn ' 

| A pi A p2 . . . A Pk | 

where d n is a degeneracy factor in the case that d n pseudo-cycles have the same 
weight. 

If the series £ \aj\ diverges, the Dirichlet series is absolutely convergent for 
Re s > a a and conditionally convergent for Re s > cr c , where cr a is the abscissa 
of absolute convergence 



cr a = lim sup — In > \aA, (J.6) 
N-kx, Aft [ 



and o" c is the abscissa of conditional convergence 



lim sup — In 

AT->oo Aff 



N 



7=1 



(J-7) 



We encounter another example of a Dirichlet series in the semiclassical quantiza- 
tion, the quantum chaos part of ChaosBook.org. 
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Commentary 



Remark J.l Are cycle expansions Dirichlet series? Even though some literature [12] 
refers to cycle expansions as 'Dirichlet series', they are not Dirichlet series. Cycle ex- 
pansions collect contributions of individual cycles into groups that correspond to the co- 
efficients in cumulant expansions of spectral determinants, and the convergence of cycle 
expansions is controlled by general properties of spectral determinants. Dirichlet series 
order cycles by their periods or actions, and are only conditionally convergent in the 
regions of interest. The abscissa of absolute convergence is in this context called the 'en- 
tropy barrier'; contrary to frequently voiced anxieties, this number does not necessarily 
has much to do with the actual convergence of the theory. 
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Appendix K 

Infinite dimensional operators 



(A. Wirzba) 



This appendix, taken from ref. [1], summarizes the definitions and properties 
of trace-class and Hilbert-Schmidt matrices, the determinants over infinite 
dimensional matrices and regularization schemes for matrices or operators 
which are not of trace-class. 



K.l Matrix-valued functions 



(P. Cvitanovic) 

As a preliminary we summarize some of the properties of functions of finite - 
dimensional matrices. 

The derivative of a matrix is a matrix with elements 



dA(x) . d 

{x) = dT ' = Tx Aij{x) ■ (K - 1} 



Derivatives of products of matrices are evaluated by the chain rule 



d dA dB 

— (AB) = — B+A— . (K.2) 
dx dx dx 



A matrix and its derivative matrix in general do not commute 



d 1 dA dA 

—A 2 = —A+A—. (K.3) 
dx dx dx 
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The derivative of the inverse of a matrix, follows from j^(AA l ) - 0: 



d , 1 dA 1 

—A - . (K.4) 

dx A dx A 



A function of a single variable that can be expressed in terms of additions and 
multiplications generalizes to a matrix-valued function by replacing the variable 
by the matrix. 

In particular, the exponential of a constant matrix can be defined either by its 
series expansion, or as a limit of an infinite product: 



e A = £n A *< A° = l (K.5) 



k=0 



= lim 1 1 H — A I (K.6) 



W->co \ N 



The first equation follows from the second one by the binomial theorem, so these 
indeed are equivalent definitions. That the terms of order 0(N~ 2 ) or smaller do 
not matter follows from the bound 

x + 8xn \ N j x+e\ 



I x-ey I x + dx N y I x + e\ 
V + N ) < \ + N J < l +- aT/ 



where \6xn\ < e. If lim Sx^ — > 0 as — > oo, the extra terms do not contribute. 
Consider now the determinant 



det(e A ) = lim (det (1 + A/N)) N . 



To the leading order in 1 /N 

1 ? 
det (1 + A IN) = 1 + — tr A + 0(N~ 2 ) . 

hence 



/ l \N 

dete A = lim 1 + — trA + 0(N~ 2 )\ = e tTA (K.7) 
iv^oo \ N J 



Due to non-commutativity of matrices, generalization of a function of several 
variables to a function is not as straightforward. Expression involving several 
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matrices depend on their commutation relations. For example, the commutator 
expansion 

e tA Be -tA _ B + f [X t B] + —[A, [A,B]] + ^y[A, [A, [A,B]]] + • • • (K.8) 

sometimes used to establish the equivalence of the Heisenberg and Schrodinger 
pictures of quantum mechanics follows by recursive evaluation of t derivatives 

-(e' A Be-' A ) = e tA [A,B]e- tA . 
at v ' 

Manipulations of such ilk yield 

e (A +S )/N = e AIN e KIN _ _L[A,B] + 0(N- 3 ) , 

and the Trotter product formula: if B, C and A = B + C are matrices, then 

e A = lim (e B ' N e c ' N ) N (K.9) 



K.2 Operator norms 

(R. Mainieri and P. Cvitanovic) 



The limit used in the above definition involves matrices - operators in 
vector spaces - rather than numbers, and its convergence can be checked using 
tools familiar from calculus. We briefly review those tools here, as throughout the 
text we will have to consider many different operators and how they converge. 



The n — > oo convergence of partial products 

E »=n k*) 



0<m<H 



can be verified using the Cauchy criterion, which states that the sequence {EJ,} 
converges if the differences \\E^ - Ej\\ — > 0 as k, j —> oo. To make sense of this we 
need to define a sensible norm || • • • ||. Norm of a matrix is based on the Euclidean 
norm for a vector: the idea is to assign to a matrix M a norm that is the largest 
possible change it can cause to the length of a unit vector h: 

||M|| = sup||Mn||, ||n|| = l. (K.10) 
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We say that || • || is the operator norm induced by the vector norm || • ||. Construct- 
ing a norm for a finite-dimensional matrix is easy, but had M been an operator in 
an infinite-dimensional space, we would also have to specify the space h belongs 
to. In the finite-dimensional case, the sum of the absolute values of the compo- 
nents of a vector is also a norm; the induced operator norm for a matrix M with 
components M,y in that case can be defined by 

||M|| = maxJjM, 7 |. (K.ll) 

i 



The operator norm (K. 1 1) and the vector norm (K. 10) are only rarely distinguished 
by different notation, a bit of notational laziness that we shall uphold. 

Now that we have learned how to make sense out of norms of operators, we 
can check that 

\\e lA \\ < e m . (K.12) 



exercise K.1 



exercise 2.9 



As | \A 1 1 is a number, the norm of e* A is finite and therefore well defined. In partic- 
ular, the exponential of a matrix is well defined for all values of t, and the linear 
differential equation (4.10) has a solution for all times. 



K.3 Trace class and Hilbert-Schmidt class 



This section is mainly an extract from ref. [9]. Refs. [7, 10, 11, 14] should be 
consulted for more details and proofs. The trace class and Hilbert-Schmidt prop- 
erty will be defined here for linear, in general non-hermitian operators A e £,(91): 
H — > 'H (where ti is a separable Hilbert space). The transcription to matrix 
elements (used in the prior chapters) is simply c$j = (<pi,A<f>j) where {<p n } is an 
orthonormal basis of and ( , ) is the inner product in (see sect. K.5 where 
the theory of von Koch matrices of ref. [12] is discussed). So, the trace is the 
generalization of the usual notion of the sum of the diagonal elements of a matrix; 
but because infinite sums are involved, not all operators will have a trace: 

Definition: 



(a) An operator A is called trace class, A 6 3\, if and only if, for every or- 
thonormal basis, {<f>„}: 

2l<0 n ,A0„>|<oo. (K.13) 

n 

The family of all trace class operators is denoted by J7i . 
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(b) An operator A is called Hilbert-Schmidt, A 6 J2, if and only if, for every 
orthonormal basis, {(p n }\ 

2 l|A^,|| 2 < 00 . 

n 

The family of all Hilbert-Schmidt operators is denoted by J2. 

Bounded operators are dual to trace class operators. They satisfy the following 
condition: \{if/,B(p)\ < C||^||||0|| with C < 00 and \p, (f> e 91. If they have eigenval- 
ues, these are bounded too. The family of bounded operators is denoted by SCH) 
with the norm ||B|| = sup^ 0 ^pjp for <f> e <H. Examples for bounded operators are 
unitary operators and especially the unit matrix. In fact, every bounded operator 
can be written as linear combination of four unitary operators. 

A bounded operator C is compact, if it is the norm limit of finite rank opera- 
tors. 

An operator A is called positive, A > 0, if (Acp, cp) > 0 e ti. Note that 
A* A > 0. We define |A| = VaTa 

The most important properties of the trace and Hilbert-Schmidt classes are 
summarized in (see refs. [7, 9]): 

(a) J~\ and 3~2 are *ideals., i.e., they are vector spaces closed under scalar mul- 

tiplication, sums, adjoints, and multiplication with bounded operators. 

(b) A e Ji if and only if A - BC with B,CeJ" 2 . 

(c) J7i c J~2 c Compact operators. 

(d) For any operator A, we have A € J~2 if En IIA^nll 2 < 00 for a single basis. 
For any operator A > Owe have A e J7i if E« l(0n> A0 n )| < 00 for a single 
basis. 

(e) If A e J7~i, Tr(A) = J^((p n , A0„) is independent of the basis used. 

(f) Tr is linear and obeys Tr(A t ) = Tr(A); Tr(AB) = Tr(BA) if either A e J x 

and B bounded, A bounded and B e J7i or both A, B € J~2. 

(g) $2 endowed with the inner product (A, B>2 = TrCA+B) is a Hilbert space. 

If ||A|| 2 = [ Tr(A^ A) ] 3 , then ||A|| 2 > ||A|| and J 2 is the || || 2 -closure of the 
finite rank operators. 

(h) J~\ endowed with the norm ||A||i = Tr( YAtA) is a Banach space. ||A||i > 

IAH2 > ||A||and^Ti is the || ||i -norm closure of the finite rank operators. The 
dual space of J~\ is the family of bounded operators with the duality 

(B,A> = Tr(BA). 

(i) If A,B e J" 2 , then ||AB||i < ||A|| 2 ||B||2. If A € J 2 and B e B(<H), then 

IIABH2 < ||A|| 2 ||B||. If A e Sf\ and B e then ||AB|h < ||A||i||B||. 
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Note the most important property for proving that an operator is trace class is the 
decomposition (b) into two Hilbert-Schmidt ones, as the Hilbert-Schmidt prop- 
erty can easily be verified in one single orthonormal basis (see (d)). Property (e) 
ensures then that the trace is the same in any basis. Properties (a) and (f) show 
that trace class operators behave in complete analogy to finite rank operators. The 
proof whether a matrix is trace-class (or Hilbert-Schmidt) or not simplifies enor- 
mously for diagonal matrices, as then the second part of property (d) is directly 
applicable: just the moduli of the eigenvalues (or - in case of Hilbert-Schmidt - 
the squares of the eigenvalues) have to be summed up in order to answer that ques- 
tion. A good strategy in checking the trace-class character of a general matrix A is 
therefore the decomposition of that matrix into two matrices B and C where one, 
say C, should be chosen to be diagonal and either just barely of Hilbert-Schmidt 
character leaving enough freedom for its partner B or of trace-class character such 
that one only has to show the boundedness for B. 



K.4 Determinants of trace class operators 

This section is mainly based on refs. [8, 10] which should be consulted for more 
details and proofs. See also refs. [11, 14]. 

Pre-definitions (Alternating algebra and Fock spaces): 

Given a Hilbert space 7Y, ®"*H is defined as the vector space of multi-linear func- 
tional on "H with 0i ® • • • ® <f>„ e ® n< H in case 0i 0„ e *H. /\ n CH) is defined 

as the subspace of g/'T-f spanned by the wedge-product 



where P„ is the group of all permutations of n letters and e{n) - ± 1 depending 
on whether n is an even or odd permutation, respectively. The inner product in 
A n (*H) is given by 



where det{ay] = Y^nep,, t( n ) a \n(\) ' ' ' flW(«)- A"(A) is defined as functor (a functor 
satisfies A"(AB) = A"(A) A"(B)) on A"CK> with 



When n = 0, A"(?0 is defined to be C and A"(A) as 1: C -» C. 

Properties: If A trace class, i.e., A e Si , then for any k, A*(A) is trace class, and 
for any orthonormal basis {cp n } the cumulant 



01 A • • • A <p n - — — 




(01 A • • • A 0,„ TJi A • • • A T] n ) = det [{(pi, ?7y)} 





h<-<h 



< oo 
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is independent of the basis (with the understanding that Tr/\°(A) = 1). 



Definition: Let A e J~\, then det(l + A) is defined as 

oo 

det(l + A) = Yj Tr ( A" (A) ) (K " 14) 

k=0 

Properties: 

Let A be a linear operator on a separable Hilbert space <H and {<pj}™ an or- 
thonormal basis. 

(a) Zr=o Tr ( A*(A)) converges for each A e Ji . 

(b) |det(l + A)| < Y\°j=\ (l + )"y(A)J where ///A) are the singular values of A, 
i.e., the eigenvalues of |A| = VaTa. 

(c) |det(l+A)|<expG|A||i). 

(d) For any Ai,...,A„ e J\, (z\ z n ) det ( 1 + £" =1 z/AjJ is an entire 

analytic function. 

(e) If A,B € Ji.then 

det(l + A)det(l + B) = det (1 + A + B + AB) 
= det((l + A)(l + B)) 
- det((l + B)(l+A)) . (K.15) 

If A e J\ and U unitary, then 

det(u _1 (l + A)U) = det(l + U~ ! AU) - det(l +A) . 

(f) If A e Ji , then (1 + A) is invertible if and only if det(l + A) + 0. 

(g) If A t 0 is an n-times degenerate eigenvalue of A e » then det(l + zA) has 

a zero of order n at z = - 1 1 A. 

(h) For any e, there is a C e (A), depending onA e Ji.so that |det(l + zA)| < 
C e (A)exp(e|z|). 

(i) For any Ae Ji, 

N(A) 

det(l + A) = Y\ (l + ^;(A)) (K.16) 

7=1 

where here and in the following {^ (A)}^^ are the eigenvalues of A counted 
with algebraic multiplicity . 
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(j) Lidskii's theorem: For any A e Si . 

N(A) 

Tr(A) = ^ a/A) < oo . 

7=1 



(k) IfAeJ'i.then 



Tr(/\ fc (A)) 



w(A*'(A)) 



7=1 



^ A yi (A)---^(A)<<x>. 

l<;i<-<A<W(A) 



(1) IfAeJ^then 



det(l+ Z A)=2 Z * 2 A ;i (A)---^(A)<oo. (K.17) 

/fc=0 l<y"i<-<y>^/V(A) 

(m) If A e Si, then for \z\ small (i.e., |z| max|^(A)| < 1) the series ^ Tr {(~ A ) k 
converges and 



det(l + zA) = exp 



exp(Trln(l +zA)) . 



(K.18) 



(n) The Plemelj-Smithies formula: Define a m (A) for A e Si by 



det(l + zA) - Yj z " 



Hi=0 



ml 



Then a m (A) is given by the m x m determinant: 



a m {A) 



Tr(A) m - 1 0 
Tr(A 2 ) Tr(A) m-2 
Tr(A 3 ) Tr(A 2 ) Tr(A) 



Tr(A m ) Tr(A (m_1) ) Tr(A (m ~ 2) ) 



,(m-2)\ 



0 
0 
0 

1 

Tr(A) 



(K.19) 



(K.20) 



with the understanding that ao(A) = 1 and ari(A) = Tr(A). Thus the cumu- 
lants c m (A) = a m (A)/m\ satisfy the following recursion relation 



1 m 

(A) - - y(-l) fc+1 c m _,t(A)Tr(A fe ) form>l 



c 0 (A) = 1. 



(K.21) 
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Note that in the context of quantum mechanics formula (K.19) is the quantum 
analog to the curvature expansion of the semiclassical zeta function with Tr(A") 
corresponding to the sum of all periodic orbits (prime and also repeated ones) of 
total topological length m, i.e., let c m (s.c.) denote the m th curvature term, then the 
curvature expansion of the semiclassical zeta function is given by the recursion 
relation 



In fact, in the cumulant expansion (K.19) as well as in the curvature expansion 
there are large cancelations involved. Let us order - without lost of generality - 
the eigenvalues of the operator A € J7i as follows: 



(This is always possible because of \M < 00 ■) Then, in the standard 
(Plemelj -Smithies) cumulant evaluation of the determinant, eq. (K.19), we have 
enormous cancelations of big numbers, e.g. at the k th cumulant order (k > 3), 
all the intrinsically large 'numbers' A\, A k ~ l A2, . ■ ., A k ~ 2 A2A^, . . . and many more 
have to cancel out exactly until only Zi<y!<-<^<JV(A) ^ji ' ' ' * s finally left over. 
Algebraically, the fact that there are these large cancelations is of course of no 
importance. However, if the determinant is calculated numerically, the big cance- 
lations might spoil the result or even the convergence. Now, the curvature expan- 
sion of the semiclassical zeta function, as it is known today, is the semiclassical 
approximation to the curvature expansion (unfortunately) in the Plemelj -Smithies 
form. As the exact quantum mechanical result is approximated semiclassically, 
the errors introduced in the approximation might lead to big effects as they are 
done with respect to large quantities which eventually cancel out and not - as it 
would be of course better - with respect to the small surviving cumulants. Thus 
it would be very desirable to have a semiclassical analog to the reduced cumulant 
expansion (K.17) or even to (K.16) directly. It might not be possible to find a 
direct semiclassical analog for the individual eigenvalues Aj. Thus the direct con- 
struction of the semiclassical equivalent to (K.16) is rather unlikely. However, in 
order to have a semiclassical "cumulant" summation without large cancelations 
- see (K.17) - it would be just sufficient to find the semiclassical analog of each 
complete cumulant (K.17) and not of the single eigenvalues. Whether this will 
eventually be possible is still an open question. 



K.5 Von Koch matrices 

Implicitly, many of the above properties are based on the theory of von Koch 
matrices [11, 12, 13]: An infinite matrix 1 - A = \\5ju - a/fc||™, consisting of 




for m > 1 



co(s.c) 



1 . 



(K.22) 



\A l \>\A 2 \>--->\A i . l \>\A i \>\A i+l \>--- 
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complex numbers, is called a matrix with an absolutely convergent determinant, 
if the series £ l<z/ifci<z/ 2 fc 2 ' ' ' a j,„k, \ converges, where the sum extends over all pairs 
of systems of indices (71,72, • • • , j n ) and {k\,k2, ■ ■ ■ , k n ) which differ from each 
other only by a permutation, and j\ < 72 < • • • j„ (n - 1,2, • • •)■ Then the limit 

lim det\\6 jk - a jk \\" = det(l - A) 

;l— >oo 

exists and is called the determinant of the matrix 1 - A. It can be represented in 
the form 



00 ~ 00 1 00 

det(l-A) = l-ya ;7 + iV a » a * -1 Y 
La " i\ La a kj a kk v. La 



2! 

i=i j,k=l 



3! 

j,k,m= 1 



®jj &jk Qjn, 

a k j a kk a km + 

@m j Q>mk ®mm 



where the series on the r.h.s. will remain convergent even if the numbers aj k (j, k = 
1, 2, • • •) are replaced by their moduli and if all the terms obtained by expanding 
the determinants are taken with the plus sign. The matrix 1 - A is called von Koch 
matrix, if both conditions 

00 

^Kil < °°> (K-23) 

7=1 



2 M 2 

j,k=l 



< 00 (K.24) 



are fulfilled. Then the following holds (see ref. [11, 13]): (1) Every von Koch 
matrix has an absolutely convergent determinant. If the elements of a von Koch 
matrix are functions of some parameter fi (aj k = aj k (ji), j,k = 1, 2, • • •) and both 
series in the defining condition converge uniformly in the domain of the parameter 
fi, then as n — ^ 00 the determinant detll^ - aj k (jj)\\" tends to the determinant 
det(l + A(ju)) uniformly with respect to fi, over the domain of fi. (2) If the matrices 
1 - A and 1 - B are von Koch matrices, then their product 1 - C = (1 - A)(l - B) 
is a von Koch matrix, and 

det(l - C) = det(l - A) det(l - B) . 

Note that every trace-class matrix A € STi is also a von Koch matrix (and that 
any matrix satisfying condition (K.24) is Hilbert-Schmidt and vice versa). The 
inverse implication, however, is not true: von Koch matrices are not automati- 
cally trace-class. The caveat is that the definition of von Koch matrices is basis- 
dependent, whereas the trace-class property is basis-independent. As the traces 
involve infinite sums, the basis-independence is not at all trivial. An example for 
an infinite matrix which is von Koch, but not trace-class is the following: 

2/7 for i — j — — 1 and j even , 
Ajj - i 2/i for i - j — +1 and i even , 
) else , 
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i.e., 



f 0 


1 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


1/2 


0 


0 


0 


0 


1/2 


0 


0 


0 


0 


0 


0 


0 


0 


1/3 


0 


0 


0 


0 


1/3 


0 



Obviously, condition (K.23) is fulfilled by definition. Second, the condition (K.24) 
is satisfied as 2^1 1 2/« 2 < 00 ■ However, the sum over the moduli of the eigen- 
values is just twice the harmonic series E^Li which does not converge. The 
matrix (K.25) violates the trace-class definition (K.13), as in its eigenbasis the sum 
over the moduli of its diagonal elements is infinite. Thus the absolute convergence 
is traded for a conditional convergence, since the sum over the eigenvalues them- 
selves can be arranged to still be zero, if the eigenvalues with the same modulus 
are summed first. Absolute convergence is of course essential, if sums have to be 
rearranged or exchanged. Thus, the trace-class property is indispensable for any 
controlled unitary transformation of an infinite determinant, as then there will be 
necessarily a change of basis and in general also a re-ordering of the correspond- 
ing traces. Therefore the claim that a Hilbert-Schmidt operator with a vanishing 
trace is automatically trace-class is false. In general, such an operator has to be 
regularized in addition (see next chapter). 



K.6 Regularization 

Many interesting operators are not of trace class (although they might be in some 
J~ p with p > 1 - an operator A is in J~ p iff Tr|A| p < oo in any orthonormal basis). 
In order to compute determinants of such operators, an extension of the cumulant 
expansion is needed which in fact corresponds to a regularization procedure [: , 



as the regulated version of the operator zA. Then the regulated operator fy(zA) is 
trace class, i.e., R n (zA) e J~\. Define now det„(l + zA) - det(l + R n (zA)). Then 
the regulated determinant 



10]: 

E.g. let A e J p with p <n. Define 




deal + zA) = ] (l + zAj(A)) exp £ 




{-zAj(A)) 



< oo. 



(K.26) 




k 
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exists and is finite. The corresponding Plemelj-Smithies formula now reads [10]: 



det„(l + zA) = J\ 



m=0 



ml 



(K.27) 



with a„ (A) given by the m x m determinant: 



<4" } (A) 



(») 1 A 

cr. m - 1 0 

cr' cr . m - 2 

(n) (n) (n) 

cry a\> a\ 



(T 



(n) 



(T 



in) 
m-1 



CT 



in) 
m—2 



0 
0 
0 



cr 



1 

in) 



(K.28) 



where 



in) _ J Tr(A fe ) 
0 



k>n 
k < n - 1 



As Simon [ 0] says simply, the beauty of (K.28) is that we get det„(l + A) from 
the standard Plemelj-Smithies formula (K.19) by simply setting Tr(A), Tr(A 2 ), 
. . ., Tr(A" _1 ) to zero. 

See also ref. [15] where {Aj} are the eigenvalues of an elliptic (pseudo)-differential 
operator H of order m on a compact or bounded manifold of dimension d, 0 < A) < 
A\ < ■ ■ ■ and A^ f +oo. and the Fredholm determinant 



is regulated in the case fi = d/m > 1 as Weierstrass product 



A{A) = I 



k=0 



1 ~ T ' exp 



A_ JM_ 

A k + 2A\" " ' ' + W f t 



(K.29) 



where denotes the integer part of fj.. This is, see ref. [15], the unique entire 
function of order // having zeros at {\} and subject to the normalization conditions 



lnA(O) = — lnA(O) = 
dA 



dAM 



lnA(O) - 0 . 



Clearly (K.29) is the same as (K.26); one just has to identify z = -A, A = 1/H 
and n - 1 = [p.]. An example is the regularization of the spectral determinant 



A(E) = det [(£ - H)] 



(K.30) 
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which - as it stands - would only make sense for a finite dimensional basis (or 
finite dimensional matrices). In ref. [ ] the regulated spectral determinant for the 
example of the hyperbola billiard in two dimensions (thus d = 2, m = 2 and hence 
fi — 1) is given as 

A(£) = det [(E - H)Q(£, H)] 

where 

Q(E,H) = -U~ l e EH [ 

such that the spectral determinant in the eigenbasis of H (with eigenvalues E + 0) 
reads 

A(£) = ]^|l- J-j e E/£ " <oo. 
Note that H 1 is for this example of Hilbert-Schmidt character. 



[ 



Exercises 

K. 1 . Norm of exponential of an operator. Verify inequal- 
ity (K. 12): 

|| e ' A || < e' m . 
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Thermodynamic formalism 



Being Hungarian is not sufficient. You also must be tal- 
ented. 

— Zsa Zsa Gabor 

(G. Vattay) 

In the preceding chapters we characterized chaotic systems via global quanti- 
ties such as averages. It turned out that these are closely related to very fine 
details of the dynamics like stabilities and time periods of individual periodic 
orbits. In statistical mechanics a similar duality exists. Macroscopic systems are 
characterized with thermodynamic quantities (pressure, temperature and chemical 
potential) which are averages over fine details of the system called microstates. 
One of the greatest achievements of the theory of dynamical systems was when 
in the sixties and seventies Bowen, Ruelle and Sinai made the analogy between 
these two subjects explicit. Later this "Thermodynamic Formalism" of dynam- 
ical systems became widely used making it possible to calculate various fractal 
dimensions. We sketch the main ideas of this theory and show how periodic orbit 
theory helps to carry out calculations. 



L.l Renyi entropies 

As we have already seen trajectories in a dynamical system can be characterized 
by their symbolic sequences from a generating Markov partition. We can locate 
the set of starting points M Sl s 2 ...s„ of trajectories whose symbol sequence starts 
with a given set of n symbols s\S2—s n . We can associate many different quantities 
to these sets. There are geometric measures such as the volume V(siS2—S n ), the 
area A(siS2—s„) or the length l(siS2...s n ) of this set. Or in general we can have 
some measure fJL(M slS2 .„ Sn ) = fi(s\S2...s n ) of this set. As we have seen in (22.10) 
the most important is the natural measure, which is the probability that an ergodic 
trajectory visits the setfi(siS2-.s n ) = P(siS2-..s n ). The natural measure is additive. 
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Summed up for all possible symbol sequences of length n it gives the measure of 
the whole state space: 

^ H{S\S2...S n ) = 1 (L.l) 

SlS 2 ...S„ 

expresses probability conservation. Also, summing up for the last symbol we get 
the measure of a one step shorter sequence 

^fl(SlS2...S n ) - /J,(s 1 S2...S n -{). 

As we increase the length (n) of the sequence the measure associated with it de- 
creases typically with an exponential rate. It is then useful to introduce the expo- 
nents 

A(sis 2 ...s n ) = --logn(sis 2 ...s n ). (L.2) 
n 

To get full information on the distribution of the natural measure in the symbolic 
space we can study the distribution of exponents. Let the number of symbol se- 
quences of length n with exponents between A and A + dA be given by Nn{A)dA. 
For large n the number of such sequences increases exponentially. The rate of this 
exponential growth can be characterized by g(A) such that 

N n (A) ~ exp(ngOl)) . 

The knowledge of the distribution N n (A) or its essential part g(A) fully character- 
izes the microscopic structure of our dynamical system. 

As a natural next step we would like to calculate this distribution. However it 
is very time consuming to calculate the distribution directly by making statistics 
for millions of symbolic sequences. Instead, we introduce auxiliary quantities 
which are easier to calculate and to handle. These are called partition sums 

Z n (fi)= AsiS2-s«), (L.3) 

as they are obviously motivated by Gibbs type partition sums of statistical me- 
chanics. The parameter /? plays the role of inverse temperature and E(s\ S2...s n ) 
-\ogfi(sls2—s n ) is the energy associated with the microstate labeled by S[S2---s n 
We are tempted also to introduce something analogous with the Free energy. In 
dynamical systems this is called the Renyi entropy [ ] defined by the growth rate 
of the partition sum 



1 1 ' 
Kft = hm -- — -lo 

n->oo n 1 — p 



2^ f/(s 1 s 2 ...s n ) 



\sis 2 ...s„ 



(L.4) 



thermodyn - 2 1 sep2() ] 2 
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In the special case /? — > 1 we get Kolmogorov entropy 

K x = lim - V -n(sis 2 ...s n )logn(sis 2 ...s n ), 



n 

S\Sl...S n 



while for yS = 0 we recover the topological entropy 
h top = K 0 = lim - log N(ri), 

n—>co n 

where N(n) is the number of existing length n sequences. To connect the partition 
sums with the distribution of the exponents, we can write them as averages over 
the exponents 



Z n (J3)= dAN„(A)exp(-nA/3), 



where we used the definition (L.2). For large n we can replace N„(A) with its 
asymptotic form 



Z n (P) ~ J" dAexp(ng(A))exp(-nAfi). 

For large n this integral is dominated by contributions from those A* which maxi- 
mize the exponent 

g(A) - A/3. 

The exponent is maximal when the derivative of the exponent vanishes 

g\A*)=p. (L.5) 

From this equation we can determine A*(fi). Finally the partition sum is 
Z n {fi) ~ exv(n[g(A*(J3)) - A*(J3)j3]). 

Using the definition (L.4) we can now connect the Renyi entropies and g(A) 

(J3-l)Kp = A*(]3)/3-g(A*(J3)y (L-6) 

Equations (L.5) and (L.6) define the Legendre transform of g(A). This equation is 
analogous with the thermodynamic equation connecting the entropy and the free 
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energy. As we know from thermodynamics we can invert the Legendre trans- 
form. In our case we can express g(A) from the Renyi entropies via the Legendre 
transformation 



g(A) = AIT {A) - (J3*(A) - l)Kp W , (L.7) 
where now ft* {A) can be determined from 



-^[{fi* - = A. (L.8) 



Obviously, if we can determine the Renyi entropies we can recover the distribution 
of probabilities from (L.7) and (L.8). 

The periodic orbit calculation of the Renyi entropies can be carried out by 
approximating the natural measure corresponding to a symbol sequence by the 
expression (22.10) 



H(s u s n ) * — -. (L.9) 

\^S\S 2 ...S n \ 



The partition sum (L.3) now reads 

Z "^*Z^' (L.10) 

where the summation goes for periodic orbits of length n. We can define the 
characteristic function 



fi(z,/0 = exp \-Yj-Z n (J3) . (L.ll) 



According to (L.4) for large n the partition sum behaves as 

Z n {j3) ~ e - n( P~ l)K P. (L.12) 



Substituting this into (L.ll) we can see that the leading zero of the characteristic 
function is 
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On the other hand substituting the periodic orbit approximation L.10) into (L.ll) 
and introducing prime and repeated periodic orbits as usual we get 



Q(z,/3) = exp 



We can see that the characteristic function is the same as the zeta function we 
introduced for Lyapunov exponents (H. 12) except we have zeP y instead of z. Then 
we can conclude that the Renyi entropies can be expressed with the pressure func- 
tion directly as 

P(fi) = (fi-l)Kf,+fty, (L.13) 

since the leading zero of the zeta function is the pressure. The Renyi entropies ffe, 
hence the distribution of the exponents g(A) as well, can be calculated via finding 
the leading eigenvalue of the operator (H.4). 

From (L.13) we can get all the important quantities of the thermodynamic 
formalism. For (3 = 0 we get the topological entropy 

P(0) - -K 0 = -hto P . (L.14) 

For j6 = 1 we get the escape rate 

P(l) = y. (L.15) 

Taking the derivative of (L.13) in /? = 1 we get Pesin's formula [2] connecting 
Kolmogorov entropy and the Lyapunov exponent 

P'{\) =1 = Ki+y. (L.16) 

It is important to note that, as always, these formulas are strictly valid for nice 
hyperbolic systems only. At the end of this Chapter we discuss the important 
problems we are facing in non-hyperbolic cases. 

On figure L.2 we show a typical pressure and g(A) curve computed for the two 
scale tent map of Exercise L.4. We have to mention, that all typical hyperbolic dy- 
namical system produces a similar parabola like curve. Although this is somewhat 
boring we can interpret it like a sign of a high level of universality: The exponents 
A have a sharp distribution around the most probable value. The most probable 
value is A = P'(0) and g(A) = h top is the topological entropy. The average value 
in closed systems is where g(A) touches the diagonal: A = g(A) and 1 = g'(A). 

Next, we are looking at the distribution of trajectories in real space. 



exercise L.1 
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Figure L.l 



Figure L.2: g(A) and P(J3) for the map of exercise L.4 
at a = 3 and b = 3/2. 




L.2 Fractal dimensions 

Hentschel and Procaccia rediscovered a small part of my 
theory. Generalized dimensions are not useful at all. 

— Benoit B. Mandelbrot 

By looking at the repeller we can recognize an interesting spatial structure. 
In the 3-disk case the starting points of trajectories not leaving the system after 
the first bounce form two strips. Then these strips are subdivided into an infinite 
hierarchy of substrings as we follow trajectories which do not leave the system 
after more and more bounces. The finer strips are similar to strips on a larger 
scale. Objects with such self similar properties are called fractals. 

We can characterize fractals via their local scaling properties. The first step is 
to draw a uniform grid on the surface of section. We can look at various measures 
in the square boxes of the grid. The most interesting measure is again the natural 
measure located in the box. By decreasing the size of the grid e the measure in 
a given box will decrease. If the distribution of the measure is smooth then we 
expect that the measure of the ith box is proportional with the dimension of the 
section 

Hi ~ e d . 

If the measure is distributed on a hairy object like the repeller we can observe 
unusual scaling behavior of type 



where a\ is the local "dimension" or Holder exponent of the object. As a is not 
necessarily an integer here we are dealing with objects with fractional dimensions. 
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We can study the distribution of the measure on the surface of section by looking 
at the distribution of these local exponents. We can define 

log/i; 

at = , 

loge 

the local Holder exponent and then we can count how many of them are between 
a and a + da. This is N e {a)da. Again, in smooth objects this function scales 
simply with the dimension of the system 

N e (a) ~ e~ d , 



while for hairy objects we expect an a dependent scaling exponent 

N £ (a) ~ e~ fia \ 



f(a) can be interpreted [7] as the dimension of the points on the surface of section 
with scaling exponent a. We can calculate f(a) with the help of partition sums as 
we did for g(A) in the previous section. First, we define 

Z 6 ( 9 ) = ^f. (L.17) 



Then we would like to determine the asymptotic behavior of the partition sum 
characterized by the r(q) exponent 



Z € (q) ~ e 



The partition sum can be written in terms of the distribution function of as 

Z e (q) = J daN e (a)e qa . 
Using the asymptotic form of the distribution we get 

Z £ (q) ~ J da^ a ' f{a) . 

As e goes to zero the integral is dominated by the term maximizing the exponent. 
This a* can be determined from the equation 



-jL(qa* -f(a*)) = 0, 
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leading to 

q = f\a). 

Finally we can read off the scaling exponent of the partition sum 

r(q) = aq - /(or*). 

In a uniform fractal characterized by a single dimension both a and f(a) col- 
lapse to a = f(a) = D. The scaling exponent then has the form r(q) = (q - \)D. 
In case of non uniform fractals we can introduce generalized dimensions [)] D q 
via the definition 

D q = T(q)/(q - 1). 

Some of these dimensions have special names. For q = 0 the partition sum (L.17) 
counts the number of non empty boxes N^. Consequently 

D 0 = -hm- , 

e^o loge 

is called the box counting dimension. For q = 1 the dimension can be determined 
as the limit of the formulas for q —> 1 leading to 



Di = lim V m log nil log e. 

e— »0 z — ' 



This is the scaling exponent of the Shannon information entropy [ 11] of the dis- 
tribution, hence its name is information dimension. 

Using equisize grids is impractical in most of the applications. Instead, we 
can rewrite (L. 17) into the more convenient form 



V — 1. (L.18) 



If we cover the ith branch of the fractal with a grid of size \ instead of e we can 
use the relation [5] 



m - (L ' 19) 
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the non-uniform grid generalization ofL. 18. Next we show how can we use the 
periodic orbit formalism to calculate fractal dimensions. We have already seen 
that the width of the strips of the repeller can be approximated with the stabilities 
of the periodic orbits placed within them 

u ~ ix? 

Then using this relation and the periodic orbit expression of the natural measure 
we can write (L.19) into the form 

> — ~ 1, (L.20) 

where the summation goes for periodic orbits of length n. The sum for stabilities 
can be expressed with the pressure function again 



\ K:\q~r! ' 



nP(q-T{q)) 

|Ai|4^fo> 



and (L.20) can be written as 

e iyn e -nP(q-T(q)) ^ y 

for large n. Finally we get an implicit formula for the dimensions 

P(q -{q- l)D q ) = qy. (L.21) 

Solving this equation directly gives us the partial dimensions of the multifractal 
repeller along the stable direction. We can see again that the pressure function 
alone contains all the relevant information. Setting q = 0 in (L.21) we can prove 
that the zero of the pressure function is the box-counting dimension of the repeller 

P(A>) - 0. 

Taking the derivative of (L.21) in q = 1 we get 
P'(l)(l-Oi) = 7- 

This way we can express the information dimension with the escape rate and the 
Lyapunov exponent 

Di = 1 - y/X (L.22) 

If the system is bound (y = 0) the information dimension and all other dimensions 
are D q = 1. Also since D\Q is positive (L.22) proves that the Lyapunov exponent 
must be larger than the escape rate A > y in general. 

exercise L.4 
exercise L.5 
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Resume 

In this chapter we have shown that thermodynamic quantities and various frac- 
tal dimensions can be expressed in terms of the pressure function. The pressure 
function is the leading eigenvalue of the operator which generates the Lyapunov 
exponent. In the Lyapunov case is just an auxiliary variable. In thermodynamics 
it plays an essential role. The good news of the chapter is that the distribution of 
locally fluctuating exponents should not be computed via making statistics. We 
can use cyclist formulas for determining the pressure. Then the pressure can be 
found using short cycles + curvatures. Here the head reaches the tail of the snake. 
We just argued that the statistics of long trajectories coded in g(A) and P(J3) can be 
calculated from short cycles. To use this intimate relation between long and short 
trajectories effectively is still a research level problem. 



Commentary 

Remark L.l Mild phase transition. In non-hyperbolic systems the formulas derived 
in this chapter should be modified. As we mentioned in remark 22.1 in non-hyperbolic 
systems the periodic orbit expression of the measure can be 

/io = e r "/|A 0 |', 

where 6 can differ from 1. Usually it is 1 /2. For sufficiently negative ft the corresponding 
term l/|Ao|^ can dominate (L.10) while in (L.3) e 7n /\Ao\® plays no dominant role. In this 
case the pressure as a function of /? can have a kink at the critical point /3 = /3 C where 
fi c log |Ao| = (fi c — l)Kp c + p c J- For P < Pc the pressure and the Renyi entropies differ 

P(fi)*(fl-l)Kp+/3y. 

This phenomena is called phase transition. This is however not a very deep problem. We 
can fix the relation between pressure and the entropies by replacing 1/|A o| with 1 /| Kq\ s in 
(L.10). 

Remark L.2 Hard phase transition. The really deep trouble of thermodynamics is 
caused by intermittency. In that case we have periodic orbits with |Ao| — » 1 as n — > oo. 
Then for/? > 1 the contribution of these orbits dominate both (L.10) and (L.3). Conse- 
quently the partition sum scales as Z„(J3) — > 1 and both the pressure and the entropies are 
zero. In this case quantities connected with P < 1 make sense only. These are for example 
the topological entropy, Kolmogorov entropy, Lyapunov exponent, escape rate, D o and 
D\. This phase transition cannot be fixed. It is probably fair to say that quantities which 
depend on this phase nansition are only of mathematical interest and not very useful for 
characterization of realistic dynamical systems. 
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Exercises 



L.l. 



L.2. 



L.3. 



Thermodynamics in higher dimensions. Define 
Lyapunov exponents as the time averages of the eigen- 
exponents of the Jacobian matrix / 



H<*> = lim-log|A^o)l, 



t— >°o t 

as a generalization of (6.1 1). 

Show that in d dimensions Pesin's formula is 

d 



(L.23) 



K 



■ y, 



(L.24) 



where the summation goes for the positive fi^-s only. 
Hint: Use the ^-dimensional generalization of (L.9) 



7lf[ A M 



where the product goes for the expanding eigenvalues of 
the Jacobian matrix of /9-cycle. (G. Vattay) 

Stadium billiard Kolmogorov entropy. (con- 
tinuation of exercise 8.4) Take a — 1.6 and d — 1 
in the stadium billiard figure 8.1, and estimate the Lya- 
punov exponent by averaging over a very long trajec- 
tory. Biham and Kvale [ 1 4] estimate the discrete time 
Lyapunov to A as 1 .0 ± . 1, the continuous time Lyapunov 
to A as 0.43 + .02, the topological entropy (for their sym- 
bolic dynamics) h as 1.15 ± .03. 

Entropy of rugged-edge billiards. Take a semi-circle 
of diameter e and replace the sides of a unit square by 
Ll/eJ semi-circle arcs. 




L.5. 



(a) Is the billiard ergodic as s — > 0? 

(b) (hard) Show that the entropy of the billiard map is 



— In e + const , 
n 



as s — » 0. (Hint: do not write return maps.) 

(c) (harder) Show that when the semi-circles of the 
stadium billiard are far apart, say L, the entropy 
for the flow decays as 



21nL 

7TL 



L.4. Two scale map. Compute all those quantities - dimen- 
sions, escape rate, entropies, etc. - for the repeller of the 
one dimensional map 



fix) 



1 + ax if 
1 - bx if 



x < 0, 
x > 0. 



(L.25) 



where a and b are larger than 2. Compute the fractal di- 
mension, plot the pressure and compute the f(a) spec- 
trum of singularities. 

Transfer matrix. Take the unimodal map f(x) = 
sin(7rx) of the interval / = [0,1]. Calculate the four 
preimages of the intervals Iq = [0, 1 /2] and I\ - 
[1/2, 1]. Extrapolate f(x) with piecewise linear func- 
tions on these intervals. Find a i, ci2 and b of the previous 
exercise. Calculate the pressure function of this linear 
extrapolation. Work out higher level approximations by 
linearly extrapolating the map on the 2"-th preimages of 
/. 
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Appendix M 

Statistical mechanics recycled 



spin system with long-range interactions can be converted into a chaotic 



dynamical system that is differentiable and low-dimensional. The ther- 



L -™- modynamic limit quantities of the spin system are then equivalent to long 
time averages of the dynamical system. In this way the spin system averages can 
be recast as the cycle expansions. If the resulting dynamical system is analytic, the 
convergence to the thermodynamic limit is faster than with the standard transfer 
matrix techniques. 



M.l The thermodynamic limit 

There are two motivations to recycle statistical mechanics: one gets better control 
over the thermodynamic limit and one gets detailed information on how one is 
converging to it. From this information, most other quantities of physical interst 
can be computed. 

In statistical mechanics one computes the averages of observables. These are 
functions that return a number for every state of the system; they are an abstraction 
of the process of measuring the pressure or temperature of a gas. The average of 
an observable is computed in the thermodynamic limit — the limit of system with 
an arbitrarily large number of particles. The thermodynamic limit is an essential 
step in the computation of averages, as it is only then that one observes the bulk 
properties of matter. 

Without the thermodynamic limit many of the thermodynamic properties of 
matter could not be derived within the framework of statistical mechanics. There 
would be no extensive quantities, no equivalence of ensembles, and no phase tran- 
sitions. From experiments it is known that certain quantities are extensive, that is, 



(R. Mainieri) 
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they are proportional to the size of the system. This is not true for an interact- 
ing set of particles. If two systems interacting via pairwise potentials are brought 
close together, work will be required to join them, and the final total energy will 
not be the sum of the energies of each of the parts. To avoid the conflict between 
the experiments and the theory of Hamiltonian systems, one needs systems with 
an infinite number of particles. In the canonical ensemble the probability of a 
state is given by the Boltzman factor which does not impose the conservation of 
energy; in the microcanonical ensemble energy is conserved but the Boltzmann 
factor is no longer exact. The equality between the ensembles only appears in the 
limit of the number of particles going to infinity at constant density. The phase 
transitions are inteipreted as points of non-analyticity of the free energy in the 
thermodynamic limit. For a finite system the partition function cannot have a zero 
as a function of the inverse temperature /3, as it is a finite sum of positive terms. 

The thermodynamic limit is also of central importance in the study of field 
theories. A field theory can be first defined on a lattice and then the lattice spac- 
ing is taken to zero as the correlation length is kept fixed. This continuum limit 
corresponds to the thermodynamic limit. In lattice spacing units the correlation 
length is going to infinity, and the interacting field theory can be thought of as a 
statistical mechanics model at a phase transition. 

For general systems the convergence towards the thermodynamic limit is slow. 
If the thermodynamic limit exists for an interaction, the convergence of the free 
energy per unit volume / is as an inverse power in the linear dimension of the 
system. 



where n is proportional to V 1 , with V the volume of the <i-dimensional system. 
Much better results can be obtained if the system can be described by a transfer 
matrix. A transfer matrix is concocted so that the trace of its nth power is exactly 
the partition function of the system with one of the dimensions proportional to 
n. When the system is described by a transfer matrix then the convergence is 
exponential, 



and may only be faster than that if all long-range correlations of the system are 
zero — that is, when there are no interactions. The coefficient a depends only on 
the inverse correlation length of the system. 

One of the difficulties in using the transfer matrix techniques is that they seem 
at first limited to systems with finite range interactions. Phase transitions can 
happen only when the interaction is long range. One can try to approximate the 
long range interaction with a series of finite range interactions that have an ever 
increasing range. The problem with this approach is that in a formally defined 




(M.l) 



-an 



(M.2) 



statmech - ldec2001 



ChaosBook.org versionl4.4.1, Apr 21 2013 



APPENDIX M. STATISTICAL MECHANICS RECYCLED 



928 



transfer matrix, not all the eigenvalues of the matrix correspond to eigenvalues of 
the system (in the sense that the rate of decay of correlations is not the ratio of 
eigenvalues). 

Knowledge of the correlations used in conjunction with finite size scaling to 
obtain accurate estimates of the parameters of systems with phase transitions. (Ac- 
curate critical exponents are obtained by series expansions or transfer matrices, 
and infrequently by renormalization group arguments or Monte Carlo.) In a phase 
transition the coefficient a of the exponential convergence goes to zero and the 
convergence to the thermodynamic limit is power-law. 

The computation of the partition function is an example of a functional inte- 
gral. For most interactions these integrals are ill-defined and require some form 
of normalization. In the spin models case the functional integral is very simple, 
as "space" has only two points and only "time" being infinite has to be dealt with. 
The same problem occurs in the computation of the trace of transfer matrices of 
systems with infinite range interactions. If one tries to compute the partition func- 
tion Z„ 

Z n = tr T" 

when T is an infinite matrix, the result may be infinite for any n. This is not to 
say that Z„ is infinite, but that the relation between the trace of an operator and the 
partition function breaks down. We could try regularizing the expression, but as 
we shall see below, that is not necessary, as there is a better physical solution to 
this problem. 

What will described here solves both of these problems in a limited context: 
it regularizes the transfer operator in a physically meaningful way, and as a con- 
sequence, it allows for the faster than exponential convergence to the thermody- 
namic limit and complete determination of the spectrum. The steps to achieve this 
are: 

• Redefine the transfer operator so that there are no limits involved except for 
the thermodynamic limit. 

• Note that the divergences of this operator come from the fact that it acts on 
a very large space. All that is needed is the smallest subspace containing 
the eigenvector corresponding to the largest eigenvalue (the Gibbs state). 

• Rewrite all observables as depending on a local effective field. The eigen- 
vector is like that, and the operator restricted to this space is trace-class. 

• Compute the spectrum of the transfer operator and observe the magic. 
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M.2 Ising models 

The Ising model is a simple model to study the cooperative effects of many small 
interacting magnetic dipoles. The dipoles are placed on a lattice and their interac- 
tion is greatly simplified. There can also be a field that includes the effects of an 
external magnetic field and the average effect of the dipoles among themselves. 
We will define a general class of Ising models (also called spin systems) where the 
dipoles can be in one of many possible states and the interactions extend beyond 
the nearest neighboring sites of the lattice. But before we extend the Ising model, 
we will examine the simplest model in that class. 

M.2.1 Ising model 

One of the simplest models in statistical mechanics is the Ising model. One imag- 
ines that one has a 1 -dimensional lattice with small magnets at each site that can 
point either up or down. 

ooooooooo. 

Each little magnet interacts only with its neighbors. If they both point in the same 
direction, then they contribute an energy -/ to the total energy of the system; and 
if they point in opposite directions, then they contribute +J. The signs are chsen 
so that they prefer to be aligned. Let us suppose that we have n small magnets 
arranged in a line: A line is drawn between two sites to indicate that there is an 
interaction between the small magnets that are located on that site 

O O O O O O O O O . (M.3) 

(This figure can be thought of as a graph, with sites being vertices and interacting 
magnets indicated by edges.) To each of the sites we associate a variable, that we 
call a spin, that can be in either of two states: up (T) or down (J,). This represents 
the two states of the small magnet on that site, and in general we will use the 
notation Xo to represent the set of possible values of a spin at any site; all sites 
assume the same set of values. A configuration consists of assigning a value to 
the spin at each site; a typical configuration is 

O O O O O O O O O . (M.4) 

The set of all configurations for a lattice with n sites is called and is formed 
by the Cartesian product Qo X Qo • • • X Do> the product repeated n times. Each 
configuration a e Q! 1 is a string of n spins 

<r = {o-Q,(Ti,...o- n }, (M.5) 
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In the example configuration (M.4) there are two pairs of spins that have the 
same orientation and six that have the opposite orientation. Therefore the total 
energy H of the configuration is/x6-7x2 = 47. In general we can associate 
an energy H to every configuration 

H(o-) = J]j6( ( r i ,cr i+1 ), (M.6) 



where 



One of the problems that was avoided when computing the energy was what to do 
at the boundaries of the 1 -dimensional chain. Note that as written, (M.6) requires 
the interaction of spin n with spin n + 1. In the absence of phase transitions the 
boundaries do not matter much to the thermodynamic limit and we will connect 
the first site to the last, implementing periodic boundary conditions. 

Thermodynamic quantities are computed from the partition function ^ as 
the size n of the system becomes very large. For example, the free energy per site 
/ at inverse temperature is given by 



■pfifi) = lim -lnZ w . (M.8) 



The partition function Zy 1 ' is computed by a sum that runs over all the possible 
configurations on the 1 -dimensional chain. Each configuration contributes with 
its Gibbs factor exp(-/?//(cr)) and the partition function Z> n) is 



7(n) 



e-P H(cT) . (M.9) 



The partition function can be computed using transfer matrices. This is a 
method that generalizes to other models. At first, it is a little mysterious that 
matrices show up in the study of a sum. To see where they come from, we can 
try and build a configuration on the lattice site by site. The first thing to do is to 
expand out the sum for the energy of the configuration 

Z^ifi) = e P JS (< T l ,0-2)^6(0-2,0-3) _ _ _ ^J&(fT n ,a-{) ^-JyJ 1Q) 

o-efi" 

Let us use the configuration in (M.4). The first site is cr\ =f. As the second site 
is T, we know that the first term in (M.10) is a term e@ J . The third spin is J,, so the 
second term in (M.10) is e~@ J '. If the third spin had been f, then the term would 
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have been e@ J but it would not depend on the value of the first spin en . This means 
that the configuration can be built site by site and that to compute the Gibbs factor 
for the configuration just requires knowing the last spin added. We can then think 
of the configuration as being a weighted random walk where each step of the walk 
contributes according to the last spin added. The random walk take place on the 
transition graph 



Choose one of the two sites as a starting point. Walk along any allowed edge 
making your choices randomly and keep track of the accumulated weight as you 
perform the n steps. To implement the periodic boundary conditions make sure 
that you return to the starting node of the transition graph. If the walk is carried 
out in all possible 2" ways then the sum of all the weights is the partition function. 
To perform the sum we consider the matrix 



As in chapter 1 1 the sum of all closed walks is given by the trace of powers of the 
matrix. These powers can easily be re-expressed in terms of the two eigenvalues 
A\ and A2 of the transfer matrix: 



M.2.2 Averages of observables 

Averages of observables can be re-expressed in terms of the eigenvectors of the 
transfer matrix. Alternatively, one can introduce a modified transfer matrix and 
compute the averages through derivatives. Sounds familiar? 

M.2.3 General spin models 

The more general version of the Ising model — the spin models — will be defined 
on a regular lattice, ZP. At each lattice site there will be a spin variable that can 
assumes a finite number of states identified by the set Qq. 

The transfer operator T was introduced by Kramers and Wannier [1 2] to study 
the Ising model on a strip and concocted so that the trace of its «th power is the 




e' 




(M.ll) 



Z (,l) (/3) = tr T"(J3) = AxtfT + A 2 (j3) n . 



(M.12) 
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partition function Z,, of system when one of its dimensions is n. The method 
can be generalized to deal with any finite -range interaction. If the range of the 
interaction is L, then T is a matrix of size 2 L x 2 L . The longer the range, the larger 
the matrix. 



M.3 Fisher droplet model 

In a series of articles [20], Fisher introduced the droplet model. It is a model for 
a system containing two phases: gas and liquid. At high temperatures, the typical 
state of the system consists of droplets of all sizes floating in the gas phase. As the 
temperature is lowered, the droplets coalesce, forming larger droplets, until at the 
transition temperature, all droplets form one large one. This is a first order phase 
transition. 

Although Fisher formulated the model in 3-dimensions, the analytic solution 
of the model shows that it is equivalent to a 1 -dimensional lattice gas model with 
long range interactions. Here we will show how the model can be solved for an 
arbitrary interaction, as the solution only depends on the asymptotic behavior of 
the interaction. 

The interest of the model for the study of cycle expansions is its relation to 
intermittency. By having an interaction that behaves asymptotically as the scaling 
function for intermittency, one expects that the analytic structure (poles and cuts) 
will be same. 

Fisher used the droplet model to study a first order phase transition |20]. 
Gallavotti [2 1 ] used it to show that the zeta functions cannot in general be ex- 
tended to a meromorphic functions of the entire complex plane. The droplet model 
has also been used in dynamical systems to explain features of mode locking, see 
Artuso [22]. In computing the zeta function for the droplet model we will discover 
that at low temperatures the cycle expansion has a limited radius of convergence, 
but it is possible to factorize the expansion into the product of two functions, each 
of them with a better understood radius of convergence. 

M.3.1 Solution 

The droplet model is a 1 -dimensional lattice gas where each site can have two 
states: empty or occupied. We will represent the empty state by 0 and the occupied 
state by 1. The configurations of the model in this notation are then strings of 
zeros and ones. Each configuration can be viewed as groups of contiguous ones 
separated by one or more zeros. The contiguous ones represent the droplets in the 
model. The droplets do not interact with each other, but the individual particles 
within each droplet do. 

To determine the thermodynamics of the system we must assign an energy 
to every configuration. At very high temperatures we would expect a gaseous 
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phase where there are many small droplets, and as we decrease the temperature 
the droplets would be expected to coalesce into larger ones until at some point 
there is a phase transition and the configuration is dominated by one large drop. 
To construct a solvable model and yet one with a phase transition we need long 
range interaction among all the particles of a droplet. One choice is to assign a 
fixed energy 6 n for the interactions of the particles of a cluster of size n. In a 
given droplet one has to consider all the possible clusters formed by contiguous 
particles. Consider for example the configuration 0111010. It has two droplets, 
one of size three and another of size one. The droplet of size one has only one 
cluster of size one and therefore contributes to the energy of the configuration with 
0\. The cluster of size three has one cluster of size three, two clusters of size two, 
and three clusters of size one; each cluster contributing a ft term to the energy. 
The total energy of the configuration is then 

#(0111010) =40i +202 + 10 3 . (M.13) 

If there where more zeros around the droplets in the above configuration the en- 
ergy would still be the same. The interaction of one site with the others is assumed 
to be finite, even in the ground state consisting of a single droplet, so there is a 
restriction on the sum of the cluster energies given by 

a = ^0„<oo. (M.14) 

n>0 

The configuration with all zeros does not contribute to the energy. 

Once we specify the function 9 n we can computed the energy of any config- 
uration, and from that determine the thermodynamics. Here we will evaluate the 
cycle expansion for the model by first computing the generating function 



G(^),V^ (M.15) 

z_i n 

n>0 



and then considering its exponential, the cycle expansion. Each partition function 
Z n must be evaluated with periodic boundary conditions. So if we were computing 
Z3 we must consider all eight binary sequences of three bits, and when computing 
the energy of a configuration, say 011, we should determine the energy per three 
sites of the long chain 

...011011011011... 

In this case the energy would be 02 + 29\. If instead of 011 we had considered 
one of its rotated shifts, 110 or 101, the energy of the configuration would have 
been the same. To compute the partition function we only need to consider one 
of the configurations and multiply by the length of the configuration to obtain the 
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contribution of all its rotated shifts. The factor l/n in the generating function can- 
cels this multiplicative factor. This reduction will not hold if the configuration 
has a symmetry, as for example 0101 which has only two rotated shift configura- 
tions. To compensate this we replace the l/n factor by a symmetry factor 1/ s(b) 
for each configuration b. The evaluation of G is now reduced to summing over 
all configurations that are not rotated shift equivalent, and we call these the basic 
configurations and the set of all of them B. We now need to evaluate 

G W> = Y ^ e ~ mb) ■ (M.16) 

The notation | • | represents the cardinality of the set. 

Any basic configuration can be built by considering the set of droplets that 
form it. The smallest building block has size two, as we must also put a zero next 
to the one so that when two different blocks get put next to each other they do not 
coalesce. The first few building blocks are 

size droplets 

2 01 (M.17) 

3 001 Oil 

4 0001 0011 0111 

Each droplet of size n contributes with energy 

W n = ^ (n-k + l)0 k . (M.18) 

l<k<n 

So if we consider the sum 

+ z 4 (e -*ff«xx>i) + e -/?#(ooi i) + e - m ou i )} + . . .y (M l9) 

then the power in n will generate all the configurations that are made from many 
droplets, while the z will keep track of the size of the configuration. The factor 
l/n is there to avoid the over-counting, as we only want the basic configurations 
and not its rotated shifts. The l/n factor also gives the correct symmetry factor in 
the case the configuration has a symmetry. The sum can be simplified by noticing 
that it is a logarithmic series 

- In (l - ( Z V^> + z V^ 1 + e~P m -) + •••), (M.20) 

where the H(b) factors have been evaluated in terms of the droplet energies W n . 
A proof of the equality of (M.19) and (M.20) can be given, but we there was not 
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enough space on the margin to write it down. The series that is subtracted from 
one can be written as a product of two series and the logarithm written as 

- In (l - (z 1 + z 2 + z 3 + ■ ■ + zV^ 2 + •••)) (M.21) 

The product of the two series can be directly interpreted as the generating function 
for sequences of droplets. The first series adds one or more zeros to a configuration 
and the second series add a droplet. 

There is a whole class of configurations that is not included in the above sum: 
the configurations formed from a single droplet and the vacuum configuration. 
The vacuum is the easiest, as it has zero energy it only contributes a z- The sum 
of all the null configurations of all sizes is 

V - . (M.22) 
^— ' n 

n>0 

The factor 1 fn is here because the original G had them and the null configurations 
have no rotated shifts. The single droplet configurations also do not have rotated 
shifts so their sum is 



V . (M.23) 

^— ' n 

Because there are no zeros in the above configuration clusters of all size exist and 
the energy of the configuration is n£ ft which we denote by na. 

From the three sums (M.21), (M.22), and (M.23) we can evaluate the generat- 
ing function G to be 

G{z,P) = - ln(l - z) - ln(l - ze~ pa ) - ln(l - -i- Y z"e^ w ") . (M.24) 

1 -z*-* 

n>\ 

The cycle expansion ^ 1 (z,fi) is given by the exponential of the generating 
function e~ G and we obtain 

C\z,P) = (1 - ze' pa ){\ - z(l + z"^ W '0) (M.25) 

n>\ 

To pursue this model further we need to have some assumptions about the 
interaction strengths 6 n . We will assume that the interaction strength decreases 
with the inverse square of the size of the cluster, that is, Q, = -l/n 2 . With this we 
can estimate that the energy of a droplet of size n is asymptotically 

W„ ~ -n + In n + 0(-) . (M.26) 
n 
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If the power chosen for the polynomially decaying interaction had been other than 
inverse square we would still have the droplet term proportional to n, but there 
would be no logarithmic term, and the O term would be of a different power. 
The term proportional to n survives even if the interactions falls off exponentially, 
and in this case the correction is exponentially small in the asymptotic formula. 
To simplify the calculations we are going to assume that the droplet energies are 
exactly 

W„ = -n + \nn (M.27) 

in a system of units where the dimensional constants are one. To evaluate the 
cycle expansion (M.25) we need to evaluate the constant a, the sum of all the 6),. 
One can write a recursion for the 9 n 

On = W n - ^ ~ k + % (M.28) 

1 <k<n 

and with an initial choice for G\ evaluate all the others. It can be verified that in- 
dependent of the choice of 9\ the constant a is equal to the number that multiplies 
the n term in (M.27). In the units used 

a = -l. (M.29) 

For the choice of droplet energy (M.27) the sum in the cycle expansion can be 
expressed in terms of a special function: the Lerch transcendental It is defined 
by 



Mz,s,c) = Y—Z—, (M.30) 



excluding from the sum any term that has a zero denominator. The Lerch function 
converges for \z\ < 1. The series can be analytically continued to the complex 
plane and it will have a branch point at z = 1 with a cut chosen along the pos- 
itive real axis. In terms of Lerch transcendental function we can write the cycle 
expansion (M.25) using (M.27) as 

r\z,p) = (l -ze?)(l -z(l +Hize p ,P, 1))) (M.31) 

This serves as an example of a zeta function that cannot be extended to a mero- 
morphic function of the complex plane as one could conjecture. 

The thermodynamics for the droplet model comes from the smallest root of 
(M.31). The root can come from any of the two factors. For large value of /3 (low 
temperatures) the smallest root is determined from the ( 1 - z$) factor, which gave 
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the contribution of a single large drop. For small ft (large temperatures) the root is 
determined by the zero of the other factor, and it corresponds to the contribution 
from the gas phase of the droplet model. The transition occurs when the smallest 
root of each of the factors become numerically equal. This determines the critical 
temperature fi c through the equation 

1 _ e-defi + & 0 c)) = o (M.32) 

which can be solved numerically. One finds that fi c = 1.40495. The phase tran- 
sition occurs because the roots from two different factors get swapped in their 
roles as the smallest root. This in general leads to a first order phase transition. 
For large f3 the Lerch transcendental is being evaluated at the branch point, and 
therefore the cycle expansion cannot be an analytic function at low temperatures. 
For large temperatures the smallest root is within the radius of convergence of 
the series for the Lerch transcendental, and the cycle expansion has a domain of 
analyticity containing the smallest root. 

As we approach the phase transition point as a function of ft the smallest root 
and the branch point get closer together until at exactly the phase transition they 
collide. This is a sufficient condition for the existence of a first order phase transi- 
tions. In the literature of zeta functions [ 9] there have been speculations on how 
to characterize a phase transition within the formalism. The solution of the Fisher 
droplet model suggests that for first order phase transitions the factorized cycle 
expansion will have its smallest root within the radius of convergence of one of 
the series except at the phase transition when the root collides with a singularity. 
This does not seem to be the case for second order phase transitions. 

The analyticity of the cycle expansion can be restored if we consider separate 
cycle expansions for each of the phases of the system. If we separate the two terms 
of in (M.31), each of them is an analytic function and contains the smallest 
root within the radius of convergence of the series for the relevant /3 values. 



M.4 Scaling functions 

There is a relation between general spin models and dynamical system. If one 
thinks of the boxes of the Markov partition of a hyperbolic system as the states 
of a spin system, then computing averages in the dynamical system is carrying 
out a sum over all possible states. One can even construct the natural measure of 
the dynamical system from a translational invariant "interaction function" call the 
scaling function. 

There are many routes that lead to an explanation of what a scaling function 
is and how to compute it. The shortest is by breaking away from the histori- 
cal development and considering first the presentation function of a fractal. The 
presentation function is a simple chaotic dynamical system (hyperbolic, unlike 
the circle map) that generates the fractal and is closely related to the definition 
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Figure M.l: Construction of the steps of the scal- 
ing function from a Cantor set. From one level to the 
next in the construction of the Cantor set the covers 
are shrunk, each parent segment into two children seg- 

0.4 

ments. The shrinkage of the last level of the construe- § 
tion is plotted and by removing the gaps one has an % 03 
approximation to the scaling function of the Cantor set. 0 2 



of fractals of Hutchinson [24] and the iterated dynamical systems introduced by 
Barnsley and collaborators [ ]. From the presentation function one can derive 
the scaling function, but we will not do it in the most elegant fashion, rather we 
will develop the formalism in a form that is directly applicable to the experimental 
data. 

In the upper part of figure M. 1 we have the successive steps of the construction 
similar to the middle third Cantor set. The construction is done in levels, each 
level being formed by a collection of segments. From one level to the next, each 
"parent" segment produces smaller "children" segments by removing the middle 
section. As the construction proceeds, the segments better approximate the Cantor 
set. In the figure not all the segments are the same size, some are larger and some 
are smaller, as is the case with multifractals. In the middle third Cantor set, the 
ratio between a segment and the one it was generated from is exactly 1/3, but in 
the case shown in the figure the ratios differ from 1/3. If we went through the last 
level of the construction and made a plot of the segment number and its ratio to 
its parent segment we would have a scaling function, as indicated in the figure. 
A function giving the ratios in the construction of a fractal is the basic idea for a 
scaling function. Much of the formalism that we will introduce is to be able to 
give precise names to every segments and to arrange the "lineage" of segments 
so that the children segments have the correct parent. If we do not take these 
precautions, the scaling function would be a "wild function," varying rapidly and 
not approximated easily by simple functions. 

To describe the formalism we will use a variation on the quadratic map that 
appears in the theory of period doubling. This is because the combinatorial ma- 
nipulations are much simpler for this map than they are for the circle map. The 
scaling function will be described for a one dimensional map F as shown in fig- 
ure M.2. Drawn is the map 

F{x) = 5x(l - x) (M.33) 

restricted to the unit interval. We will see that this map is also a presentation 
function. 

It has two branches separated by a gap: one over the left portion of the unit 
interval and one over the right. If we choose a point x at random in the unit 
interval and iterate it under the action of the map F, (M.33), it will hop between the 
branches and eventually get mapped to minus infinity. An orbit point is guaranteed 
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Figure M.2: A Cantor set presentation function. The 
Cantor set is the set of all points that under iteration do 
not leave the interval [0, 1]. This set can be found by 
backwards iterating the gap between the two branches 
of the map. The dotted lines can be used to find these 
backward images. At each step of the construction one 
is left with a set of segments that form a cover of the 
Cantor set. 



to go to minus infinity if it lands in the gap. The hopping of the point defines the 
orbit of the initial point x: x t-> x\ i-» x% i-» • • •. For each orbit of the map F we 
can associate a symbolic code. The code for this map is formed from 0s and Is 
and is found from the orbit by associating a 0 if Xt < 1/2 and a 1 if x t > 1 /2, with 
t = 0,1,2,.... 

Most initial points will end up in the gap region between the two branches. 
We then say that the orbit point has escaped the unit interval. The points that do 
not escape form a Cantor set C (or Cantor dust) and remain trapped in the unit 
interval for all iterations. In the process of describing all the points that do not 
escape, the map F can be used as a presentation of the Cantor set C, and has been 
called a presentation function by Feigenbaum [17]. 

How does the map F "present" the Cantor set? The presentation is done in 
steps. First, we determine the points that do not escape the unit interval in one 
iteration of the map. These are the points that are not part of the gap. These points 
determine two segments, which are an approximation to the Cantor set. In the 
next step we determine the points that do not escape in two iterations. These are 
the points that get mapped into the gap in one iteration, as in the next iteration 
they will escape; these points form the two segments & and at level 1 in 
figure M.2. The processes can be continued for any number of iterations. If we 
observe carefully what is being done, we discover that at each step the pre-images 
of the gap (backward iterates) are being removed from the unit interval. As the 
map has two branches, every point in the gap has two pre-images, and therefore 
the whole gap has two pre-images in the form of two smaller gaps. To generate all 
the gaps in the Cantor set one just has to iterate the gap backwards. Each iteration 
of the gap defines a set of segments, with the nth iterate defining the segments 

at level n. For this map there will be 2" segments at level n, with the first few 
drawn in figure M.2. As n — > oo the segments that remain for at least n iterates 
converge to the Cantor set C. 

The segments at one level form a cover for the Cantor set and it is from a cover 
that all the invariant information about the set is extracted (the cover generated 
from the backward iterates of the gap form a Markov partition for the map as a 

(n) 

dynamical system). The segments {A^ } \ at level n are a refinement of the cover 
formed by segments at level n - 1. From successive covers we can compute the 
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trajectory scaling function, the spectrum of scalings f(a), and the generalized 
dimensions. 

To define the scaling function we must give labels (names) to the segments. 
The labels are chosen so that the definition of the scaling function allows for sim- 
ple approximations. As each segment is generated from an inverse image of the 
unit interval, we will consider the inverse of the presentation function F. Be- 
cause F does not have a unique inverse, we have to consider restrictions of F. Its 
restriction to the first half of the segment, from 0 to 1/2, has a unique inverse, 
which we will call Fq 1 , and its restriction to the second half, from 1/2 to 1, also 
has a unique inverse, which we will call Fj~ . For example, the segment labeled 
A (2) (0, 1) in fi gure M.2 is formed from the inverse image of the unit interval by 
mapping A^ 0) , the unit interval, with Fj" 1 and then F Q , so that the segment 

A (2) (0, 1) - F Q 1 (F- 1 (A (0) )) . (M.34) 

The mapping of the unit interval into a smaller interval is what determines its 
label. The sequence of the labels of the inverse maps is the label of the segment: 



A^(eue 2 ,...,e n ) = F^oF^o...F^(A^) 



The scaling function is formed from a set of ratios of segments length. We use 
around a segment A^' !) (£) to denote its size (length), and define 



w \A^{e l ,e 2 ,...,e n )\ 



We can then arrange the ratios o^ n \e\, ei,...,e n ) next to each other as piecewise 
constant segments in increasing order of their binary label q, ei, . . . , e n so that the 
collection of steps scan the unit interval. As n — > oo this collection of steps will 
converge to the scaling function. 



M.5 Geometrization 



The X operator is a generalization of the transfer matrix. It gets more by consid- 
ering less of the matrix: instead of considering the whole matrix it is possible to 
consider just one of the rows of the matrix. The £, operator also makes explicit 
the vector space in which it acts: that of the observable functions. Observables are 
functions that to each configuration of the system associate a number: the energy, 
the average magnetization, the correlation between two sites. It is in the average 
of observables that one is interested in. Like the transfer matrix, the £, operator 
considers only semi-infinite systems, that is, only the part of the interaction be- 
tween spins to the right is taken into account. This may sound un-symmetric, but 
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it is a simple way to count each interaction only once, even in cases where the 
interaction includes three or more spin couplings. To define the £ operator one 
needs the interaction energy between one spin and all the rest to its right, which is 
given by the function <p. The £ operators defined as 



g(o- 0 cr)e 



of") 
o-oefio 



To each possible value in Dq that the spin <tq can assume, an average of the observ- 
able g is computed weighed by the Boltzmann factor e^>. The formal relations 
that stem from this definition are its relation to the free energy when applied to the 
observable i that returns one for any configuration: 



-J3f(0)= lim -\n\\£ n i\ 

n— >oo fi 



and the thermodynamic average of an observable 

\\r 8 \\ 



(g) = lim 



\\£"i\\ 



Both relations hold for almost all configurations. These relations are part of the- 
orem of Ruelle that enlarges the domain of the Perron-Frobenius theorem and 
sharpens its results. The theorem shows that just as the transfer matrix, the largest 
eigenvalue of the £ operator is related to the free-energy of the spin system. It also 
hows that there is a formula for the eigenvector related to the largest eigenvalue. 
This eigenvector |p) (or the corresponding one for the adjoint £ of X) is the Gibbs 
state of the system. From it all averages of interest in statistical mechanics can be 
computed from the formula 



<g> = <plglp>. 

The Gibbs state can be expressed in an explicit form in terms of the inter- 
actions, but it is of little computational value as it involves the Gibbs state for a 
related spin system. Even then it does have an enormous theoretical value. Later 
we will see how the formula can be used to manipulate the space of observables 
into a more convenient space. 

The geometrization of a spin system converts the shift dynamics (necessary 
to define the Ruelle operator) into a smooth dynamics. This is equivalent to the 
mathematical problem in ergodic theory of finding a smooth embedding for a 
given Bernoulli map. 

The basic idea for the dynamics is to establish the a set of maps F ak such that 



^(0) = 0 
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and 



F<ri ° F<T 2 ° ■ ■ ■ ° F<r n (Q) = <p{+, 0-1,0-2,..., o- n , -,-,...)■ 

This is a formal relation that expresses how the interaction is to be converted into 
a dynamical systems. In most examples F (Tk is a collection of maps from a subset 
of R D to itself. 

If the interaction is complicated, then the dimension of the set of maps may 
be infinite. If the resulting dynamical system is infinite have we gained anything 
from the transformation? The gain in this case is not in terms of added speed of 
convergence to the thermodynamic limit, but in the fact that the Ruelle operator 
is of trace-class and all eigenvalues are related to the spin system and not artifacts 
of the computation. 

The construction of the higher dimensional system is done by borrowing the 
state space reconstruction technique from dynamical systems. State space recon- 
struction can be done in several ways: by using delay coordinates, by using deriva- 
tives of the position, or by considering the value of several independent observ- 
ables of the system. All these may be used in the construction of the equivalent 
dynamics. Just as in the study of dynamical systems, the exact method does not 
matter for the determination of the thermodynamics (f(a) spectra, generalized di- 
mension), also in the construction of the equivalent dynamics the exact choice of 
observable does not matter. 

We will only consider configurations for the half line. This is because for 
translational invariant interactions the thermodynamic limit on half line is the 
same as in the whole line. One can prove this by considering the difference in 
a thermodynamic average in the line and in the semiline and compare the two as 
the size of the system goes to infinity. 

When the interactions are long range in principle one has to specify the bound- 
ary conditions to be able to compute the interaction energy of a configuration in a 
finite box. If there are no phase transitions for the interaction, then which bound- 
ary conditions are chosen is irrelevant in the thermodynamic limit. When com- 
puting quantities with the transfer matrix, the long range interaction is truncated 
at some finite range and the truncated interaction is then use to evaluate the trans- 
fer matrix. With the Ruelle operator the interaction is never truncated, and the 
boundary must be specified. 

The interaction </>(cr) is any function that returns a number on a configuration. 
In general it is formed from pairwise spin interactions 



with different choices of J{n) leading to different models. If J{n) - 1 only if n - 1 
and ) otherwise, then one has the nearest neighbor Ising model. If J(n) = n 2 , then 
one has the inverse square model relevant in the study of the Kondo problem. 




n>0 
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Let us say that each site of the lattice can assume two values +, - and the set 
of all possible configurations of the semiline is the set Q. Then an observable g 
is a function from the set of configurations Q. to the reals. Each configuration is 
indexed by the integers from 0 up, and it is useful to think of the configuration as 
a string of spins. One can append a spin ij) to its beginning, n V cr, in which case 
r] is at site 0, a»o at site 1 , and so on. 

The Ruelle operator £, is defined as 

This is a positive and bounded operator over the space of bounded observables. 
There is a generalization of the Perron-Frobenius theorem by Ruelle that estab- 
lishes that the largest eigenvalue of X is isolated from the rest of the spectrum and 
gives the thermodynamics of the spin system just as the largest eigenvalue of the 
transfer matrix does. Ruelle also gave a formula for the eigenvector related to the 
largest eigenvalue. 

The difficulty with it is that the relation between the partition function and the 
trace of its nth power, trX" = Z n no longer holds. The reason is that the trace of 
the Ruelle operator is ill-defined, it is infinite. 

We now introduce a special set of observables {x\(cr), . . . ,x\(cr)}. The idea 
is to choose the observables in such a way that from their values on a particular 
configuration cr the configuration can be reconstructed. We also introduce the 
interaction observables ha- 0 . 

To geometrize spin systems, the interactions are assumed to be translationally 
invariant. The spins o~k will only assume a finite number of values. For simplic- 
ity, we will take the interaction <p among the spins to depend only on pairwise 
interactions, 

0(o-) = <f)(o- Q , 0-1,0-2, ■■■) = Jqo-q + ^ Sa-^a-Jxin) , (M.35) 

n>0 

and limit o~k to be in {+, -}. For the 1-dimensional Ising model, Jq is the external 
magnetic field and J\ (n) = 1 if n = 1 and 0 otherwise. For an exponentially decay- 
ing interaction J\{n) = e~ an . Two- and 3-dimensional models can be considered 
in this framework. For example, a strip of spins of L x 00 with helical boundary 
conditions is modeled by the potential J\(n) = 8, u \ + 8 n ^. 

The transfer operator T was introduced by Kramers and Wannier [1 2] to study 
the Ising model on a strip and concocted so that the trace of its nth power is the 
partition function 7^ of system when one of its dimensions is n. The method can be 
generalized to deal with any finite-range interaction. If the range of the interaction 
is L, then T is a matrix of size 2 L X 2 L . The longer the range, the larger the matrix. 
When the range of the interaction is infinite one has to define the T operator by 
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its action on an observable g. Just as the observables in quantum mechanics, g 
is a function that associates a number to every state (configuration of spins). The 
energy density and the average magnetization are examples of observables. From 
this equivalent definition one can recover the usual transfer matrix by making all 
quantities finite range. For a semi-infinite configuration cr = {cr 0 , cr\, . . .}: 

Tg(cr) = g(+ V cr)e~ m+Vcr) + g(- V cr)^ ( - Vcr) . (M.36) 

By + V cr we mean the configuration obtained by prepending + to the beginning 
of cr resulting in the configuration {+, cr 0 , cr\,. . .}. When the range becomes in- 
finite, tr T n is infinite and there is no longer a connection between the trace and 
the partition function for a system of size n (this is a case where matrices give 
the wrong intuition). Ruelle [13] generalized the Perron-Frobenius theorem and 
showed that even in the case of infinite range interactions the largest eigenvalue 
of the T operator is related to the free-energy of the spin system and the corre- 
sponding eigenvector is related to the Gibbs state. By applying T to the constant 
observable u, which returns 1 for any configuration, the free energy per site / is 
computed as 

- pf(j3) = Km - In \\T n u\\ . (M.37) 

li^oo fi 

To construct a smooth dynamical system that reproduces the properties of T, 
one uses the phase space reconstruction technique of Packard et al. [ ] and Tak- 
ens [7], and introduces a vector of state observables x(cr) = {xi(cr), . . . , X£>(cr)}. 
To avoid complicated notation we will limit the discussion to the example x(cr) = 
{x+(cr),x-(cr)}, with x+(cr) = (f>{+ V cr) and x„(cr) = cf>{- V cr); the more general 
case is similar and used in a later example. The observables are restricted to those 
g for which, for all configurations cr, there exist an analytic function G such that 
G(x\{cr), . . . , xd(ct)) = g(cr). This at first seems a severe restriction as it may ex- 
clude the eigenvector corresponding to the Gibbs state. It can be checked that this 
is not the case by using the formula given by Ruelle [ ] for this eigenvector. A 
simple example where this formalism can be carried out is for the interaction (p(cr) 
with pairwise exponentially decaying potential J\{n) = a" (with \a\ < 1). In this 
case <p(cr) - X„ >0 cv- 0iO - n c?" and the state observables are ^ (cr) = £n>o S +t(Tn a n and 
x_(cr) = Yju>o d-,o-„a"- In this case the observable x+ gives the energy of + spin at 
the origin, and x_ the energy of a - spin. 

Using the observables x+ and x_, the transfer operator can be re-expressed as 

TG (x(cr)) = ^ G (x + (?7 V cr) , x_ (77 V cr)) e ^ (tT) . (M.38) 
rie{+-\ 

In this equation the only reference to the configuration cr is when computing the 
new observable values x + (tj V cr) and x^{rj V cr). The iteration of the function that 
gives these values in terms of x+(cr) and x_(cr) is the dynamical system that will 
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reproduce the properties of the spin system. For the simple exponentially decaying 
potential this is given by two maps, F + and F . The map F + takes {x + {cr), x+(cr)} 
into |i + (+V(r), x_(+ V cr)} which is {a{ \ + x+), ax-} and the map F_ takes {x + , X-} 
into {ax + ,a(l + xJ)}. In a more general case we have maps F n that take x(cr) to 
x(r\ V cr). 

We can now define a new operator £ 



£G(x) d =TG(x(cr))= J] G(F v (x))e-^, (M.39) 

76{+-J 

where all dependencies on cr have disappeared — if we know the value of the state 
observables x, the action of £ on G can be computed. 

A dynamical system is formed out of the maps F v . They are chosen so 
that one of the state variables is the interaction energy. One can consider the 
two maps F + and F- as the inverse branches of a hyperbolic map /, that is, 
f~ l (x) = {F+(x), F-(x)}. Studying the thermodynamics of the interaction <p is 
equivalent to studying the long term behavior of the orbits of the map /, achiev- 
ing the transformation of the spin system into a dynamical system. 

Unlike the original transfer operator, the £ operator — acting in the space 
of observables that depend only on the state variables — is of trace-class (its 
trace is finite). The finite trace gives us a chance to relate the trace of £! to the 
partition function of a system of size n. We can do better. As most properties of 
interest (thermodynamics, fall-off of correlations) are determined directly from its 
spectrum, we can study instead the zeros of the Fredholm determinant det (1 -z£) 
by the technique of cycle expansions developed for dynamical systems |2]. A 
cycle expansion consists of finding a power series expansion for the determinant 
by writing det(l - z£) = exp(tr ln(l - z£))- The logarithm is expanded into a 
power series and one is left with terms of the form tr £ l to evaluate. For evaluating 
the trace, the £ operator is equivalent to 

£G(x) = f dyS(y-f(x))e-P>G(y) (M.40) 

JR D 



from which the trace can be computed: 



tr£" = > — — - (M.41) 

^ |det (1 - d x f° n \x)) | 

x=f<°»)(x) ' V XJ V ,n 

with the sum running over all the fixed points of f on) (all spin configurations of a 
given length). Here / (o " ) is / composed with itself n times, and H(x) is the energy 
of the configuration associated with the point x. In practice the map / is never 
constructed and the energies are obtained directly from the spin configurations. 
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To compute the value of trJJ 1 we must compute the value of d x f ° n ^; this 
involves a functional derivative. To any degree of accuracy a number x in the 
range of possible interaction energies can be represented by a finite string of spins 
e, such as x = <p{+, 6Q,e\,...,-, -,...)■ By choosing the sequence e to have a 
large sequence of spins -, the number x can be made as small as needed, so in 
particular we can represent a small variation by <p{rf). As x+{e) - <p{+ V e), from 
the definition of a derivative we have: 



dxRx) = ^55) ' (M42) 



where rf- m ^ is a sequence of spin strings that make <i>{j\ m) ) smaller and smaller. By 
substituting the definition of cp in terms of its pairwise interaction J(n) = ria" 7 
and taking the limit for the sequences rf m ' = {+,-,-,..., T] m+ \, rj m+ 2, . . .} one 
computes that the limit is a if y - 1, 1 if y < 1, and 0 if y > 1. It does not 
depend on the positive value of s. When y < 1 the resulting dynamical system is 
not hyperbolic and the construction for the operator £, fails, so one cannot apply 
it to potentials such as (1/2)^". One may solve this problem by investigating the 
behavior of the formal dynamical system as y — > 0. 

The manipulations have up to now assumed that the map / is smooth. If 
the dimension D of the embedding space is too small, / may not be smooth. 
Determining under which conditions the embedding is smooth is a complicated 
question [15]. But in the case of spin systems with pairwise interactions it is 
possible to give a simple rule. If the interaction is of the form 

0(cr) = ^ 6^ n Pk{n)d k ( M - 43 > 

n>l k 

where are polynomials and \ak\ < 1, then the state observables to use are 
x s ,k(c) = 5j <5+,o-„rc s fl^- For each k one uses xq^, x\^, ... up to the largest power 
in the polynomial p^. An example is the interaction with J\{n) = ti 2 (3/10)". It 
leads to a 3-dimensional system with variables xq$, x\ t o, and X2,o- The action 
of the map F + for this interaction is illustrated figure M.3. Plotted are the pairs 
{cp(+ V <r), <p(+ V + V cr)}. This can be seen as the strange attractor of a chaotic 
system for which the variables xq^, x^o, and X2,o provide a good (analytic) em- 
bedding. 

The added smoothness and trace-class of the X operator translates into faster 
convergence towards the thermodynamic limit. As the reconstructed dynamics is 
analytic, the convergence towards the thermodynamic limit is faster than expo- 
nential [23, 16]. We will illustrate this with the polynomial-exponential interac- 
tions (M.43) with y — 1, as the convergence is certainly faster than exponential 
if y > 1, and the case of a" has been studied in terms of another Fredholm de- 
terminant by Gutzwiller [17]. The convergence is illustrated in figure M.4 for the 
interaction ?i 2 (3/10)". Plotted in the graph, to illustrate the transfer matrix conver- 
gence, are the number of decimal digits that remain unchanged as the range of the 
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Figure M.3: The spin adding map F + for the po- 
tential J(n) = 2;i 2 a Q '". The action of the map takes 
the value of the interaction energy between + and the 
semi-infinite configuration {cr,, ct 2 , <t 3 , . . .) and returns 
the interaction energy between + and the configuration 
{+, cr l , 0-2,0-3, ■■■}■ 



0.5 1 
<K+va) 



I 1 1 1 1 I 1 1 1 1 I 



Figure M.4: Number of digits for the Fredholm 
method (•) and the transfer function method (x). The 
size refers to the largest cycle considered in the Fred- 
holm expansions, and the truncation length in the case 
of the transfer matrix. 



10 

size 



interaction is increased. Also in the graph are the number of decimal digits that 
remain unchanged as the largest power of tr £? considered. The plot is effectively 
a logarithmic plot and straight lines indicate exponentially fast convergence. The 
curvature indicates that the convergence is faster than exponential. By fitting, one 
can verify that the free energy is converging to its limiting value as exp(-?l 4 ^ 3) ). 
Cvitanovic [ ] has estimated that the Fredholm determinant of a map on a D di- 
mensional space should converge as exp(-/P +1 ^ Z)) ), which is confirmed by these 
numerical simulations. 



Resume 



The geometrization of spin systems strengthens the connection between statistical 
mechanics and dynamical systems. It also further establishes the value of the 
Fredholm determinant of the £, operator as a practical computational tool with 
applications to chaotic dynamics, spin systems, and semiclassical mechanics. The 
example above emphasizes the high accuracy that can be obtained: by computing 
the shortest 14 periodic orbits of period 5 or less it is possible to obtain three digit 
accuracy for the free energy. For the same accuracy with a transfer matrix one 
has to consider a 256 x 256 matrix. This make the method of cycle expansions 
practical for analytic calculations. 
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Remark M.l Presentation functions. The best place to read about Feigenbaum's 
work is in his review article published in Los Alamos Science (reproduced in various 
reprint collections and conference proceedings, such as ref. [ 5]). Feigenbaum's Journal 
of Statistical Physics article [ 1 7] is the easiest place to learn about presentation functions. 



Remark M.2 Interactions are smooth In most computational schemes for thermody- 
namic quantities the translation invariance and the smoothness of the basic interaction are 
never used. In Monte Carlo schemes, aside from the periodic boundary conditions, the in- 
teraction can be arbitrary. In principle for each configuration it could be possible to have a 
different energy. Schemes such as the Sweneson-Wang cluster flipping algorithm use the 
fact that interaction is local and are able to obtain dramatic speed-ups in the equilibration 
time for the dynamical Monte Carlo simulation. In the geometrization program for spin 
systems, the interactions are assumed translation invariant and smooth. The smoothness 
means that any interaction can be decomposed into a series of terms that depend only on 
the spin arrangement and the distance between spins: 

4>(o-q, o- u o- 2 , ...) = Joo-q + ^ S(o- 0 , cr n )Ji(n) + ^ 6(o-q, cr ni , cr m )J 2 {ni,n 2 ) + ■■■ 

where the are symmetric functions of their arguments and the 6 are arbitrary discrete 
functions. This includes external constant fields (Jo), but it excludes site dependent fields 
such as a random external magnetic field. 



Exercises 

M. 1 . Not all Banach spaces are also Hilbert. If we are 

given a norm ||- 1 of a Banach space B, it may be possible 
to find an inner product (■ , ■ ) (so that B is also a Hilbert 
space H) such that for all vectors / e B, we have 

11/11 = </,/) 1/2 - 

This is the norm induced by the scalar product. If we 
cannot find the inner product how do we know that we 
just are not being clever enough? By checking the paral- 
lelogram law for the norm. A Banach space can be made 
into a Hilbert space if and only if the norm satisfies the 
parallelogram law. The parallelogram law says that for 
any two vectors / and g the equality 

ll/ + Sl| 2 + ll/-Sl| 2 =2||/|| 2 + 2||s|| 2 , 

must hold. 



Consider the space of bounded observables with the 
norm given by ||a|| = sup^QN |fl(cr)|. Show that there 
is no scalar product that will induce this norm. 

M.2. Automaton for a droplet. Find the transition graph 
and the weights on the edges so that the energies of con- 
figurations for the droplet model are correctly generated. 
For any string starting in zero and ending in zero your 
diagram should yield a configuration the weight e H '°"', 
with H computed along the lines of (M.l 3) and (M.18). 

Hint: the transition graph is infinite. 

M.3. Spectral determinant for a" interactions. Compute 
the spectral determinant for 1 -dimensional Ising model 
with the interaction 

4>(o-) = ^ a k 6(o- 0 , o- k ) . 

k>0 
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Take a as a number smaller than 1/2. 

(a) What is the dynamical system this generates? That 
is, find F+ and F- as used in (M.39). 



(b) Show that 



a 0 
0 a 



MA. Ising model on a thin strip. Compute the transfer ma- 
trix for the Ising model defined on the graph 




Assume that whenever there is a bond connecting two 
sites, there is a contribution J5{a-j,crj) to the energy. 

M.5. Infinite symbolic dynamics. Let u be a func- 

tion that returns zero or one for every infinite binary 
string: <x : {0,1 } M — » {0,1}. Its value is represented 
by <r{e\, en, . . .) where the e, are either 0 or 1. We will 
now define an operator T that acts on observables on the 
space of binary strings. A function a is an observable if 
it has bounded variation, that is, if 

||a|| = sup \a(e\, 62, . . .)| < 00. 



For these functions 

Ta(e\,en, ■ . .) = a(0, e\, £2, ■ . .)cr(0, e\, ei, . . .) + a{\ 

The function <x is assumed such that any of T's "matrix 
representations" in (a) have the Markov property (the 
matrix, if read as an adjacency graph, corresponds to 
a graph where one can go from any node to any other 
node). 

(a) (easy) Consider a finite version T„ of the operator 
T: 

T„a(e l ,e 2 ,...,e„) = 

a(0, eu ft, ... , e„_i)o-(0, e u ez, ... , e„_i) 
a(l, ei, 62, . . . , e„_i)o-(l, e\, 62,..., e„-\) 

Show that T„ is a 2" x 2" matrix. Show that its 
trace is bounded by a number independent of n. 

(b) (medium) With the operator norm induced by the 
function norm, show that T is a bounded operator. 

(c) (hard) Show that T is not trace-class. (Hint: check 
if T is compact). 

Classes of operators are nested; trace-class < compact < 
bounded. 
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Noise/quantum corrections 



(G. Vattay) 




he Gutzwiller trace formula is only a good approximation to the quantum 



mechanics when % is small. Can we improve the trace formula by adding 



quantum corrections to the semiclassical terms? A similar question can 
be posed when the classical deterministic dynamics is disturbed by some way 
Gaussian white noise with strength D. The deterministic dynamics then can be 
considered as the weak noise limit D — > 0. The effect of the noise can be taken 
into account by adding noise corrections to the classical trace formula. A formal 
analogy exists between the noise and the quantum problem. This analogy allows 
us to treat the noise and quantum collections together. 



N.l Periodic orbits as integrable systems 

From now on, we use the language of quantum mechanics, since it is more con- 
venient to visualize the results there. Where it is necessary we will discuss the 
difference between noise and quantum cases. 

First, we would like to introduce periodic orbits from an unusual point of 
view, which can convince you, that chaotic and integrable systems are in fact 
not as different from each other, than we might think. If we start orbits in the 
neighborhood of a periodic orbit and look at the picture on the Poincare section 
we can see a regular picture. For stable periodic orbits the points form small 
ellipses around the center and for unstable orbits they form hyperbolas (See Fig. 
N.l). 

Figure N.l: Poincare section close to a stable and an unstable periodic orbit 
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The motion close to a periodic orbits is regular in both cases. This is due to 
the fact, that we can linearize the Hamiltonian close to an orbit, and linear systems 
are always integrable. The linearized Hamilton's equations close to the periodic 
orbit (q p (t) + q, p p {t) + p) look like 

q = +d 2 pq H(q p (t),p p (t))q + d 2 pp H(q p (t),p p (t))p, (N.l) 
p = -d 2 qq H{q p {t),p p {t))q-d 2 qp H{q p {t),p p {t))p, (N.2) 

where the new coordinates q and p are relative to a periodic orbit. This linearized 
equation can be regarded as a d dimensional oscillator with time periodic frequen- 
cies. These equations are representing the equation of motion in a redundant way 
since more than one combination of q, p and t determines the same point of the 
phase space. This can be cured by an extra restriction on the variables, a con- 
straint the variables should fulfill. This constraint can be derived from the time 
independence or stationarity of the full Hamiltonian 

d t H(q p (t) + q, Pp (t)+p) = 0. (N.3) 

Using the linearized form of this constraint we can eliminate one of the linearized 
equations. It is very useful, although technically difficult, to do one more transfor- 
mation and to introduce a coordinate, which is parallel with the Hamiltonian flow 
(jt||) and others which are orthogonal. In the orthogonal directions we again get 
linear equations. These equations with x\\ dependent rescaling can be transformed 
into normal coordinates, so that we get tiny oscillators in the new coordinates 
with constant frequencies. This result has first been derived by Poincare for equi- 
librium points and later it was extended for periodic orbits by V.I. Arnol'd and 
co-workers. In the new coordinates, the Hamiltonian reads as 

1 d ~ l 1 

Ho(.X\\,P\\,Xn,Pn) - jP\ + ^0||) + ^ -jiPl ± ^1x1), (N.4) 

which is the general form of the Hamiltonian in the neighborhood of a periodic 
orbit. The ± sign denotes, that for stable modes the oscillator potential is posi- 
tive while for an unstable mode it is negative. For the unstable modes, u is the 
Lyapunov exponent of the orbit 

(o n = \nA Pt JT p , (N.5) 

where A p>n is the expanding eigenvalue of the Jacobi matrix. For the stable direc- 
tions the eigenvalues of the Jacobi matrix are connected with u as 



A p ,„ = e~ ia, " T ". (N.6) 
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The Hamiltonian close to the periodic orbit is integrable and can be quantized by 
the Bohr-Sommerfeld rules. The result of the Bohr-Sommerfeld quantization for 
the oscillators gives the energy spectra 



+ i j for stable modes, (N.7) 
i n + - J for unstable modes, 



where j n = 0, 1, .... It is convenient to introduce the index s„ = 1 for stable and 
s n = -i for unstable directions. The parallel mode can be quantized implicitly 
trough the classical action function of the mode: 



1 / m n n\ 

p\\dx\\ = ^S\\(E m ) = n\m+ -i-j , (N.8) 



where m p is the topological index of the motion in the parallel direction. This 
latter condition can be rewritten by a very useful trick into the equivalent form 

(1 _ e iSn(E m )/n-im p n/2j _ q (N.9) 



The eigen-energies of a semiclassically quantized periodic orbit are all the possi- 
ble energies 



d-\ 

E — E m + ^ ' E n . 



(N.10) 



This relation allows us to change in (N.9) E m with the full energy minus the os- 
cillator energies E m - E - 2„ E n . All the possible eigenenergies of the periodic 
orbit then are the zeroes of the expression 



Ap(£) = (1 — g ! ' S l]( £_ 2n^n w »C/n + VWS-»>V''/ 2 ) 



(N.ll) 



JU..;Jd-l 



If we Taylor expand the action around E to first order 



S ll (E + e)*S ll (E) + T(E)e, 



(N.12) 



where T(E) is the period of the orbit, and use the relations of co and the eigenvalues 
of the Jacobi matrix, we get the expression of the Selberg product 



A p (E) = 



7l,-Jd-l 



( JS p (E)/H~im p n/2" 

1 - 

n a(1/2+;„) 
1 In 1 ^p,n 



(N.13) 
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If we use the right convention for the square root we get exactly the d dimensional 
expression of the Selberg product formula we derived from the Gutzwiller trace 
formula in ? . Just here we derived it in a different way! The function Ap(E) is 
the semiclassical zeta function for one prime orbit. 

Now, if we have many prime orbits and we would like to construct a function 
which is zero, whenever the energy coincides with the BS quantized energy of one 
of the periodic orbits, we have to take the product of these determinants: 

A(E) = Y\A p (E). (N.14) 

p 

The miracle of the semiclassical zeta function is, that if we take infinitely many 
periodic orbits, the infinite product will have zeroes not at these energies, but close 
to the eigen=energies of the whole system ! 

So we learned, that both stable and unstable orbits are integrable systems and 
can be individually quantized semiclassically by the old Bohr-Sommerfeld rules. 
So we almost completed the program of Sommerfeld to quantize general systems 
with the method of Bohr. Let us have a remark here. In addition to the Bohr- 
Sommerfeld rules, we used the unjustified approximation (N.I2). Sommerfeld 
would never do this I At that point we loose some important precision compared 
to the BS rules and we get somewhat worse results than a semiclassical formula 
is able to do. We will come back to this point later when we discuss the quantum 
corrections. To complete the program of full scale Bohr-Sommerfeld quantiza- 
tion of chaotic systems we have to go beyond the linear approximation around the 
periodic orbit. 

The Hamiltonian close to a periodic orbit in the parallel and normal coordi- 
nates can be written as the 'harmonic' plus 'anaharmonic' perturbation 

H(x\\,p\\,x n ,p n ) = H Q (x\\,p\\,x n ,p n ) + H A {x\\,x n ,p n ), (N.15) 

where the anaharmonic part can be written as a sum of homogeneous polynomials 
of x n and p„ with x\\ dependent coefficients: 

H A {x\\,x„,p n ) = Y,H k (x h x n ,p n ) (N.16) 

k=3 

H\x h X n , Pn ) = H L«S X \0 X 'nPn" (N-17) 

This classical Hamiltonian is hopeless from Sommerfeld's point of view, since 
it is non integrable. However, Birkhoff in 1927 3 introduced the concept of nor- 
mal form, which helps us out from this problem by giving successive integrable 
approximation to a non-integrable problem. Let's learn a bit more about it! 

3 It is really a pity, that in 1926 Schrodinger introduced the wave mechanics and blocked the 
development of Sommerfeld's concept. 
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N.2 The Birkhoff normal form 



Birkhoff studied the canonical perturbation theory close to an equilibrium point of 
a Hamiltonian. Equilibrium point is where the potential has a minimum VU = 0 
and small perturbations lead to oscillatory motion. We can linearize the prob- 
lem and by introducing normal coordinates Xn and conjugate momentums p n the 
quadratic part of the Hamiltonian will be a set of oscillators 

d 1 

H 0 (x n , Pn) = J] 7z(p 2 n + <JA)- (N.18) 

The full Hamiltonian can be rewritten with the new coordinates 

H(x n ,p n ) = H 0 (x n ,p n ) + H A {x n ,p n ), (N.19) 

where Ha is the anaharmonic part of the potential in the new coordinates. The 
anaharmonic part can be written as a series of homogeneous polynomials 



H A (Xn,Pn) = Yj&iXn.Pn), (N.20) 

7=3 

H\x n ,p n ) = J] KJf 1 , (N.21) 

\l\+\m\=j 



where hj are real constants and we used the multi-indices / := (4, /rf) with 
definitions 



y^Jn,* 1 '■- x x x x^...x A d . 



Birkhoff showed, that that by successive canonical transformations one can in- 
troduce new momentums and coordinates such, that in the new coordinates the 
anaharmonic part of the Hamiltonian up to any given n polynomial will depend 
only on the variable combination 



t„ - l -(pl + aj 2 n xl), (N.22) 



where x n and p n are the new coordinates and momentums, but oj,, is the original 
frequency. This is called the Birkhoff normal form of degree ,/V: 

N 

H(x n , Pn ) = ^ H\t { , T d ), (N.23) 

7=2 
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where W are homogeneous degree j polynomials of r-s. This is an integrable 
Hamiltonian, the non-integrability is pushed into the remainder, which consists of 
polynomials of degree higher than N. We run into trouble only when the oscillator 
frequencies are commensurate e.g. it is possible to find a set of integers 17^ such 
that the linear combination 

d 

a> n m n , 

n=\ 

vanishes. This extra problem has been solved by Gustavson in 1966 and we 
call the object Birkhoff-Gustavson normal form. The procedure of the succes- 
sive canonical transformations can be computerized and can be carried out up to 
high orders (~ 20). 

Of course, we pay a price for forcing the system to be integrable up to degree 
N. For a non-integrable system the high order terms behave quite wildly and the 
series is not convergent. Therefore we have to use this tool carefully. Now, we 
learned how to approximate a non-integrable system with a sequence of integrable 
systems and we can go back and carry out the BS quantization. 



N.3 Bohr-Sommerfeld quantization of periodic orbits 

There is some difference between equilibrium points and periodic orbits. The 
Hamiltonian (N.4) is not a sum of oscillators. One can transform the parallel 
part, describing circulation along the orbit, into an oscillator Hamiltonian, but this 
would make the problem extremely difficult. Therefore, we carry out the canonical 
transformations dictated by the Birkhoff procedure only in the orthogonal direc- 
tions. The x\\ coordinate plays the role of a parameter. After the transformation up 
to order N the Hamiltonian (N. 17) is 

N 

H(X\\,p\\,Tl,...T d -\) = flo(*||./>||»Tl, ...,T d _l)+^ U j (x\\,Tl, -,Td-l),Q^.24) 

j=2 

where U J is a jth order homogeneous polynomial of r-s with x[\ dependent co- 
efficients. The orthogonal part can be BS quantized by quantizing the individual 
oscillators, replacing r-s as we did in (N.8). This leads to a one dimensional 
effective potential indexed by j\, j d ~\ 

H(x\\,P\\Ju-,jd-i) = -xp\ + U(x n ) + ^hs n u} n (j n + 1/2) + (N.25) 

1 n=\ 

N 

+ ^ l U k (x\\,hsia)i(j 1 + l/2),^5 2 w 2 0'2 + 1/2), ...,Tis d -iu d -.i{j d -.i + 1/2)), 

k=2 
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where j„ can be any non-negative integer. The term with index k is proportional 
with ft k due to the homogeneity of the polynomials. 

The parallel mode now can be BS quantized for any given set of j-s 



S p {E,jx,..„j d -i) = 




(N.26) 



d-i 

E- ^ fts n (o n (j n + 1/2) - U(x\\,ji, ...,jd-l) = InUm + m p /2), 
n=l 

where U contains all the x\\ dependent terms of the Hamiltonian. The spectral 
determinant becomes 

A P (E)= Y\ (l- e iS " {E ' jl '-' jd - l)/n - m " n/2 ). (N.27) 

jl,—,jd-l 

This expression completes the Sommerfeld method and tells us how to quan- 
tize chaotic or general Hamiltonian systems. Unfortunately, quantum mechanics 
postponed this nice formula until our book. 

This formula has been derived with the help of the semiclassical Bohr-Sommerfeld 
quantization rule and the classical normal form theory. Indeed, if we expand S p 
in the exponent in the powers of fi 

N 

s P - h k Sk, 

k=0 

we get more than just a constant and a linear term. This formula already gives 
us corrections to the semiclassical zeta function in all powers of fi. There is a 
very attracting feature of this semiclassical expansion, fi in S p shows up only 
in the combination Hs n co n (j n + 1/2). A term proportional with ft can only be a 
homogeneous expression of the oscillator energies s^a> n (j n + 1/2). For example 
in two dimensions there is only one possibility of the functional form of the order 
k term 

S k = c k (E)-co k n (j + l/2) k , 

where ct(E) is the only function to be determined. 

The corrections derived sofar are doubly semiclassical, since they give semi- 
classical corrections to the semiclassical approximation. What can quantum me- 
chanics add to this ? As we have stressed in the previous section, the exact quan- 
tum mechanics is not invariant under canonical transformations. In other context, 
this phenomenon is called the operator ordering problem. Since the operators x 
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and p do not commute, we run into problems, when we would like to write down 
operators for classical quantities like j?p 2 . On the classical level the four possible 
orderings xpxp, ppxx, pxpx and xxpp are equivalent, but they are different in 
the quantum case. The expression for the energy (N.26) is not exact. We have to 
go back to the level of the Schrodinger equation if we would like to get the exact 
expression. 



N.4 Quantum calculation of fi corrections 



The Gutzwiller trace formula has originally been derived from the saddle point 
approximation of the Feynman path integral form of the propagator. The exact 
trace is a path-sum for all closed paths of the system 

MW.O.J^a.J^, (N.28, 

where J Dx denotes the discretization and summation for all paths of time length 
t in the limit of the infinite refinement and S (x, t) is the classical action calculated 
along the path. The trace in the saddle point calculation is a sum for classical 
periodic orbits and zero length orbits, since these are the extrema of the action 
6S(x, t) - 0 for closed paths: 

TrG(*, x', t) = g 0 (t) + £ f £>f P e iS( t'> +x >' m/n , (N.29) 



where go(0 is the zero length orbit contribution. We introduced the new coordi- 
nate ij p with respect to the periodic orbit x p (t), x = £ p + x p (t). Now, each path 
sum J D^ p is computed in the vicinity of periodic orbits. Since the saddle points 
are taken in the configuration space, only spatially distinct periodic orbits, the so 
called prime periodic orbits, appear in the summation. Sofar nothing new has 
been invented. If we continue the standard textbook calculation scheme, we have 
to Taylor expand the action in g p and keep the quadratic term in the exponent 
while treating the higher order terms as corrections. Then we can compute the 
path integrals with the help of Gaussian integrals. The key point here is that we 
don't compute the path sum directly. We use the correspondence between path 
integrals and partial differential equations. This idea comes from Maslov |5] and 
a good summary is in ref. [6]. We search for that Schrodinger equation, which 
leads to the path sum 



Of (P<£p + *pWA/* t ( N .30) 
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where the action around the periodic orbit is in a multi-dimensional Taylor ex- 
panded form: 



S(x,t) = |] s n (t)(x - x p (t)) n /nl. (N.31) 



The symbol n = (n\,n2, . denotes the multi index in d dimensions, n! = 
]~[f =1 «/! the multi factorial and (x - x p (t)) n = Ylf = i(xi - x p ,i(t)) n ', respectively. 
The expansion coefficients of the action can be determined from the Hamilton- 
Jacobi equation 

d,S + j(VS) 2 + U = 0, (N.32) 

in which the potential is expanded in a multidimensional Taylor series around the 
orbit 

U(x) = ^ «»(')(* - XpWf/nL (N.33) 



The Schrodinger equation 

h 2 

m t ift = Hip = - —AiJ/ + Uif/, (N.34) 

with this potential also can be expanded around the periodic orbit. Using the WKB 
ansatz 



iff = <pe iS/n , (N.35) 

we can construct a Schrodinger equation corresponding to a given order of the 
Taylor expansion of the classical action. The Schrodinger equation induces the 
Hamilton- Jacobi equation (N.32) for the phase and the transport equation of Maslov 
and Fjedoriuk [7] for the amplitude: 

dtp + VifiVS + j>fiAS - jAif = 0. (N.36) 

This is the partial differential equation, solved in the neighborhood of a periodic 
orbit with the expanded action (N.31), which belongs to the local path-sum (N.30). 

If we know the Green's function G p (^,^', t) corresponding to the local equa- 
tion (N.36), then the local path sum can be converted back into a trace: 



f 



Df p e i/n 2n 5nGv„W,0^/n! = Tr(7n(£, ?,t). (N.37) 
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The saddle point expansion of the trace in terms of local traces then becomes 

TrG(x, x',t) = TrG w (x, x', t) + Y TrG p (£, ? , t), (N.38) 

p 

where Gw(x, xf,t) denotes formally the Green's function expanded around zero 
length (non moving) periodic orbits, known as the Weyl term [ ]. Each Green's 
function can be Fourier-Laplace transformed independently and by definition we 
get in the energy domain: 

TrG(x, x',E) = g 0 (E) + ^ TrG p (£ f , E). (N.39) 

p 

Note that we do not need here to take further saddle points in time, since we 
are dealing with exact time and energy domain Green's functions. indexGreen's 
function! energy dependent 

The spectral determinant is a function which has zeroes at the eigen-energies 
E n of the Hamilton operator H. Formally it is 

A(£) = det {E-H) = Y\(E - E n ). 

n 

The logarithmic derivative of the spectral determinant is the trace of the energy 
domain Green's function: 

TrG(x, *,E) = Ya = l0g ^ (N - 40) 

n 11 

We can define the spectral determinant A p (E) also for the local operators and we 
can write 

TvG,M, f, E) = -^ log A P (E). (N.41) 

Using (N.39) we can express the full spectral determinant as a product for the 
sub-determinants 

HE) = e WiE) Y\A p (E), 

p 

rE 

where W(E) = J go(E')dE' is the term coming from the Weyl expansion. 

The construction of the local spectral determinants can be done easily. We 
have to consider the stationary eigenvalue problem of the local Schrodinger prob- 
lem and keep in mind, that we are in a coordinate system moving together with 
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the periodic orbit. If the classical energy of the periodic orbit coincides with an 
eigen-energy E of the local Schrodinger equation around the periodic orbit, then 
the corresponding stationary eigenfunction fulfills 

xjfp^, t + T p ) = J d?G p {%, r , t + TpWpg, t) = e- iET ' /n iJ, p ({, t), (N.42) 

where T p is the period of the prime orbit p. If the classical energy of the periodic 
orbit is not an eigen-energy of the local Schrodinger equation, the non-stationary 
eigenfunctions fulfill 

^ p (£, t + T p ) = J d?G p (g, r , t + Tp)typ{g, t) = e- iET P /n A l p (E)^ p (t), (N.43) 

where 1 = (l\ , I2, ...) is a multi-index of the possible quantum numbers of the local 
Schrodinger equation. If the eigenvalues ^,{E) are known the local functional 
determinant can be written as 



A P (E) = Y\(l ~ A P (E)), 



(N.44) 



since A p (E) is zero at the eigen=energies of the local Schrodinger problem. We 
can insert the ansatz (N.35) and reformulate (N.43) as 



e^ ,+T 'V P (f + T p ) - e- iET "' n A l p (E)e^V P (t). 



(N.45) 



The phase change is given by the action integral for one period S(t + T p ) — S(t) ■ 
J 0 p L(t)dt. Using this and the identity for the action S P (E) of the periodic orbit 



Sn{E) 



§ pdq= X ' 



L(t)dt + ET P , 



(N.46) 



we get 



e&pWiftJt + T p ) = A l JE)cp l Jt). 



(N.47) 



Introducing the eigen-equation for the amplitude 



if X Jt + T p ) = RUE^Jt), 



(N.48) 



the local spectral determinant can be expressed as a product for the quantum num- 
bers of the local problem: 



A p (E) = y\(l-R lp (E)e-* s " iE) ) 



(N.49) 
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Since fi is a small parameter we can develop a perturbation series for the am- 
plitudes <p p (t) - Zm=o (t) which can be inserted into the equation (N.36) 
and we get an iterative scheme starting with the semiclassical solution $• >; 

d t ip m + V (0) V,S + ^f m AS = 0, (N.50) 
The eigenvalue can also be expanded in powers of ih/2: 



= exp(C»»)|l + |ci>(f) 2 (I<C»V + C £») + .... (N.52) 

The eigenvalue equation (N.48) in fi expanded form reads as 
<pf\t + T p ) = exp(C[>« 0) (0, 

<pf\t + T p ) = exp(Cj>f (,) + C y?\t) + (Cg + ^(C«) 2 )^(53) 

and so on. These equations are the conditions selecting the eigenvectors and 
eigenvalues and they hold for all t. 

It is very convenient to expand the functions (p p "\x, t) in Taylor series around 
the periodic orbit and to solve the equations (N.51) in this basis [10], since only 
a couple of coefficients should be computed to derive the first corrections. This 
technical part we are going to publish elsewhere |9]. One can derive in general 
the zero order term c[ 0) = inv p + Yfi=\ (k + 5) u p,u where u p j = log A p j are 
the logarithms of the eigenvalues of the monodromy matrix M p and v p is the 
topological index of the periodic orbit. The first correction is given by the integral 

Jo 



When the theory is applied for billiard systems, the wave function should 
fulfill the Dirichlet boundary condition on hard walls, e.g. it should vanish on the 
wall. The wave function determined from (N.36) behaves discontinuously when 
the trajectory x p (i) hits the wall. For the simplicity we consider a two dimensional 
billiard system here. The wave function on the wall before the bounce (to ) is 
given by 

if, in (x, y(x), t) = <p(x, y(x), ^y^M.M,)/^ (N-54) 



qmnoise - 19jun2003 



ChaosBook.org versionl4.4.1, Apr 21 2013 



APPENDIX N. NOISE/QUANTUM CORRECTIONS 



963 



where y(x) - Y^x 2 /2! + y3X 3 /3! + Y4X 4 /4! + ... is the parametrization of the wall 
around the point of reflection (see Fig 1.). The wave function on the wall after the 
bounce (f+o) is 

xj, out {x, y(x), t) = <p(x, y(x), , +o)e W*),r + o)/* (N . 55) 

The sum of these wave functions should vanish on the hard wall. This implies that 
the incoming and the outgoing amplitudes and the phases are related as 

S (x, y(x), f_ 0 ) - S (x, y(x), t +Q ), (N.56) 

and 

<p(x, y(x), ?„ 0 ) = ~<p(x, y(x), f +0 ). (N.57) 

The minus sign can be interpreted as the topological phase coming from the hard 
wall. 

Now we can reexpress the spectral determinant with the local eigenvalues: 
A(E) = e W(E) ]~[ ri (1 ~ R iA E ^ Sp ^ E) y ( N - 58 ) 

p 1 

This expression is the quantum generalization of the semiclassical Selberg-product 
formula [11]. A similar decomposition has been found for quantum Baker maps 
in ref. [12]. The functions 

£\E) = Y\(l - R lp (E)e* s » (E) ) (N.59) 
p 

are the generalizations of the Ruelle type [23] zeta functions. The trace formula 
can be recovered from (N.40): 

1 ^ dlogRiJE) R lp (E)e-fi s i>W 

lfl p\ dE \-R\, p (E)en s o (E) 

We can rewrite the denominator as a sum of a geometric series and we get 

TrG(E) = g 0 (E) + - 2](T P (E) - m B dE py J )(R lp (E))'eT' rS ^ E) .(N.6l) 

p,r,\ 

The new index r can be interpreted as the repetition number of the prime orbit 
p. This expression is the generalization of the semiclassical trace formula for 
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the exact quantum mechanics. We would like to stress here, that the perturbation 
calculus introduced above is just one way to compute the eigenvalues of the local 
Schrodinger problems. Non-perturbative methods can be used to calculate the 
local eigenvalues for stable, unstable and marginal orbits. Therefore, our trace 
formula is not limited to integrable or hyperbolic systems, it can describe the 
most general case of systems with mixed phase space. 

The semiclassical trace formula can be recovered by dropping the sub-leading 

<yy) r 1(0) 

term -ifid log R\ p (E)/dE and using the semiclassical eigenvalue (E) - e p = 
e -iv p n g-2,('i +1 /2)M P ,i _ Summation for the indexes /, yields the celebrated semiclas- 
sical amplitude 



E 



(/?. (0) (£■))'' = — . (N.62) 

l 'P y IdetCl-Mpi 1 / 2 



To have an impression about the improvement caused by the quantum correc- 
tions we have developed a numerical code [13] which calculates the first correc- 
tion C (1 J for general two dimensional billiard systems . The first correction de- 
pends only on some basic data of the periodic orbit such as the lengths of the free 
flights between bounces, the angles of incidence and the first three Taylor expan- 
sion coefficients Y%, F3, Y4 of the wall in the point of incidence. To check that our 
new local method gives the same result as the direct calculation of the Feynman in- 
tegral, we computed the first H correction (5 q for the periodic orbits of the 3-disk 
scattering system [ I ] where the quantum corrections have been We have found 
agreement up to the fifth decimal digit, while our method generates these numbers 
with any desired precision. Unfortunately, the / + 0 coefficients cannot be com- 
pared to ref. [15], since the / dependence was not realized there due to the lack of 
general formulas (N.58) and (N.59). However, the I dependence can be checked 
on the 2 disk scattering system [16]. On the standard example [14, 15, 16, 18], 
when the distance of the centers (R) is 6 times the disk radius (a), we got 



C, (1) - -— (-0.625/ 3 - 0.3125/ 2 + 1.4375/ + 0.625). 
V2£ 



For 1 = 0 and 1 this has been confirmed by A. Wirzba [17], who was able to 
compute Cq from his exact quantum calculation. Our method makes it possible 
to utilize the symmetry reduction of Cvitanovic and Eckhardt and to repeat the 
fundamental domain cycle expansion calculation of ref. [1 8] with the first quan- 
tum correction. We computed the correction to the leading 226 prime periodic 
orbits with 10 or less bounces in the fundamental domain. Table I. shows the 
numerical values of the exact quantum calculation [16], the semiclassical cycle 
expansion [ ] and our corrected calculation. One can see, that the error of the 
corrected calculation vs. the error of the semiclassical calculation decreases with 
the wave-number. Besides the improved results, a fast convergence up to six dec- 
imal digits can be observed, which is just three decimal digits in the full domain 
calculation [15]. 
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Figure N.2: A typical bounce on a billiard wall. The wall can be characterized by the local expan- 
sion y(x) = Y 2 x 2 /2\ + Y 3 x 3 /3\ + Y 4 x 4 /4\ + .... 

Table N.l: Real part of the resonances (Re k) of the 3-disk scattering system at disk separation 6:1. 
Semiclassical and first corrected cycle expansion versus exact quantum calculation and the error of 
the semiclassical <5 XC divided by the error of the first correction 5c orr - The magnitude of the error in 
the imaginary part of the resonances remains unchanged. 



Quantum 


Semiclassical 


First correction 


8scl8corr 


0.697995 


0.758313 


0.585150 


0.53 


2.239601 


2.274278 


2.222930 


2.08 


3.762686 


3.787876 


3.756594 


4.13 


5.275666 


5.296067 


5.272627 


6.71 


6.776066 


6.793636 


6.774061 


8.76 


30.24130 


30.24555 


30.24125 


92.3 


31.72739 


31.73148 


31.72734 


83.8 


32.30110 


32.30391 


32.30095 


20.0 


33.21053 


33.21446 


33.21048 


79.4 


33.85222 


33.85493 


33.85211 


25.2 


34.69157 


34.69534 


34.69152 


77.0 
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Appendix S 
Projects 



You are urged to work through the essential steps in a project that combines 
the techniques learned in the course with some application of interest to 
you for other reasons. It is OK to share computer programs and such, but 
otherwise each project should be distinct, not a group project. The essential steps 
are: 

• Dynamics 

1 . construct a symbolic dynamics 

2. count prime cycles 

3. prune inadmissible itineraries, construct transition graphs if appropri- 
ate 

4. implement a numerical simulator for your problem 

5. compute a set of the shortest periodic orbits 

6. compute cycle stabilities 

• Averaging, numerical 

1. estimate by numerical simulation some observable quantity, like the 
escape rate, 

2. or check the flow conservation, compute something like the Lyapunov 
exponent 

• Averaging, periodic orbits 

1 . implement the appropriate cycle expansions 

2. check flow conservation as function of cycle length truncation, if the 
system is closed 

3. implement desymmetrization, factorization of zeta functions, if dy- 
namics possesses a discrete symmetry 
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4. compute a quantity like the escape rate as a leading zero of a spectral 
determinant or a dynamical zeta function. 

5. or evaluate a sequence of truncated cycle expansions for averages, 
such as the Lyapunov exponent or/and diffusion coefficients 

6. compute a physically intersting quantity, such as the conductance 

7. compute some number of the classical and/or quantum eigenvalues, if 
appropriate 
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S.l Deterministic diffusion, zig-zag map 



To illustrate the main idea of chapter 25, tracking of a globally diffusing orbit 
by the associated confined orbit restricted to the fundamental cell, we consider 
a class of simple 1 -dimensional dynamical systems, chains of piecewise linear 
maps, where all transport coefficients can be evaluated analytically. The transla- 
tional symmetry (25.10) relates the unbounded dynamics on the real line to the 
dynamics restricted to a "fundamental cell" - in the present example the unit in- 
terval curled up into a circle. An example of such map is the sawtooth map 

Ax x e [0, 1/4 + 1/4A] 

f(x) = { -Ax + (A+l)/2 xe [1/4 + 1/4A.3/4- 1/4A] . (S.l) 

Ax + (1-A) xe [3/4- 1/4A, 1] 

The corresponding circle map f(x) is obtained by modulo the integer part. The 
elementary cell map f(x) is sketched in figure S.l. The map has the symmetry 
property 

/(*) = -/(-*), (S-2) 

so that the dynamics has no drift, and all odd derivatives of the generating function 
(25.3) with respect to /? evaluated at /3 = 0 vanish. 

The cycle weights are given by 
, t ^-. (S.3) 
The diffusion constant formula for 1 -dimensional maps is 



D = \~rr (S - 4) 

2 (n) ( 



where the "mean cycle time" is given by 
d 1 



<«>f = z- 



Z, , n„, + • • • + n„, 
(-1)*"^ 7 T~ > (S.5) 



"&f(0,z) 

the mean cycle displacement squared by 

Xh Pl +---+h„.) 2 



( h2 )r^?ih = -Z^ T'"aT ■ (S - 6) 

'f d/3 2 £(fi,l) p=0 ^ \A pi ---A Pk \ 



Problems/projDDiffl.tex 7aug2002 



ChaosBook.org version!4.4.1, Apr21 2013 



APPENDIX S. PROJECTS 



970 




1 45 2 67 3 X 1 45 2 67 3 

(a) (b) 




14 5 2 6 7 3 x 




14 5 2 6 7 3 
(d) 



Figure S.l: (a)-(f) The sawtooth map (S.l) for the 
6 values of parameter a for which the folding point 
of the map aligns with the endpoint of one of the 7 
intervals and yields a finite Markov partition (from 
ref. [1]). The corresponding transition graphs are 
given in figure S.2. 




and the sum is over all distinct non-repeating combinations of prime cycles. Most 
of results expected in this projects require no more than pencil and paper compu- 
tations. 

Implementing the symmetry factorization (25.35) is convenient, but not es- 
sential for this project, so if you find sect. 21.1.1 too long a read, skip the sym- 
metrization. 



S.1.1 The full shift 



Take the map (S.l) and extend it to the real line. As in example of figure 25.3, 
denote by a the critical value of the map (the maximum height in the unit cell) 

.1 1 A+ 1 
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Describe the symbolic dynamics that you obtain when a is an integer, and derive 
the formula for the diffusion constant: 

_ (A 1)(A - 3) f or a = 4a -1, aeZ. (S.8) 
96A 

If you are going strong, derive also the fromula for the half-integer a = (2k + l)/2, 
A = 4a + 1 case and email it to DasBuch@nbi.dk. You will need to partition M 
into the left and right half, Wfc = Ms U Mg, as in the derivation of (25.21). 



exercise 25.1 



S.1.2 Subshifts of finite type 



We now work out an example when the partition is Markov, although the slope is 
not an integer number. The key step is that of having a partition where intervals 
are mapped onto unions of intervals. Consider for example the case in which 
A = 4a — 1, where 1 < a < 2. A first partition is constructed from seven intervals, 
which we label {My, M4, M5, M%, Me, Mj, AI3}, with the alphabet ordered as 
the intervals are laid out along the unit interval. In general the critical value a will 
not correspond to an interval border, but now we choose a such that the critical 
point is mapped onto the right border of Mi , as in figure S. 1 (a). The critical value 
of /() is /(tx") = a - 1 = (A - 3)/4. Equating this with the right border of M , 
x = 1 /A, we obtain a quadratic equation with the expanding solution A = 4. We 
have that /(AI4) = /(AI5) - Mi, so the transition matrix (14.1) is given by 



<f>' = T<f> = 



1 1 


1 


1 


1 


0 


0 


1 \ 


1 


0 


0 


1 


0 


0 


1 


1 


0 


0 


1 


0 


0 


1 


1 


0 


0 


1 


0 


0 


1 


1 


0 


0 


1 


0 


0 


1 


. 1 


0 


0 


1 


1 


1 


1, 



(pi 

<f>4 
<p5 
4>2 
<p6 
<f>7 
\<f>3J 



(S.9) 



and the dynamics is unrestricted in the alphabet 



{1,41, 51, 2, 63, 73,3,}. 



One could diagonalize (S.9) on the computer, but, as we saw in chapter 14, the 
transition graph figure S.2(b) corresponding to figure S.l (a) offers more insight 
into the dynamics. The dynamical zeta function 



l/£ = 1 - (fi + h + h) - 2(t u + tyj) 
z z 2 

lit = i_3±_4coshy6--. (S.10) 
A A 2 



follows from the loop expansion (15.15) of sect. 15.3. 
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Figure S.2: (a) The sawtooth map (S.l) partition 
tree for figure S . 1 (a) ; while intervals M i , M2 , M3 
map onto the whole unit interval, f(Mi) = 
f(Mi) = f(Mi) = M, intervals M^,Ms map 
onto Mi only, /(Ma) = f(Ms) = Mi, and sim- 
ilarly for intervals Me, Mj. An initial point start- 
ing out in the interval Mi, M2 or M3 can land 
anywhere on the unit interval, so the subtrees orig- 
inating from the corresponding nodes on the parti- 
tion three are similar to the whole tree and can be 
identified (as, for example, in figure 14.5), yield- 
ing (b) the transition graph for the Markov parti- 
tion of figure S.l (a), (c) the transition graph in the 
compact notation of (25.26). 




(b) 1 



The material flow conservation sect. 20.3.2 and the symmetry factorization 
(25.35) yield 



1 



<T(0, i) 



which indeed is satisfied by the given value of A. Conversely, we can use the 
desired Markov partition topology to write down the corresponding dynamical 
zeta function, and use the 1/^(0, 1) = 0 condition to fix A. For more complicated 
transition matrices the factorization (25.35) is very helpful in reducing the order 
of the polynomial condition that fixes A. 

The diffusion constant follows from (25.36) and (S.4) 



D = - 



1 1 



2 A+ 1 



10 



Think up other non-integer values of the parameter for which the symbolic dy- 
namics is given in terms of Markov partitions: in particular consider the cases 
illustrated in figure S.l and determine for what value of the parameter a each of 
them is realized. Work out the transition graph, symmetrization factorization and 
the diffusion constant, and check the material flow conservation for each case. De- 
rive the diffusion constants listed in table S.l. It is not clear why the final answers 
tend to be so simple. Numerically, the case of figure S.l (c) appears to yield the 
maximal diffusion constant. Does it? Is there an argument that it should be so? 

The seven cases considered here (see table S.l, figure S.l and (S.8)) are the 
7 simplest complete Markov partitions, the criterion being that the critical points 
map onto partition boundary points. This is, for example, what happens for uni- 
modal tent map; if the critical point is preperiodic to an unstable cycle, the gram- 
mar is complete. The simplest example is the case in which the tent map critical 
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figure S.l 


A 


D 


(a) 
(b) 

(c) 

V" ) 

(d) 
(e) 

(f) 


3 
4 

V5 + 2 

i(VT7 + 5) 
5 

±(V33 + 5) 
2V2 + 3 

i(V33 + 7) 
7 


0 

1 

10 
1 

2V5 
2 

Vl7 
2 

i + |V33 
1 

2V2 
1 + 1 

4 + 4V33 
2 
7 



Table S.l: The diffusion constant as function of the slope A for the a = 1, 2 values of ( S.8) 
and the 6 Markov partitions of figure S.l 

point is preperiodic to a unimodal map 3-cycle, in which case the grammar is of 
golden mean type, with _00_ substring prohibited (see figure 14.5). In case at 
hand, the "critical" point is the junction of branches 4 and 5 (symmetry automat- 
ically takes care of the other critical point, at the junction of branches 6 and 7), 
and for the cases considered the critical point maps into the endpoint of each of 
the seven branches. 

One can fill out parameter a axis arbitrarily densely with such points - each of 
the 7 primary intervals can be subdivided into 7 intervals obtained by 2-nd iterate 
of the map, and for the critical point mapping into any of those in 2 steps the 
grammar (and the corresponding cycle expansion) is finite, and so on. 

5.1.3 Diffusion coefficient, numerically 

(optional:) 

Attempt a numerical evaluation of 

D=\\im-{^\. (S.ll) 

Study the convergence by comparing your numerical results to the exact answers 
derived above. Is it better to use few initial x and average for long times, or to 
use many initial x. for shorter times? Or should one fit the distribution of £ with 
a Gaussian and get the D this way? Try to plot dependence of D on A; perhaps 
blow up a small region to show that the dependance of D on the parameter A is 
fractal. Compare with figure 25.5 and figures in refs. [1, 2, 9, 8, 10]. 

5.1.4 D is a nonuniform function of the parameters 

(optional:) 

The dependence of D on the map parameter A is rather unexpected - even though 
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for larger A more points are mapped outside the unit cell in one iteration, the 
diffusion constant does not necessarily grow. An interpretation of this lack of 
monotonicity would be interesting. 

You can also try applying periodic orbit theory to the sawtooth map ($.1) for 
a random "generic" value of the parameter A, for example A = 6. The idea is to 
bracket this value of A by the nearby ones, for which higher and higher iterates 
of the critical value a - (A + l)/4 fall onto the partition boundaries, compute the 
exact diffusion constant for each such approximate Markov partition, and study 
their convergence toward the value of D for A = 6. Judging how difficult such 
problem is already for a tent map (see sect. 15.5 and appendix E.l), this is too 
ambitious for a week-long exam. 
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S.2 Deterministic diffusion, sawtooth map 

To illustrate the main idea of chapter 25, tracking of a globally diffusing orbit 
by the associated confined orbit restricted to the fundamental cell, we consider 
in more detail the class of simple 1 -dimensional dynamical systems, chains of 
piecewise linear maps (25.9). The translational symmetry (25.10) relates the un- 
bounded dynamics on the real line to the dynamics restricted to a "fundamental 
cell" - in the present example the unit interval curled up into a circle. The corre- 
sponding circle map f(x) is obtained by modulo the integer part. The elementary 
cell map f(x) is sketched in figure 25.3. The map has the symmetry property 

f{x) = -f(-x) , (S.12) 

so that the dynamics has no drift, and all odd derivatives of the generating function 
(25.3) with respect to /? evaluated at /? = 0 vanish. 

The cycle weights are given by 
, p = ^. (S.13, 
The diffusion constant formula for 1 -dimensional maps is 

D=\\± (S.14) 

2 (n) ( 

where the "mean cycle time" is given by 
d 1 



{n)^ = Z- 



the mean cycle displacement squared by 



z =i ^ i a p, • • • 



\ n k ~ W 2 WJ) - - " 'A- •••A..I ' (S " 16) 



0=0 I^Pl ' ' ' ^Pk 



and the sum is over all distinct non-repeating combinations of prime cycles. Most 
of results expected in this projects require no more than pencil and paper compu- 
tations. 



S.2.1 The full shift 



Reproduce the formulas of sect. 25.2 for the diffusion constant D for A both even 
and odd integer. 
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fi 0 1 1 TP 9^ zl 


A 


n 

IS 


(a) 


4 

2+ V6 


I 

i-IVS 


(b) 
(c) 


2V2 + 2 
5 


15+2 V2 
16+4 V2 
1 


(d) 
(e) 


3+ V5 
3 + V7 
6 


5 A-1 

2 3A-4 
5A-4 
3A-2 
5 
6 



Table S.2: The diffusion constant as function of the slope A for the A = 4, 6 values of 
(25.20) and the 5 Markov partitions like the one indicated in figure 25.4. 

5.2.2 Subshifts of finite type 

We now work out examples when the partition is Markov, although the slope is 
not an integer number. The key step is that of having a partition where intervals 
are mapped onto unions of intervals. 

Start by reproducing the formula (25.28) of sect. 25.2.2 for the diffusion con- 
stant D for the Markov partition, the case where the critical point is mapped onto 
the right border of 

Think up other non-integer values of the parameter A for which the symbolic 
dynamics is given in terms of Markov partitions: in particular consider the remain- 
ing four cases for which the critical point is mapped onto a border of a partition in 
one iteration. Work out the transition graph symmetrization factorization and the 
diffusion constant, and check the material flow conservation for each case. Fill in 
the diffusion constants missing in table S.2. It is not clear why the final answers 
tend to be so simple. What value of A appears to yield the maximal diffusion 
constant? 

The 7 cases considered here (see table S.2 and figure 25.4) are the 7 simplest 
complete Markov partitions in the 4 < A < 6 interval, the criterion being that the 
critical points map onto partition boundary points. In case at hand, the "critical" 
point is the highest point of the left branch of the map (symmetry automatically 
takes care of the other critical point, the lowest point of the left branch), and for 
the cases considered the critical point maps into the endpoint of each of the seven 
branches. 

One can fill out parameter a axis arbitrarily densely with such points - each of 
the 6 primary intervals can be subdivided into 6 intervals obtained by 2-nd iterate 
of the map, and for the critical point mapping into any of those in 2 steps the 
grammar (and the corresponding cycle expansion) is finite, and so on. 

5.2.3 Diffusion coefficient, numerically 

(optional:) 
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Attempt a numerical evaluation of 



Z n — >oo fi \ I 



(S.17) 



Study the convergence by comparing your numerical results to the exact answers 
derived above. Is it better to use few initial x and average for long times, or to 
use many initial jc for shorter times? Or should one fit the distribution of x with 
a Gaussian and get the D this way? Try to plot dependence of D on A; perhaps 
blow up a small region to show that the dependance of D on the parameter A is 
fractal. Compare with figure 25.5 and figures in refs. [1, 2, 9, 8, 10]. 

S.2.4 D is a nonuniform function of the parameters 

( optional:) 

The dependence of D on the map parameter A is rather unexpected - even though 
for larger A more points are mapped outside the unit cell in one iteration, the 
diffusion constant does not necessarily grow. Figure 25.5 taken from ref. [9] il- 
lustrates the fractal dependence of diffusion constant on the map parameter. An 
interpretation of this lack of monotonicity would be interesting. 

You can also try applying periodic orbit theory to the sawtooth map (25.9) for 
a random "generic" value of the parameter A, for example A = 4.5. The idea is 
to bracket this value of A by the nearby ones, for which higher and higher iter- 
ates of the critical value a = A/2 fall onto the partition boundaries, compute the 
exact diffusion constant for each such approximate Markov partition, and study 
their convergence toward the value of D for A = 4.5. Judging how difficult such 
problem is already for a tent map (see sect. 15.5 and appendix E.l), this is too 
ambitious for a week-long exam. 



Problems/projDDifT2.tex 7aug2002 



ChaosBook.org versionl4.4. 1, Apr21 2013 



